Deep Multi-Task Conditional and Sequential Learning for Anti-Jamming

Basomingera, Robert; Choi, Young June

Publication Year: 2021-01-01

Publisher: Institute of Electrical and Electronics Engineers Inc.

Citation: IEEE Access, Vol.9, pp.123194-123207

Keyword: Ad hoc network deep learning jamming attack multi-armed bandit spectrum sensing

Mesh Keyword: Continuous actions Multi armed bandit Mutual informations Packet transmissions Sequential learning Single task learning Transmission channels Update algorithms

All Science Classification Codes (ASJC): Computer Science (all)Materials Science (all)Engineering (all)Electrical and Electronic Engineering

Abstract: Multi-task learning provides plenty of room for performance improvement to single-task learning, when learned tasks are related and learned with mutual information. In this work, we analyze the efficiency of using a single-task reinforcement learning algorithm to mitigate jamming attacks with frequency hopping strategy. Our findings show that single-task learning implementations do not always guarantee optimal cumulative reward when some jammer's parameters are unknown, notably the jamming time-slot length in this case. Therefore, to maximize packet transmission in the presence of a jammer whose parameters are unknown, we propose deep multi-task conditional and sequential learning (DMCSL), a multi-task learning algorithm that builds a transition policy to optimize conditional and sequential tasks. For the anti-jamming system, the proposed model learns two tasks: sensing time and transmission channel selection. DMCSL is a composite of the state-of-the-art reinforcement learning algorithms, multi-armed bandit and an extended deep-Q-network. To improve the chance of convergence and optimal cumulative reward of the algorithm, we also propose a continuous action-space update algorithm for sensing time action-space. The simulation results show that DMCSL guarantees better performance than single-task learning by relying on a logarithmically increased action-space sample. Against a random dynamic jamming time-slot, DMCSL achieves about three times better cumulative reward, and against a periodic dynamic jamming time-slot, it improves by 10% the cumulative reward.

ISSN: 2169-3536

Language: eng

URI: https://dspace.ajou.ac.kr/dev/handle/2018.oak/32257

DOI: https://doi.org/10.1109/access.2021.3109856

Fulltext

Type: Article

Funding: This work was supported by the Industrial Infrastructure Program for Fundamental Technologies funded by the Ministry of Trade, Industry and Energy (MOTIE), South Korea, under Grant N0002312.

Show full item record

qrcode

트윗하기

Related Researcher

Choi, Youngjune최영준: Department of Software and Computer Engineering

File Download

There are no files associated with this item.

Related Researcher

Total Views & Downloads

File Download