Ajou University repository

Deep Multi-Task Conditional and Sequential Learning for Anti-Jammingoa mark
Citations

SCOPUS

2

Citation Export

Publication Year
2021-01-01
Publisher
Institute of Electrical and Electronics Engineers Inc.
Citation
IEEE Access, Vol.9, pp.123194-123207
Keyword
Ad hoc networkdeep learningjamming attackmulti-armed banditspectrum sensing
Mesh Keyword
Continuous actionsMulti armed banditMutual informationsPacket transmissionsSequential learningSingle task learningTransmission channelsUpdate algorithms
All Science Classification Codes (ASJC)
Computer Science (all)Materials Science (all)Engineering (all)Electrical and Electronic Engineering
Abstract
Multi-task learning provides plenty of room for performance improvement to single-task learning, when learned tasks are related and learned with mutual information. In this work, we analyze the efficiency of using a single-task reinforcement learning algorithm to mitigate jamming attacks with frequency hopping strategy. Our findings show that single-task learning implementations do not always guarantee optimal cumulative reward when some jammer's parameters are unknown, notably the jamming time-slot length in this case. Therefore, to maximize packet transmission in the presence of a jammer whose parameters are unknown, we propose deep multi-task conditional and sequential learning (DMCSL), a multi-task learning algorithm that builds a transition policy to optimize conditional and sequential tasks. For the anti-jamming system, the proposed model learns two tasks: sensing time and transmission channel selection. DMCSL is a composite of the state-of-the-art reinforcement learning algorithms, multi-armed bandit and an extended deep-Q-network. To improve the chance of convergence and optimal cumulative reward of the algorithm, we also propose a continuous action-space update algorithm for sensing time action-space. The simulation results show that DMCSL guarantees better performance than single-task learning by relying on a logarithmically increased action-space sample. Against a random dynamic jamming time-slot, DMCSL achieves about three times better cumulative reward, and against a periodic dynamic jamming time-slot, it improves by 10% the cumulative reward.
ISSN
2169-3536
Language
eng
URI
https://dspace.ajou.ac.kr/dev/handle/2018.oak/32257
DOI
https://doi.org/10.1109/access.2021.3109856
Fulltext

Type
Article
Funding
This work was supported by the Industrial Infrastructure Program for Fundamental Technologies funded by the Ministry of Trade, Industry and Energy (MOTIE), South Korea, under Grant N0002312.
Show full item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Choi, Youngjune Image
Choi, Youngjune최영준
Department of Software and Computer Engineering
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.