Citation Export
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Basomingera, Robert | - |
dc.contributor.author | Choi, Young June (researcherId=7406117220; isni=0000000405323933; orcid=https://orcid.org/0000-0003-2014-6587) | - |
dc.date.issued | 2021-01-01 | - |
dc.identifier.issn | 2169-3536 | - |
dc.identifier.uri | https://dspace.ajou.ac.kr/dev/handle/2018.oak/32257 | - |
dc.description.abstract | Multi-task learning provides plenty of room for performance improvement to single-task learning, when learned tasks are related and learned with mutual information. In this work, we analyze the efficiency of using a single-task reinforcement learning algorithm to mitigate jamming attacks with frequency hopping strategy. Our findings show that single-task learning implementations do not always guarantee optimal cumulative reward when some jammer's parameters are unknown, notably the jamming time-slot length in this case. Therefore, to maximize packet transmission in the presence of a jammer whose parameters are unknown, we propose deep multi-task conditional and sequential learning (DMCSL), a multi-task learning algorithm that builds a transition policy to optimize conditional and sequential tasks. For the anti-jamming system, the proposed model learns two tasks: sensing time and transmission channel selection. DMCSL is a composite of the state-of-the-art reinforcement learning algorithms, multi-armed bandit and an extended deep-Q-network. To improve the chance of convergence and optimal cumulative reward of the algorithm, we also propose a continuous action-space update algorithm for sensing time action-space. The simulation results show that DMCSL guarantees better performance than single-task learning by relying on a logarithmically increased action-space sample. Against a random dynamic jamming time-slot, DMCSL achieves about three times better cumulative reward, and against a periodic dynamic jamming time-slot, it improves by 10% the cumulative reward. | - |
dc.description.sponsorship | This work was supported by the Industrial Infrastructure Program for Fundamental Technologies funded by the Ministry of Trade, Industry and Energy (MOTIE), South Korea, under Grant N0002312. | - |
dc.language.iso | eng | - |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
dc.subject.mesh | Continuous actions | - |
dc.subject.mesh | Multi armed bandit | - |
dc.subject.mesh | Mutual informations | - |
dc.subject.mesh | Packet transmissions | - |
dc.subject.mesh | Sequential learning | - |
dc.subject.mesh | Single task learning | - |
dc.subject.mesh | Transmission channels | - |
dc.subject.mesh | Update algorithms | - |
dc.title | Deep Multi-Task Conditional and Sequential Learning for Anti-Jamming | - |
dc.type | Article | - |
dc.citation.endPage | 123207 | - |
dc.citation.startPage | 123194 | - |
dc.citation.title | IEEE Access | - |
dc.citation.volume | 9 | - |
dc.identifier.bibliographicCitation | IEEE Access, Vol.9, pp.123194-123207 | - |
dc.identifier.doi | 10.1109/access.2021.3109856 | - |
dc.identifier.scopusid | 2-s2.0-85114722848 | - |
dc.identifier.url | http://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639 | - |
dc.subject.keyword | Ad hoc network | - |
dc.subject.keyword | deep learning | - |
dc.subject.keyword | jamming attack | - |
dc.subject.keyword | multi-armed bandit | - |
dc.subject.keyword | spectrum sensing | - |
dc.description.isoa | true | - |
dc.subject.subarea | Computer Science (all) | - |
dc.subject.subarea | Materials Science (all) | - |
dc.subject.subarea | Engineering (all) | - |
dc.subject.subarea | Electrical and Electronic Engineering | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.