Research Article
A DRL-Based Intelligent Jamming Approach for Joint Channel and Power Optimization
Algorithm 1
Channel and power joint decision jamming algorithm based on parallel DQN (CPJ-PDQN).
Initialization: | The parameters of channel network and power network are initialized to random value. | Initialize the replay memory , and the iteration time . | For do | Sense to construct state . | Calculate Q value and , select jamming channel and jamming power according to the strategy. | Execute joint action and emit jamming signals. | Calculate its reward in Equation (8). | Obtain by spectrum sensing. Update the next state , add the experience in replay memory and add the experience in replay memory . | If and do | Sample minibatch of from and | from according to the PER method | by Equations (15)–(18). Calculate loss value according to Equation (19). | Update network parameters and by SGD. | Update target network every iteration. | End If | End For |
|