Research Article

Optimal Wireless Information and Power Transfer Using Deep Q-Network

Figure 10

The comparison of average time consumption between DQN and other algorithms (myopic solution, multiarmed bandit, even power allocation, and random action selection) when the number of energy harvesters is . The number of transmit antennas is . The standard deviation of the channel amplitude is .