Wireless Communications and Mobile Computing

Research Article

Over-the-Air Computation with Quantized CSI and Discrete Power Control Levels

AirComp-DRL Training for MSE Minimization.

Require:
Ensure:
Initialize , and
Initialize a Replay Memory $\mathcal{D}$
Initialize action-value function $Q$ with random weights $\theta$
Load -level Channel Quantizer
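for each episode do
    Draw Channel Coefs
    Assign Power Levels randomly
    Form the initial state $s_t$ ▷ Initial State Quantization
    while the episode has not ended do
        Select a random number $u \in [0, 1]$
        if $u < \epsilon$ then ▷ Exploration
            Select a random action $a_t$
        else ▷ Exploitation
            $a_t = \arg\max_{a} Q(s_t, a; \theta)$
        end if
        Take action $a_t$, Observe reward $r_t$ and state $s_{t+1}$
        Store transition $(s_t, a_t, r_t, s_{t+1})$ in Experience Replay $\mathcal{D}$
        Select random minibatch of transitions $(s_j, a_j, r_j, s_{j+1})$ from $\mathcal{D}$
        Set $y_j = r_j + \gamma \max_{a'} Q(s_{j+1}, a'; \theta)$
        Perform Gradient Descent on $(y_j - Q(s_j, a_j; \theta))^2$
    end while
    $\epsilon$-greedy decaying
end for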
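
The training loop above can be illustrated with a minimal Python sketch. It is not the authors' implementation: a lookup table stands in for the neural action-value function, and the power levels, quantizer thresholds, channel distribution, reward (negative MSE with the receive scaling fixed to 1), and all hyperparameters are hypothetical values chosen only to make the example run.

```python
import itertools
import random
from collections import deque

# All constants below are hypothetical, chosen for illustration only.
POWER_LEVELS = [0.25, 1.0, 4.0]   # assumed discrete transmit power levels
QUANT_EDGES = [0.75, 1.5]         # assumed thresholds of a 3-level channel quantizer
NUM_DEVICES = 2
# A joint action picks one power-level index per device.
ACTIONS = list(itertools.product(range(len(POWER_LEVELS)), repeat=NUM_DEVICES))


def quantize(h):
    """Map a channel magnitude to the index of its quantization bin."""
    return sum(h > edge for edge in QUANT_EDGES)


def mse(channels, levels, noise_var=0.01):
    """Toy aggregation error; receive scaling is fixed to 1 (an assumption)."""
    return sum((POWER_LEVELS[l] ** 0.5 * h - 1.0) ** 2
               for h, l in zip(channels, levels)) + noise_var


def select_action(q, state, epsilon, rng):
    """Epsilon-greedy: explore with probability epsilon, otherwise exploit."""
    if rng.random() < epsilon:
        return rng.randrange(len(ACTIONS))
    return max(range(len(ACTIONS)), key=lambda a: q.get((state, a), 0.0))


def train(episodes=300, steps=10, gamma=0.9, lr=0.1, eps=1.0,
          eps_decay=0.99, eps_min=0.05, batch_size=8, seed=0):
    rng = random.Random(seed)
    memory = deque(maxlen=1000)   # replay memory
    q = {}                        # tabular stand-in for the action-value network
    for _ in range(episodes):
        # Draw channel coefficients and assign power levels randomly.
        channels = [rng.uniform(0.2, 2.0) for _ in range(NUM_DEVICES)]
        levels = tuple(rng.randrange(len(POWER_LEVELS)) for _ in range(NUM_DEVICES))
        # Initial state: quantized channels plus current power levels.
        state = (tuple(quantize(h) for h in channels), levels)
        for _ in range(steps):
            action = select_action(q, state, eps, rng)
            levels = ACTIONS[action]
            reward = -mse(channels, levels)
            next_state = (state[0], levels)
            memory.append((state, action, reward, next_state))
            # Sample a random minibatch and regress each entry toward its target.
            batch = rng.sample(list(memory), min(batch_size, len(memory)))
            for s, a, r, s2 in batch:
                target = r + gamma * max(q.get((s2, b), 0.0)
                                         for b in range(len(ACTIONS)))
                old = q.get((s, a), 0.0)
                # Gradient step on 0.5 * (target - Q)^2 with respect to Q.
                q[(s, a)] = old + lr * (target - old)
            state = next_state
        eps = max(eps_min, eps * eps_decay)   # epsilon-greedy decay
    return q, eps
```

Calling `q, eps = train()` returns the learned table and the final exploration rate. In a deep variant, the table update would be replaced by one optimizer step on the same squared error with respect to the network weights.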