Research Article

Over-the-Air Computation with Quantized CSI and Discrete Power Control Levels

Algorithm 1: AirComp-DRL Training for MSE Minimization.
Require:
Ensure:
  Initialize , and
  Initialize a Replay Memory
  Initialize action-value function with random weights
  Load -level Channel Quantizer
 for each training episode do
   Draw Channel Coefs
   Assign Power Levels randomly
   ,      Initial State Quantization
  while the episode has not terminated do
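    Select a random number uniformly from [0, 1]
   if the random number is smaller than ε then               Exploration
     Select a random action
   else                 Exploitation
     Select the action with the largest action-value in the current state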
   end if
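    Take the selected action, observe the reward and the next state
    Store the transition in the Replay Memory
    Select a random minibatch of transitions from the Replay Memory
    Set the target value for each transition in the minibatch
    Perform a Gradient Descent step on the squared error between the target and the action-value function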
    
  end while
            ε-greedy decaying
end for
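
The control flow of Algorithm 1 can be illustrated with a small, self-contained sketch. It is not the authors' implementation. A lookup table stands in for the action-value network, and the number of devices, the power levels, the quantizer thresholds, the action set (move one device's power one level up or down), the fixed unit receive scaling inside `mse`, and all hyperparameters are assumptions chosen only to keep the example runnable.

```python
import random
from collections import deque

# Every name and value below is an illustrative assumption, not taken from the paper.
K = 2                                    # number of devices
POWER_LEVELS = [0.25, 1.0, 4.0]          # discrete transmit power levels
QUANT_EDGES = [0.75, 1.5]                # thresholds of a 3-level channel quantizer
NOISE_VAR = 0.1                          # receiver noise variance
GAMMA, ALPHA = 0.9, 0.1                  # discount factor, learning rate
EPS_START, EPS_MIN, EPS_DECAY = 1.0, 0.05, 0.99
ACTIONS = [(k, d) for k in range(K) for d in (-1, +1)]  # device k one level down/up


def quantize(h):
    """Map a channel magnitude to the index of its quantization bin."""
    return sum(h > edge for edge in QUANT_EDGES)


def mse(h, levels):
    """Toy AirComp MSE: per-device amplitude misalignment plus receiver noise."""
    misalignment = sum((hk * POWER_LEVELS[l] ** 0.5 - 1.0) ** 2
                       for hk, l in zip(h, levels))
    return misalignment + NOISE_VAR


def step(h, levels, action):
    """Apply an action, clip to the valid level range, return (levels, reward)."""
    k, d = action
    new = list(levels)
    new[k] = min(max(new[k] + d, 0), len(POWER_LEVELS) - 1)
    return tuple(new), -mse(h, new)


def train(episodes=300, steps=20, batch=16, seed=0):
    rng = random.Random(seed)
    memory = deque(maxlen=5000)          # replay memory
    q = {}                               # tabular stand-in for the action-value network
    eps = EPS_START
    for _ in range(episodes):
        h = [rng.uniform(0.3, 2.0) for _ in range(K)]          # draw channel magnitudes
        levels = tuple(rng.randrange(len(POWER_LEVELS)) for _ in range(K))
        hq = tuple(quantize(x) for x in h)                      # initial state quantization
        state = (hq, levels)
        for _ in range(steps):
            if rng.random() < eps:       # exploration
                a = rng.randrange(len(ACTIONS))
            else:                        # exploitation
                a = max(range(len(ACTIONS)), key=lambda i: q.get((state, i), 0.0))
            levels, reward = step(h, levels, ACTIONS[a])
            nxt = (hq, levels)
            memory.append((state, a, reward, nxt))
            minibatch = rng.sample(list(memory), min(batch, len(memory)))
            for s, b, r, s2 in minibatch:
                target = r + GAMMA * max(q.get((s2, i), 0.0)
                                         for i in range(len(ACTIONS)))
                old = q.get((s, b), 0.0)
                # Gradient step on the squared error between target and table entry.
                q[(s, b)] = old + ALPHA * (target - old)
            state = nxt
        eps = max(EPS_MIN, eps * EPS_DECAY)   # epsilon-greedy decay after each episode
    return q, eps
```

Calling `train()` returns the learned table and the final exploration rate. Replacing the dictionary `q` with a neural network and the tabular update with an optimizer step would move the sketch closer to a deep Q-learning setup, at the cost of an external dependency.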