Research Article

Deep Reinforcement Learning-Based Trading Strategy for Load Aggregators on Price-Responsive Demand

Table 2

Description of the parameters of the DDPG.

ParametersMeaningValue

TAUSmoothing coefficient of target network in actor and critic network0.001
αActor network and critic network learning rate0.0005
Batch_sizeNumber drawn from the experience pool per training64
CapacitySize of the experience pool100000
γDiscount factor0.99