Research Article

EAQR: A Multiagent Q-Learning Algorithm for Coordination of Multiple Agents

Table 8

Maximal steps for the DSN problem (evaluation episodes = 5000).

 = 10,000 = 50,000 = 100,000

EAQR4.813.933.77
WoLF-PHC5.345.345.67
EMA Q-learning4.724.944.80
Single-agent RL6.006.005.66