Performance Evaluation of Multiagent Reinforcement Learning Based Training Methods for Swarm Fighting

<table class="algorithm-group"><tr><td><table class="algorithm" id="alg1"><tr><td colspan="2"> <b>Input:</b> Begin // bool value: whether to initialize the environment and start the round.</td></tr><tr><td colspan="2"> <b>Output:</b> Winner of this round.</td></tr><tr><td colspan="2">  Execute this program each frame.</td></tr><tr><td colspan="2">  MaxEnvironmentStep = 8000 // Defines the maximum number of steps in the environment.</td></tr><tr><td colspan="2">  <b>for</b> <i id="M1">i</i> <span id="M2">&le;</span> MaxEnvironmentStep &amp;&amp; Begin <b>do</b></td></tr><tr><td colspan="2">    <b>++</b><i id="M3">i</i></td></tr><tr><td colspan="2">    <b>if</b> Red.num == 0 &amp;&amp; Green.num == 0 <b>then</b></td></tr><tr><td colspan="2">      <b>return</b> tie</td></tr><tr><td colspan="2">    <b>else if</b> Red.num == 0 <b>then</b></td></tr><tr><td colspan="2">      <b>return</b> Green.win</td></tr><tr><td colspan="2">    <b>else if</b> Green.num == 0 <b>then</b></td></tr><tr><td colspan="2">      <b>return</b> Red.win</td></tr><tr><td colspan="2"> <b>return</b> null</td></tr></table></td></tr></table>
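The per-frame winner check in Algorithm 1 can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation: the function names `judge_frame` and `run_round` and the string labels are hypothetical. It reads the final branch as "Red wins once Green has no agents left", and it assumes the round simply continues while both sides still have agents.

```python
from typing import Iterable, Optional, Tuple

MAX_ENVIRONMENT_STEP = 8000  # step cap taken from Algorithm 1


def judge_frame(red_num: int, green_num: int, step: int, begin: bool,
                max_step: int = MAX_ENVIRONMENT_STEP) -> Optional[str]:
    """Return "tie", "green", or "red" once the round is decided, else None.

    Hypothetical helper: names and return labels are illustrative only.
    """
    if not begin or step > max_step:
        return None  # round not running, or step budget exhausted
    if red_num == 0 and green_num == 0:
        return "tie"
    if red_num == 0:
        return "green"
    if green_num == 0:  # assumption: Red wins only when Green is wiped out
        return "red"
    return None  # both sides still have agents, so keep playing


def run_round(frames: Iterable[Tuple[int, int]], begin: bool = True,
              max_step: int = MAX_ENVIRONMENT_STEP) -> Optional[str]:
    """Check (red_num, green_num) counts frame by frame until a result appears."""
    for step, (red_num, green_num) in enumerate(frames, start=1):
        if step > max_step:
            break  # step budget exhausted without a winner
        result = judge_frame(red_num, green_num, step, begin, max_step)
        if result is not None:
            return result
    return None  # mirrors the final "return null" in Algorithm 1
```

For example, `run_round([(3, 3), (2, 1), (2, 0)])` is decided on the third frame, when Green's count reaches zero. A round that hits the step cap with agents left on both sides yields `None`.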

Wireless Communications and Mobile Computing

Algorithm 1: Performance Evaluation of Multiagent Reinforcement Learning Based Training Methods for Swarm Fighting