Distributed Intelligent Learning and Decision Model Based on Logic Predictive Control

<table class="algorithm-group"><tr><td><table class="algorithm" id="alg2"><tr><td>(1)</td><td>Initialization.</td></tr><tr><td>(2)</td><td>Repeat the following steps:</td></tr><tr><td>(a)</td><td>Observe the current environmental state <i>s</i>.</td></tr><tr><td>(b)</td><td>Select actions according to the improved action selection mechanism.</td></tr><tr><td>(c)</td><td>Observe and record the new state <i>s</i>′, individual action <i>A</i>, other agent actions, and instant reward; then, update the state prediction function and state transition probability function, and calculate the strategy change function of agent <i>J</i>.</td></tr><tr><td>(d)</td><td>Enter (e) if the target is reached; otherwise, return to (a).</td></tr><tr><td>(e)</td><td>Compute the <i>Q</i> function using the priority scan algorithm.</td></tr></table></td></tr></table>

<div> Model prediction action selection mechanism algorithm.</div>

Computational Intelligence and Neuroscience

alg2

Algorithm 2

Algorithm 2: Distributed Intelligent Learning and Decision Model Based on Logic Predictive Control