A Novel Motion-Intelligence-Based Control Algorithm for Object Tracking by Controlling PAN-Tilt Automatically

<figure class="algorithm-group"><table class="algorithm" id="alg2"><tr><td colspan="2">Get the memory pool <i>D</i></td></tr><tr><td colspan="2">Initialize Q-value network with random <svg height="9.49473pt" id="M168" style="vertical-align:-0.2063999pt" version="1.1" viewbox="-0.0498162 -9.28833 6.59789 9.49473" width="6.59789pt" xmlns="http://www.w3.org/2000/svg"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M475 507C475 612 440 712 326 712C139 712 23 420 23 215C23 96 58 -12 180 -12C369 -12 475 293 475 507ZM391 522C391 486 387 448 379 394H126C155 538 222 677 310 677C386 677 391 571 391 522ZM373 346C344 193 283 22 189 22C126 22 106 114 106 196C106 243 111 293 118 346H373Z"></path></g></svg></td></tr><tr><td colspan="2"><b>For</b> epochs = 1, 1000,000 <b>do</b></td></tr><tr><td colspan="2"><span style="margin-left:1.8888888888888888em"></span>Sample random from memory pool to get 50 samples</td></tr><tr><td colspan="2"><span style="margin-left:1.8888888888888888em"></span>Perform a gradient descent step on equation (<a href="https://static.hindawi.com/articles/mpe/volume-2019/9602460/figures/#EEq21">21</a>) respect to the network parameters <svg height="9.49473pt" id="M169" style="vertical-align:-0.2063999pt" version="1.1" viewbox="-0.0498162 -9.28833 6.59789 9.49473" width="6.59789pt" xmlns="http://www.w3.org/2000/svg"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M475 507C475 612 440 712 326 712C139 712 23 420 23 215C23 96 58 -12 180 -12C369 -12 475 293 475 507ZM391 522C391 486 387 448 379 394H126C155 538 222 677 310 677C386 677 391 571 391 522ZM373 346C344 193 283 22 189 22C126 22 106 114 106 196C106 243 111 293 118 346H373Z"></path></g></svg></td></tr><tr><td colspan="2"><span style="margin-left:1.8888888888888888em"></span>Every 100 steps, clone the Q-value network to obtain the target network Q-value network</td></tr><tr><td colspan="2"><b>End For</b></td></tr></table></figure>

<p>Memory replay for Q-value network.</p>

Mathematical Problems in Engineering

alg2

Algorithm 2

Algorithm 2: A Novel Motion-Intelligence-Based Control Algorithm for Object Tracking by Controlling PAN-Tilt Automatically 

Get the memory pool D
Initialize Q-value network with random
For epochs = 1, 1000,000 do
Sample random from memory pool to get 50 samples
Perform a gradient descent step on equation (21) respect to the network parameters
Every 100 steps, clone the Q-value network to obtain the target network Q-value network
End For