Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning

<table class="figure-group"><tr class="fig-image" id="a"><td><object data="https://static.hindawi.com/articles/cin/volume-2016/4824072/figures/4824072.fig.005a.svgz" name="4824072.fig.005a" type="image/svg+xml"></object></td></tr><tr class="fig-caption"><td><b>(a) </b>Prediction of the angle at next state</td></tr><tr class="fig-image" id="b"><td><object data="https://static.hindawi.com/articles/cin/volume-2016/4824072/figures/4824072.fig.005b.svgz" name="4824072.fig.005b" type="image/svg+xml"></object></td></tr><tr class="fig-caption"><td><b>(b) </b>Prediction of the angular velocity at next state</td></tr><tr class="fig-image" id="c"><td><object data="https://static.hindawi.com/articles/cin/volume-2016/4824072/figures/4824072.fig.005c.svgz" name="4824072.fig.005c" type="image/svg+xml"></object></td></tr><tr class="fig-caption"><td><b>(c) </b>Prediction of the reward</td></tr></table>

<div>Prediction of the next state and reward according to the global model.</div>

Computational Intelligence and Neuroscience

fig5

Figure 5

Figure 5: Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning