A Reinforcement Learning Framework for Spiking Networks with Dynamic Synapses

<table>Simulation results for five neurons in the hidden layer and a window size of 5 ms. (a) Values of the reward signal. (b) Distance between the reference and the output signal, <svg height="13.625" id="M136" style="vertical-align:-2.21957pt" version="1.1" viewbox="0 0 52.012501 13.625" width="52.012501" xmlns="http://www.w3.org/2000/svg">
<g transform="matrix(1.25,0,0,-1.25,0,13.625)">
<g transform="translate(72,-61.1)">
<text transform="matrix(1,0,0,-1,-71.95,63.36)">
<tspan style="font-size: 12.50px; " x="0" y="0">𝒟</tspan>
<tspan style="font-size: 12.50px; " x="11.065155" y="0">(</tspan>
<tspan style="font-size: 12.50px; " x="15.228654" y="0">𝐹</tspan>
<tspan style="font-size: 12.50px; " x="22.955507" y="0">,</tspan>
<tspan style="font-size: 12.50px; " x="28.156755" y="0">𝐺</tspan>
<tspan style="font-size: 12.50px; " x="37.333958" y="0">)</tspan>
</text>
</g>
</g>
</svg>. (c) Maximum cross-correlation coefficient observed between the reference and the output signal, <svg height="13.625" id="M137" style="vertical-align:-2.21957pt" version="1.1" viewbox="0 0 52.099998 13.625" width="52.099998" xmlns="http://www.w3.org/2000/svg">
<g transform="matrix(1.25,0,0,-1.25,0,13.625)">
<g transform="translate(72,-61.1)">
<text transform="matrix(1,0,0,-1,-71.95,63.36)">
<tspan style="font-size: 12.50px; " x="0" y="0">𝒳</tspan>
<tspan style="font-size: 12.50px; " x="11.140173" y="0">(</tspan>
<tspan style="font-size: 12.50px; " x="15.303672" y="0">𝐹</tspan>
<tspan style="font-size: 12.50px; " x="23.030525" y="0">,</tspan>
<tspan style="font-size: 12.50px; " x="28.231773" y="0">𝐺</tspan>
<tspan style="font-size: 12.50px; " x="37.408978" y="0">)</tspan>
</text>
</g>
</g>
</svg>. A snapshot of the input/output firing patterns and the internal EPSP of the output neuron during the simulation is given in Figure S1.</table>

Computational Intelligence and Neuroscience

Figure 3: A Reinforcement Learning Framework for Spiking Networks with Dynamic Synapses