Reinforcement Learning with Probabilistic Boolean Network Models of Smart Grid Devices

<div>Maximum reward obtained for the Fault 1 operation mode of the IPR in one year of operation.</div>

Complexity

Figure 9: Reinforcement Learning with Probabilistic Boolean Network Models of Smart Grid Devices