Research Article
An Energy Management Strategy for a Super-Mild Hybrid Electric Vehicle Based on a Known Model of Reinforcement Learning
Algorithm 1
Flowchart of the PI algorithm.
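Policy iteration (PI) algorithm
1. Input: state transition probability $P$, return function $R$, discount factor $\gamma$.
   Initialize the value function: $V(s) = 0$.
   Initialize the policy $\pi_0$.
2. Repeat for $k = 0, 1, \ldots$
3. Policy evaluation: find the value function of the current policy $\pi_k$.
4. Policy improvement: obtain the improved policy $\pi_{k+1}$ from that value function.
5. Until $\pi_{k+1} = \pi_k$.
6. Output: $\pi^* = \pi_k$.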
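The PI steps listed above can be illustrated with a minimal tabular sketch. It is not the authors' implementation: the array layout (`P[s, a, s']` for the state transition probability, `R[s, a]` for the return function), the names `theta` and `max_iter`, the iterative evaluation loop, and the two-state toy model are all assumptions introduced here for illustration, and the toy model has nothing to do with the vehicle model of the paper.

```python
import numpy as np


def policy_iteration(P, R, gamma, theta=1e-8, max_iter=1000):
    """Tabular policy iteration (illustrative sketch).

    P: array of shape (S, A, S), P[s, a, s2] = transition probability (assumed layout).
    R: array of shape (S, A), expected immediate return (assumed layout).
    gamma: discount factor, 0 <= gamma < 1.
    theta, max_iter: hypothetical stopping parameters, not from the source.
    """
    n_states = P.shape[0]
    V = np.zeros(n_states)                  # initialization: V(s) = 0
    policy = np.zeros(n_states, dtype=int)  # initialization: arbitrary policy
    idx = np.arange(n_states)

    for _ in range(max_iter):
        # Policy evaluation: iterate the Bellman expectation backup
        # until the value function stops changing.
        while True:
            V_new = R[idx, policy] + gamma * P[idx, policy] @ V
            delta = np.max(np.abs(V_new - V))
            V = V_new
            if delta < theta:
                break

        # Policy improvement: act greedily with respect to V.
        Q = R + gamma * (P @ V)             # shape (S, A)
        new_policy = np.argmax(Q, axis=1)

        # Stop when the policy no longer changes.
        if np.array_equal(new_policy, policy):
            break
        policy = new_policy

    return policy, V


# Hypothetical two-state, two-action model used only to exercise the function.
P_toy = np.array([
    [[1.0, 0.0], [0.0, 1.0]],   # state 0: action 0 stays, action 1 moves to state 1
    [[0.0, 1.0], [1.0, 0.0]],   # state 1: action 0 stays, action 1 moves to state 0
])
R_toy = np.array([
    [0.0, 1.0],
    [2.0, 0.0],
])
policy_toy, V_toy = policy_iteration(P_toy, R_toy, gamma=0.9)
print(policy_toy, V_toy)
```

In this toy model, staying in state 1 earns a return of 2 at every step, so the greedy policy moves from state 0 to state 1 and then stays there. The evaluation step here is iterative; solving the linear system $(I - \gamma P^{\pi}) V = R^{\pi}$ directly would be an equally valid choice for small state spaces.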