Research Article

A Reinforcement Learning-Based Maximum Power Point Tracking Method for Photovoltaic Array

Table 2

Percentage of times each action was chosen in different phases.

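| Phase  | Interval (minutes) | +5 V | +2 V | +0.5 V | −0.5 V | −2 V | −5 V |
|--------|--------------------|------|------|--------|--------|------|------|
| Early  | 0~25               | 28%  | 8%   | 12%    | 8%     | 12%  | 32%  |
| Middle | 100~125            | 8%   | 16%  | 24%    | 20%    | 20%  | 12%  |
| Final  | 215~240            | 4%   | 12%  | 40%    | 32%    | 8%   | 4%   |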
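
One way to read Table 2 is to reduce each row to a single number. The sketch below transcribes the percentages from the table and computes two summaries that are our own illustrative choices, not quantities defined in the paper: the probability-weighted mean step magnitude, and the share of selections that used the smallest steps (±0.5 V). The names `mean_abs_step` and `small_step_share` are hypothetical helpers introduced only for this example.

```python
# Action set (volts) and selection percentages transcribed from Table 2.
ACTIONS_V = [5.0, 2.0, 0.5, -0.5, -2.0, -5.0]
SELECTION_PCT = {
    "Early (0~25 min)": [28, 8, 12, 8, 12, 32],
    "Middle (100~125 min)": [8, 16, 24, 20, 20, 12],
    "Final (215~240 min)": [4, 12, 40, 32, 8, 4],
}


def mean_abs_step(percentages, actions=ACTIONS_V):
    """Probability-weighted mean |action| in volts.

    Assumption: this summary statistic is an illustrative choice,
    not a metric reported in the paper.
    """
    if sum(percentages) != 100:
        raise ValueError("selection percentages must sum to 100")
    return sum(p * abs(a) for p, a in zip(percentages, actions)) / 100.0


def small_step_share(percentages, actions=ACTIONS_V, threshold=0.5):
    """Percentage of selections whose magnitude is at most `threshold` volts."""
    return sum(p for p, a in zip(percentages, actions) if abs(a) <= threshold)


if __name__ == "__main__":
    for phase, pct in SELECTION_PCT.items():
        print(f"{phase}: mean |step| = {mean_abs_step(pct):.2f} V, "
              f"small-step share = {small_step_share(pct)}%")
```

Under these assumed summaries, the mean step magnitude falls from 3.5 V in the early phase to 1.94 V in the middle phase and 1.16 V in the final phase, while the share of ±0.5 V actions rises from 20% to 44% to 72%. This matches what the raw percentages suggest: large steps dominate at first and small steps dominate later.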