Research Article
Intelligent Inventory Control via Ruminative Reinforcement Learning
Table 2
Experimental results.
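| Period | Line | | | RSarsa.TD | PRS.Beta |
| Early periods | 1 | P1 | RSarsa | 0.49 | 0.16 |
| Early periods | 2 | P1 | PRS | W | W |
| Early periods | 3 | P2 | RSarsa | 0.95 | W |
| Early periods | 4 | P2 | PRS | 0.10 | W |
| Early periods | 5 | P3 | RSarsa | 0.80 | 0.37 |
| Early periods | 6 | P3 | PRS | 0.26 | W |
| Later periods | 7 | P1 | RSarsa | W | W |
| Later periods | 8 | P1 | PRS | 0.63 | 0.14 |
| Later periods | 9 | P2 | RSarsa | 0.46 | W |
| Later periods | 10 | P2 | PRS | 0.97 | 0.12 |
| Later periods | 11 | P3 | RSarsa | 0.66 | 0.26 |
| Later periods | 12 | P3 | PRS | 0.60 | 0.18 |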