Research Article

Intelligent Inventory Control via Ruminative Reinforcement Learning

Algorithm 1

RSarsa algorithm.
(L00) Initialize .
(L01) Observe .
(L02) Determine by policy .
(L03) For each period,
(L04)  observe , and ;
(L05)  determine by policy ;
(L06)  calculate ;
(L07)  update ;
(L08)  for each ,
(L09)   calculate with ,
(L10)    determine ,
(L11)     calculate ,
(L12)    update
(L13)  until ruminated all ;
(L14)  set and
(L15) until termination.