Research Article

Constructing Temporally Extended Actions through Incremental Community Detection

Algorithm 2

Option learning with evolving communities.
(1)  Initial condition: learning agent with primitive options
(2)Start option generation;
(3)  while true do
(4)   Sampling trajectories;
(5)   if condition for option discovery is met then
(6)    Use Louvain algorithm to detect communities;
(7)    Construct options from communities, having their and specified;
(8)    Take ER to form options’ internal policies ;
(9)    Add options to set;
(10)   break;
(11)Start learning;
(12) while not converged do
(13)  Perform Intra-Option Learning with the current option sets;
(14)  if reaches incremental processing cycle then
(15)    Call Algorithm 1 to update communities;
(16)    Reconstruct options from updated communities;
(17)    for reconstructed options, previous options do
(18)     if and then
(19)      
(20)    Take ER to improve options’ internal policies ;
(21)    Add updated options to set.