Research Article

Two-Phase Iteration for Value Function Approximation and Hyperparameter Optimization in Gaussian-Kernel-Based Adaptive Critic Design

Algorithm 2

Refinement of sample set.
  l = 1         
Calculate and using (6) and (7)
Calculate using (40)
    
Sort all , in ascending order
Add the first state-action pairs to sample set to get extended set
  L + >
Calculate hyperparameters based on
  l = 1     +
Calculate and var using (6) and (7)
Calculate using (40)
    
Sort all , in descending order
Delete the first + samples to get refined set