Research Article
Two-Phase Iteration for Value Function Approximation and Hyperparameter Optimization in Gaussian-Kernel-Based Adaptive Critic Design
Algorithm 2: Refinement of sample set.
l = 1
Calculate and using (6) and (7)
Calculate using (40)
Sort all , in ascending order
Add the first state-action pairs to sample set to get extended set
L + >
Calculate hyperparameters based on
l = 1 +
Calculate and var using (6) and (7)
Calculate using (40)
Sort all , in descending order
Delete the first + − samples to get refined set
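The two passes of Algorithm 2 (extend the sample set with the lowest-ranked candidates, then prune the highest-ranked samples) can be sketched in Python as follows. This is only an illustrative sketch under stated assumptions, not the authors' implementation. The quantity computed with (40) is not reproduced in this section, so it is represented by a user-supplied callable `score`. The hyperparameter update is likewise a user-supplied callable `fit`. The names `refine_sample_set`, `params`, `n_add` and `max_size` are hypothetical, and the size limit `max_size` is an assumed reading of the condition that triggers the pruning pass.

```python
from typing import Any, Callable, List, Sequence


def refine_sample_set(
    samples: Sequence[Any],
    candidates: Sequence[Any],
    params: Any,
    score: Callable[[Any, Sequence[Any], Any], float],
    fit: Callable[[Sequence[Any]], Any],
    n_add: int,
    max_size: int,
) -> List[Any]:
    """Extend a sample set with candidates, then prune it to max_size.

    All arguments are hypothetical stand-ins:
      score(params, reference_set, pair) plays the role of the quantity in (40),
      fit(sample_set) plays the role of the hyperparameter calculation,
      max_size is an assumed upper bound on the sample set size.
    """
    # Pass 1: rank the candidate state-action pairs in ascending order
    # and add the first n_add of them to the sample set.
    ranked = sorted(candidates, key=lambda pair: score(params, samples, pair))
    extended = list(samples) + ranked[:n_add]

    # If the extended set respects the assumed size limit, nothing is pruned.
    if len(extended) <= max_size:
        return extended

    # Pass 2: recompute the hyperparameters on the extended set, rank all of
    # its samples in descending order and delete the first surplus samples.
    params = fit(extended)
    ranked_ext = sorted(
        extended,
        key=lambda pair: score(params, extended, pair),
        reverse=True,
    )
    n_delete = len(extended) - max_size
    return ranked_ext[n_delete:]
```

In this sketch the refined set is returned in ranked order rather than in the original sample order. A caller that needs the original order would have to keep track of indices instead of the samples themselves.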