Computational Intelligence and Neuroscience

Research Article

A Path-Planning Approach Based on Potential and Dynamic Q-Learning for Mobile Robots in Unknown Environment

Classical Q-learning pseudocode.

(1)	Initiate and in Q-table to zero
(2)	Select a starting state
(3)	for each step of episode do
(4)	while is not terminal do
(5)	Select and execute it
(6)	Receive an immediate reward
(7)	Observe the next state
(8)	Update the table entry by
(9)
(10)
(11)	Until is the terminal state
(12)	end while
(13)	end for
(14)	Output optimal policy