Research Article

Novel Learning Algorithms for Efficient Mobile Sink Data Collection Using Reinforcement Learning in Wireless Sensor Network

Algorithm 1

For each  state-action pair
Initialize  the table entry to zero
Observe the current state s
Do loop:
Select an action a and execute it
Receive immediate reward r
Observe the new state
Update the table entry for as follows:
=r+ ϒ max
s=
Selected action:
(s) =arg max a Q(s, a)
Exploration strategy
End Loop