Research Article
Novel Learning Algorithms for Efficient Mobile Sink Data Collection Using Reinforcement Learning in Wireless Sensor Network
For each state-action pair | Initialize the table entry to zero | Observe the current state s | Do loop: | Select an action a and execute it | Receive immediate reward r | Observe the new state | Update the table entry for as follows: | =r+ ϒ max | s= | Selected action: | (s) =arg max a Q(s, a) | Exploration strategy | | End Loop |
|