Research Article
Cooperative Behaviours with Swarm Intelligence in Multirobot Systems for Safety Inspections in Underground Terrains
Algorithm 1
Q-learning algorithm.
Steps: | initialize arbitrarily | repeat (for each episode): | initialize | Repeat (for each step of episode): | Choose from s using policy derived from | Take action , observe | | | until s is terminal |
|