Research Article

Cooperative Behaviours with Swarm Intelligence in Multirobot Systems for Safety Inspections in Underground Terrains

Algorithm 1

Q-learning algorithm.
Steps:
initialize arbitrarily
repeat (for each episode):
   initialize
   Repeat (for each step of episode):
    Choose from s using policy derived from
    Take action , observe
     
    
    until s is terminal