Research Article

Reinforcement Learning for Routing in Cognitive Radio Ad Hoc Networks

Algorithm 1

Dynamic softmax algorithm at an SU node.
initialize  
while (updating a new Q-value)
    if (...) then ...
    else if (...) then ...
    else if (...) then ...
    end if
end
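
The general idea behind a softmax scheme with an adjustable temperature can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the algorithm as specified in Algorithm 1: the conditions and update steps of Algorithm 1 are not reproduced here, so the rule in `adjust_temperature` is a hypothetical stand-in. The function names, the `threshold`, `step`, `t_min` and `t_max` parameters, and the convention that a higher Q-value marks a better next hop are all assumptions made for the example.

```python
import math
import random


def softmax_probabilities(q_values, temperature):
    """Return Boltzmann selection probabilities for a list of Q-values."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    # Subtract the maximum before exponentiating for numerical stability.
    q_max = max(q_values)
    weights = [math.exp((q - q_max) / temperature) for q in q_values]
    total = sum(weights)
    return [w / total for w in weights]


def select_next_hop(q_values, temperature, rng=random):
    """Sample the index of a next hop according to the softmax probabilities."""
    probs = softmax_probabilities(q_values, temperature)
    return rng.choices(range(len(q_values)), weights=probs, k=1)[0]


def adjust_temperature(temperature, q_old, q_new, threshold=0.1,
                       step=0.05, t_min=0.05, t_max=5.0):
    """Hypothetical rule (an assumption, not taken from Algorithm 1):
    raise the temperature while the updated Q-value is still changing a lot,
    lower it once the change is small."""
    change = abs(q_new - q_old)
    if change > threshold:
        temperature += step   # Q-value unstable: explore more
    elif change < threshold:
        temperature -= step   # Q-value settling: exploit more
    # Keep the temperature inside a fixed positive range.
    return min(t_max, max(t_min, temperature))
```

A high temperature makes the selection probabilities nearly uniform (more exploration), while a low temperature concentrates them on the next hop with the largest Q-value (more exploitation). In this sketch, `adjust_temperature` would be called each time a Q-value is updated, and `select_next_hop` each time a packet is forwarded.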