Research Article
Dynamic Routing Strategy for Directed Transmissions of High-Valued Contents in NGSO Satellite-Based Internet
Algorithm 1
-learning in unicast routing.
1: Q(S,A) Connect Matrix | 2: fordo | 3: S(1) Begin State. | 4: for1,2,…end do | 5: if Random then | 6: A Max(Q(S(), A())). | 7: else. | 8: A(1) Random(Q(S(), A())). | 9: end if. | 10: S S | 11: if S satisfy the constraints for (1), (2) and (3) | then | 12: Reward Constant | N.B.: Constant | 13: else | 14: Reward | 15: end if | 16: Q(S, A) Q(S, A) + [Reward | Max(Q(S, A)) Q(S, A)] | 17: if S is equal to End State then | 18: break | 19: else | 20: S S | 21: end if | 22: end for | 23: if converge then | 24: Counter Counter | 25: if Counter Counter_Threshold then | 26: break all | 27: end if | 28: else | 29: Counter clear | 30: end if | 31: end for |
|