Research Article

Factorization -Learning Initialization for Parameter Optimization in Cellular Networks

Table 2

Comparison of performance.

AlgorithmsFactorizationConvergence episodesAverage episode reward
-GreedyBoltzmann-GreedyBoltzmann

-learningOriginal54.6367.8-12.14-0.590
Factorized480.7774.18.51411.91
Dyna -learning [12]Original64.9357.4-10.26-1.806
Factorized471.5702.69.24613.16
-learning [13]Original57.1374-11.900.018
Factorized464.1718.67.25310.42
Double -learning [14]Original56.8340.6-12.60-0.510
Factorized479824.89.39314.25
Speedy -learning [15]Original68.7365.6-8.8841.151
Factorized522699.114.3914.11