Reinforcement Learning for Distributed Energy Efficiency Optimization in Underwater Acoustic Communication Networks

<div> Q-learning-based UACNs resource allocation algorithm for node <span class="nowrap"><svg height="15.8638pt" id="M165" style="vertical-align:-3.9436pt" version="1.1" viewbox="-0.0498162 -11.9202 45.5658 15.8638" width="45.5658pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M495 86L479 114C446 82 419 66 409 66C401 66 401 72 406 97C420 166 436 231 453 297C489 435 454 448 428 448C406 448 384 439 354 422C305 394 222 327 161 247H159L183 345C200 415 194 448 173 448C143 448 82 410 23 351L38 325C64 349 95 371 105 371C111 371 116 365 109 336L25 -4L31 -12C50 -4 77 3 107 9C119 69 132 122 145 168C197 254 321 381 370 381C387 381 393 374 378 305L329 95C309 17 320 -12 345 -12C372 -12 430 19 495 86Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,6.526,-5.741)"><path d="M655 676H628C608 656 600 650 571 650H156C118 650 114 651 102 676H77C63 615 45 548 24 485L60 486C81 531 97 562 114 580C134 603 148 611 252 611H308L217 127C201 41 196 38 105 32L98 0H384L393 32C300 38 288 41 304 127L393 611H466C553 611 571 604 582 580C592 559 597 530 595 485L630 489C636 548 645 624 655 676Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,6.461,3.784)"><path d="M250 606C250 634 233 656 203 656C168 656 146 618 146 593C146 564 169 545 192 545C227 545 250 573 250 606ZM227 95L212 119C187 98 152 71 135 71C129 71 128 78 134 102L207 373C219 418 217 451 194 451C165 451 92 411 30 351L44 326C77 353 106 371 114 371C124 371 121 357 117 341L55 97C32 5 46 -12 70 -12C108 -12 191 51 227 95Z"></path></g><g transform="matrix(.013,0,0,-0.013,12.916,0)"><path d="M95 130C70 130 46 113 46 88C46 72 54 64 59 64C93 55 121 33 121 -3C121 -41 93 -68 44 -88L55 -117C117 -98 186 -56 186 22C186 91 131 130 95 130Z"></path></g><g transform="matrix(.013,0,0,-0.013,18.059,0)"><path d="M244 607C244 633 228 655 200 655C166 655 146 618 146 594C146 564 166 546 191 546C221 546 244 574 244 607ZM222 91L209 114C184 94 148 66 133 66C127 66 124 73 130 96L201 370C213 416 211 448 191 448C162 448 88 407 29 352L42 328C73 354 104 371 114 371C120 371 119 365 115 345L53 92C32 5 45 -12 68 -12C103 -12 186 50 222 91Z"></path></g><g transform="matrix(.013,0,0,-0.013,25.24,0)"><path d="M448 1V51H364C248 51 153 129 140 230H448V280H140C153 381 248 459 364 459H448V509H365C208 509 80 395 80 255S208 1 365 1H448Z"></path></g><g transform="matrix(.013,0,0,-0.013,35.736,0)"><path d="M724 650H480V616C554 612 574 595 579 552C582 523 587 476 587 387V211H584L201 650H16V616C59 612 81 604 100 580S120 538 120 472V264C120 176 114 131 110 98C106 55 83 39 27 35V0H272V35C198 38 181 57 176 101C173 131 168 176 168 264V488H171L587 -10H635V387C635 476 641 523 644 554C649 599 670 613 724 616V650Z"></path></g></svg>.</span></div>

Wireless Communications and Mobile Computing

alg1

Algorithm 1

Algorithm 1: Reinforcement Learning for Distributed Energy Efficiency Optimization in Underwater Acoustic Communication Networks