Model-Free Attitude Control of Spacecraft Based on PID-Guide TD3 Algorithm

<table class="table-group" id="tab2"><tr><td><table class="table"><tr><td class="thead-hr" colspan="3"><hr/></td></tr><tr class="thead"><td class="align_left">Hyperparameters</td><td class="align_center">Symbol</td><td class="align_center">Value</td></tr><tr><td class="thead-hr" colspan="3"><hr/></td></tr><tr><td class="align_left">Random seed</td><td class="align_center">—</td><td class="align_center">2</td></tr><tr><td class="align_left">Max episodes</td><td class="align_center"><span style="width: 12.9526ptpx;"><svg height="8.68572pt" id="M161" style="vertical-align:-0.0498209pt" version="1.1" viewbox="-0.0498162 -8.6359 12.9526 8.68572" width="12.9526pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M962 650H795L470 145L347 650H176L170 622C268 613 275 606 244 503L190 322C150 188 132 126 118 91C102 50 80 33 18 28L12 0H245L251 28C175 35 170 48 174 93C177 128 191 180 220 284L292 542H294C331 392 383 150 409 4H432L774 555H776L714 137C700 40 694 34 612 28L606 0H868L874 28C793 34 784 37 797 137L849 533C859 612 863 616 956 622L962 650Z"></path></g></svg></span></td><td class="align_center">400</td></tr><tr><td class="align_left">Max steps per episode</td><td class="align_center"><span style="width: 8.41168ptpx;"><svg height="9.01194pt" id="M162" style="vertical-align:-0.04981995pt" version="1.1" viewbox="-0.0498162 -8.96212 8.41168 9.01194" width="8.41168pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M620 675H597C578 656 570 650 541 650H144C112 650 104 653 94 675H72C59 618 42 552 23 493L53 491C71 534 88 564 105 585C124 608 144 615 238 615H290L197 121C182 40 174 34 88 28L82 0H361L367 28C275 34 266 38 281 121L374 615H441C522 615 543 608 553 583C562 560 566 531 565 493L597 494C603 551 612 629 620 675Z"></path></g></svg></span></td><td class="align_center">200</td></tr><tr><td class="align_left">Sample time</td><td class="align_center"><span style="width: 11.2808ptpx;"><svg height="12.2532pt" id="M163" style="vertical-align:-3.29108pt" version="1.1" viewbox="-0.0498162 -8.96212 11.2808 12.2532" width="11.2808pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M620 675H597C578 656 570 650 541 650H144C112 650 104 653 94 675H72C59 618 42 552 23 493L53 491C71 534 88 564 105 585C124 608 144 615 238 615H290L197 121C182 40 174 34 88 28L82 0H361L367 28C275 34 266 38 281 121L374 615H441C522 615 543 608 553 583C562 560 566 531 565 493L597 494C603 551 612 629 620 675Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,7.176,3.132)"><path d="M357 391C357 416 325 451 270 451C241 451 179 431 146 400C109 365 97 334 97 307C97 248 143 213 196 180C250 146 261 123 261 101C261 75 237 40 190 40C153 40 110 72 86 109C81 116 68 119 55 112C37 102 24 87 24 66C24 30 85 -12 137 -12C229 -12 331 62 331 140C331 189 304 218 236 260C197 284 164 309 164 346C164 381 194 400 220 400C259 400 282 380 304 350C310 342 321 339 330 344C347 353 357 372 357 391Z"></path></g></svg></span></td><td class="align_center">1</td></tr><tr><td class="align_left">Replay buffer size</td><td class="align_center"><span style="width: 13.7747ptpx;"><svg height="9.49473pt" id="M164" style="vertical-align:-0.3238297pt" version="1.1" viewbox="-0.0498162 -9.1709 13.7747 9.49473" width="13.7747pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M934 175C849 68 804 41 775 41C758 41 749 50 749 68C749 108 827 177 827 227C827 262 820 286 793 313V315C886 323 971 383 971 476C971 536 934 586 878 622C895 633 913 642 933 651L914 698C878 685 845 671 816 654C749 683 672 699 592 699C492 699 406 681 337 648C226 596 162 510 162 423C162 323 242 266 332 266C488 266 543 397 543 520H493C490 463 477 419 458 387C427 332 389 321 345 321C297 321 253 360 253 419C253 544 403 644 599 644C646 644 701 633 750 610C654 535 589 429 509 282C400 85 348 34 233 34C198 34 165 42 146 65V66C186 73 204 105 204 134C204 169 180 201 136 201C89 201 55 159 55 109C55 33 123 -21 240 -21C405 -21 547 67 667 299C672 296 682 294 697 294C706 291 711 280 711 267C711 211 626 147 626 76C626 24 670 -18 749 -18C834 -18 900 38 973 142L934 175ZM857 481C857 423 811 357 760 349C741 361 720 367 698 367C730 436 755 509 814 569C843 546 857 516 857 481Z"></path></g></svg></span></td><td class="align_center">10<sup>6</sup></td></tr><tr><td class="align_left">Batch size</td><td class="align_center"><span style="width: 11.0475ptpx;"><svg height="8.8423pt" id="M165" style="vertical-align:-0.2064009pt" version="1.1" viewbox="-0.0498162 -8.6359 11.0475 8.8423" width="11.0475pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M822 650H589L583 622C660 617 677 607 674 561C672 534 664 481 647 390L600 137H596L273 650H126L120 622C176 620 194 615 207 594C221 571 225 557 214 504L161 257C141 166 129 112 121 85C108 42 83 30 29 28L23 0H260L266 28C193 33 173 42 176 89C178 122 186 172 202 255L256 527H259L583 -8H612L690 390C708 481 720 535 728 558C744 603 756 619 816 622L822 650Z"></path></g></svg></span></td><td class="align_center">250</td></tr><tr><td class="align_left">Policy network learning rate</td><td class="align_center"><span style="width: 14.7412ptpx;"><svg height="14.7625pt" id="M166" style="vertical-align:-5.47417pt" version="1.1" viewbox="-0.0498162 -9.28833 14.7412 14.7625" width="14.7412pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M393 379C402 394 400 411 393 422C384 437 365 448 348 448C301 448 237 372 186 285H182L193 335C210 408 205 448 178 448C150 448 80 402 29 344L45 321C80 355 114 373 122 373C128 373 130 365 124 330C106 228 76 98 50 -5L57 -12C82 -5 112 3 132 6L172 203C196 256 234 304 254 329C275 355 293 367 306 367C318 367 330 360 342 348C347 343 355 343 365 350S386 367 393 379Z"></path></g><g transform="matrix(.013,0,0,-0.013,5.488,0)"><path d="M238 681C243 705 239 712 230 712C217 712 156 682 75 674L70 648H105C148 648 153 641 144 598L39 110C18 11 35 -12 55 -12C90 -12 166 36 221 103L205 125C174 93 130 65 118 65C112 65 108 68 114 96L238 681Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,8.66,3.132)"><path d="M573 302C573 402 527 451 414 451C386 451 343 446 313 437L330 513L320 522C295 508 261 484 243 463L230 415C194 400 159 383 126 359L131 330C159 344 187 357 222 368L109 -147C96 -204 80 -214 18 -223L13 -255L256 -244L259 -212L236 -210C184 -205 180 -195 191 -141L219 -1C240 -10 268 -12 284 -12C352 4 431 48 484 104C543 166 573 240 573 302ZM481 290C481 165 381 37 305 37C280 37 249 56 235 71L302 395C328 399 353 402 372 402C427 402 481 378 481 290Z"></path></g></svg></span></td><td class="align_center">0.0003</td></tr><tr><td class="align_left">Critic network learning rate</td><td class="align_center"><span style="width: 13.0423ptpx;"><svg height="12.5794pt" id="M167" style="vertical-align:-3.29107pt" version="1.1" viewbox="-0.0498162 -9.28833 13.0423 12.5794" width="13.0423pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M393 379C402 394 400 411 393 422C384 437 365 448 348 448C301 448 237 372 186 285H182L193 335C210 408 205 448 178 448C150 448 80 402 29 344L45 321C80 355 114 373 122 373C128 373 130 365 124 330C106 228 76 98 50 -5L57 -12C82 -5 112 3 132 6L172 203C196 256 234 304 254 329C275 355 293 367 306 367C318 367 330 360 342 348C347 343 355 343 365 350S386 367 393 379Z"></path></g><g transform="matrix(.013,0,0,-0.013,5.488,0)"><path d="M238 681C243 705 239 712 230 712C217 712 156 682 75 674L70 648H105C148 648 153 641 144 598L39 110C18 11 35 -12 55 -12C90 -12 166 36 221 103L205 125C174 93 130 65 118 65C112 65 108 68 114 96L238 681Z"></path></g><g transform="matrix(.0091,0,0,-0.0091,8.66,3.132)"><path d="M387 400C387 425 348 451 303 451C247 451 176 414 132 376C69 322 24 228 24 148C24 43 74 -12 147 -12C211 -12 301 33 363 103L346 128C319 99 249 51 193 51C148 51 112 84 112 165C112 230 130 287 154 330C170 359 199 400 243 400C277 400 304 383 326 354C333 345 343 343 354 348C378 360 387 382 387 400Z"></path></g></svg></span></td><td class="align_center">0.001</td></tr><tr><td class="align_left">Exploration noise scale</td><td class="align_center"><span style="width: 5.39742ptpx;"><svg height="6.1673pt" id="M168" style="vertical-align:-0.2063904pt" version="1.1" viewbox="-0.0498162 -5.96091 5.39742 6.1673" width="5.39742pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M383 397C383 424 344 448 299 448C244 448 172 409 132 375C66 319 23 227 23 146C23 42 74 -12 146 -12C208 -12 298 30 359 103L343 124C315 95 248 48 192 48C145 48 111 85 111 163C111 228 129 294 151 330C171 363 201 401 241 401C275 401 302 384 325 356C332 347 339 344 348 348C373 360 383 381 383 397Z"></path></g></svg></span></td><td class="align_center">0.1</td></tr><tr><td class="align_left">Delay update</td><td class="align_center"><span style="width: 7.34169ptpx;"><svg height="9.49473pt" id="M169" style="vertical-align:-0.2063999pt" version="1.1" viewbox="-0.0498162 -9.28833 7.34169 9.49473" width="7.34169pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M530 686C535 705 530 712 521 712C504 712 448 684 359 674L358 648H393C437 648 439 646 429 593L400 435C372 447 345 448 332 448C286 448 194 414 144 373C68 311 23 203 23 111C23 26 57 -12 91 -12C120 -12 147 3 188 29C227 54 290 102 341 170H343L322 71C308 6 320 -12 341 -12C373 -12 442 27 501 96L485 120C455 91 422 67 408 67C401 67 401 76 404 91C440 294 479 473 530 686ZM387 375L355 241C326 187 200 53 142 53C126 53 109 73 109 130C109 217 154 337 218 381C240 396 265 404 297 404S372 390 387 375Z"></path></g></svg></span></td><td class="align_center">3</td></tr><tr><td class="align_left">Discount factor</td><td class="align_center"><span style="width: 6.63704ptpx;"><svg height="9.39034pt" id="M170" style="vertical-align:-3.42943pt" version="1.1" viewbox="-0.0498162 -5.96091 6.63704 9.39034" width="6.63704pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M478 372C478 418 458 448 431 448C409 448 389 431 389 410C389 404 391 400 394 395C398 388 406 371 406 348C406 253 308 122 251 51H249C254 122 249 257 231 336C212 421 189 448 159 448C126 448 75 412 23 327L48 306C83 354 103 371 115 371C125 371 134 360 144 334C185 224 192 64 183 -19C146 -100 116 -202 110 -244L125 -261C154 -259 208 -234 222 -220C222 -194 225 -84 235 -23C247 -3 273 36 308 79C379 165 478 288 478 372Z"></path></g></svg></span></td><td class="align_center">0.99</td></tr><tr><td class="align_left">Soft update rate</td><td class="align_center"><span style="width: 6.40217ptpx;"><svg height="6.1673pt" id="M171" style="vertical-align:-0.2063904pt" version="1.1" viewbox="-0.0498162 -5.96091 6.40217 6.1673" width="6.40217pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M471 456L444 459C426 433 414 430 388 430C324 430 270 434 216 434C103 434 51 374 23 338L43 317C96 366 146 380 221 375L154 109C149 86 147 68 147 52C147 4 168 -12 197 -12C240 -12 291 25 334 71L320 96C295 75 268 58 252 58C238 58 227 79 238 138C251 211 272 296 292 372C310 372 332 368 350 368C391 368 421 369 434 371C444 388 455 413 471 456Z"></path></g></svg></span></td><td class="align_center">0.01</td></tr><tr class="table-tr"><td colspan="3"><hr class="tbody-hr"/></td></tr></table></td></tr></table>

International Journal of Aerospace Engineering

tab2

Table 2

Table 2: Model-Free Attitude Control of Spacecraft Based on PID-Guide TD3 Algorithm