Research Article

Optimizing the Pairs-Trading Strategy Using Deep Reinforcement Learning with Trading and Stop-Loss Boundaries

Table 7

Average top-5 performance results for XOM and CVX using TLS within the training period.

ModelMDDSharpe ratioProfit# of open portfolios# of closed portfolios# of stop-loss portfolios# of exited portfolios

PTDQN−0.09440.21334.876054139910463
PTA0−0.12100.15224.194857941312541
PTA1−0.10150.16503.88344303105070
PTA2−0.14830.17223.34253202091398
PTA3−0.13860.17712.43852171013113
PTA4−0.17490.16021.685211938279
PTA5−0.28620.01371.03625510045