Research Article

Optimizing the Pairs-Trading Strategy Using Deep Reinforcement Learning with Trading and Stop-Loss Boundaries

Table 6

Average top-5 performance results for XOM and CVX using OLS within the training period.

ModelMDDSharpe ratioProfit# of open portfolios# of closed portfolios# of stop-loss portfolios# of exited portfolios

PTDQN−0.08420.18353.40684693366496
PTA0−0.20140.14522.593456538213250
PTA1−0.14310.17732.76034092794584
PTA2−0.12340.19552.630732519116118
PTA3−0.25860.08611.3850208862120
PTA4−0.25910.08031.193312439283
PTA5−0.2448−0.06380.85884711036