Research Article

Optimizing the Pairs-Trading Strategy Using Deep Reinforcement Learning with Trading and Stop-Loss Boundaries

Table 9

Average top-5 performance results of the proposed method and the traditional pairs-trading strategy in the out-of-sample dataset using OLS.

PairsModelMDDSharpe ratioProfit# of open portfolios# of closed portfolios# of stop-loss portfolios# of exited portfolios

MSFT/JPMPTDQN−0.20960.12281.92552151375462
PTA0−0.36180.04921.33652251416123
PTA1−0.50360.01881.01851681022838
PTA2−0.40450.06111.359112459857
PTA3−0.5055−0.00940.86369733361
PTA4−0.4195−0.00090.94595812145
PTA5−0.20180.12361.1593296023

MSFT/TXNPTDQN−0.28780.06981.34662441536568
PTA0−0.52710.00700.84892521567224
PTA1−0.47210.02551.02861871172644
PTA2−0.38160.02150.9912145711064
PTA3−0.6553−0.10150.505310430272
PTA4−0.27190.04221.05326316146
PTA5−0.18500.00680.9785347027

BRKa/ABTPTDQN−0.12820.16441.50761801094857
PTA0−0.5073−0.02650.70701831124822
PTA1−0.26490.04531.0786139801346
PTA2−0.22460.10561.294212160456
PTA3−0.16860.12411.27189138152
PTA4−0.14830.01760.97784912037
PTA5−0.16020.00040.9830162014

BRKa/UTXPTDQN−0.52310.08161.29762151325769
PTA0−1.1928−0.06470.33322161335725
PTA1−0.8697−0.01570.74451671001551
PTA2−0.7815−0.00710.839113570560
PTA3−0.35730.03151.02929436058
PTA4−0.20960.06841.08575211041
PTA5−0.1317−0.11740.9312162014

JPM/TPTDQN−0.13380.13911.45472051276050
PTA0−0.35880.00690.90542081306116
PTA1−0.25350.04051.0902151961935
PTA2−0.18720.05421.119811966548
PTA3−0.25740.03361.05029439055
PTA4−0.22120.03451.03125720037
PTA5−0.2348−0.19220.8299205015

JPM/HONPTDQN−0.38690.10711.51752501625768
PTA0−0.71410.01810.94442561665930
PTA1−0.50650.07021.30711981272249
PTA2−0.46490.10711.426015284365
PTA3−0.48710.07631.209810244058
PTA4−0.3503−0.06940.81785013037
PTA5−0.2980−0.17210.8040236017

JPM/GEPTDQN−0.11950.14431.76822261336469
PTA0−0.43790.00360.85492321376629
PTA1−0.15230.09871.4814165981651
PTA2−0.17380.12641.566113462567
PTA3−0.26800.07291.20269329064
PTA4−0.21040.12981.32425112039
PTA5−0.1461−0.04230.9586183015

JNJ/WFCPTDQN−0.18900.12661.71942021304756
PTA0−0.8705−0.03260.46352071315322
PTA1−0.6189−0.01340.7318150911939
PTA2−0.47630.03091.056312457462
PTA3−0.23180.14471.60729733262
PTA4−0.24150.05491.06325013037
PTA5−0.08800.24681.1886204016

XOM/CVXPTDQN−0.33160.02651.1517141812343
PTA0−0.7629−0.05470.41862401496130
PTA1−0.56480.01320.87541931142356
PTA2−0.6977−0.03870.665515470777
PTA3−0.52350.02770.986511738178
PTA4−0.4781−0.05770.81176312150
PTA5−0.3787−0.14920.8090293026

HON/TXNPTDQN−0.13390.15341.88522701756469
PTA0−0.41350.02120.94552761777028
PTA1−0.27580.06661.32162071242755
PTA2−0.26140.10541.503115984569
PTA3−0.17590.14131.561711745270
PTA4−0.08340.26501.70446623043
PTA5−0.06640.46061.68303013017

GE/TXNPTDQN−0.16760.12631.64112061404362
PTA0−0.61330.01780.97422111444423
PTA1−0.30850.05861.27431661091938
PTA2−0.24020.05851.221612868555
PTA3−0.3190−0.00130.91939131258
PTA4−0.2493−0.02850.9117498041
PTA5−0.08620.14171.0936234019

MO/UTXPTDQN−0.31810.05241.14021881174959
PTA0−0.46880.00410.86671951215221
PTA1−0.6166−0.02300.7470144841346
PTA2−0.5034−0.00760.866611551459
PTA3−0.28330.04571.08738832056
PTA4−0.29010.03561.02804412032
PTA5−0.15000.09921.0297132011