Research Article

Optimizing the Pairs-Trading Strategy Using Deep Reinforcement Learning with Trading and Stop-Loss Boundaries

Table 8

Average top-5 performance results of the proposed method and the traditional pairs-trading strategy in the out-of-sample dataset using TLS.

PairsModelMDDSharpe ratioProfit# of open portfolios# of closed portfolios# of stop-loss portfolios# of exited portfolios

MSFT/JPMPTDQN−0.11220.22943.04461861263862
PTA0−0.34110.07421.62362111365718
PTA1−0.29070.09791.80011621042632
PTA2−0.15070.19362.630313164760
PTA3−0.40320.15421.82829739157
PTA4−0.43400.04001.04805513042
PTA5−0.18360.30981.5524307023

MSFT/TXNPTDQN−0.34200.10011.54232041324765
PTA0−1.2094−0.05710.00132441527616
PTA1−0.9225−0.01770.61311781102543
PTA2−0.55740.03511.088713468858
PTA3−0.5375−0.01280.83269734162
PTA4−0.44850.02601.01186615150
PTA5−0.10480.12331.1502325027

BRKa/ABTPTDQN−0.07400.31592.36551621113043
PTA0−0.13920.15541.71571821283518
PTA1−0.10480.24642.1508138961527
PTA2−0.11330.25381.957810864340
PTA3−0.10400.24801.75767635140
PTA4−0.08290.20871.31714413031
PTA5−0.07040.43661.4013197012

BRKa/UTXPTDQN−0.54010.11741.57441671053558
PTA0−1.2143−0.01990.59181921175519
PTA1−0.93400.03461.0701147891245
PTA2−0.9099−0.00090.843512260557
PTA3−0.56730.04731.15208932156
PTA4−0.36410.06941.1628539044
PTA5−0.23090.04081.0405183015

JPM/TPTDQN−0.13840.12831.46531751134253
PTA0−0.36300.00710.89682051296015
PTA1−0.28010.04601.1595144941732
PTA2−0.37500.01920.998711962551
PTA3−0.5241−0.07170.66099235056
PTA4−0.3607−0.05500.84115618038
PTA5−0.22350.00610.9851226016

JPM/HONPTDQN−0.18720.15232.25102231553962
PTA0−0.67690.01901.00772741807023
PTA1−0.46440.06221.63312011392438
PTA2−0.45370.08401.716514987260
PTA3−0.24100.14141.764810743064
PTA4−0.33130.08791.31506216046
PTA5−0.16930.18031.2777287021

JPM/GEPTDQN−0.10980.21232.82501931244665
PTA0−0.38970.05071.51372241426517
PTA1−0.34040.06401.69121631091836
PTA2−0.16280.12841.903213273653
PTA3−0.29800.11421.755510638167
PTA4−0.28170.07901.28845513042
PTA5−0.06120.47761.7489216015

JNJ/WFCPTDQN−0.15760.24372.37411431002838
PTA0−0.28720.08921.49321641153712
PTA1−0.22190.19482.1147127901521
PTA2−0.31880.13221.63629955538
PTA3−0.23240.10841.31416827041
PTA4−0.15320.10431.12284014026
PTA5−0.09700.12031.0734166010

XOM/CVXPTDQN−0.42650.06051.19242181354577
PTA0−0.61890.02360.88122561616728
PTA1−0.59990.01540.88091971182554
PTA2−0.6034−0.00730.779215370875
PTA3−0.5628−0.02240.773411438274
PTA4−0.5311−0.02000.86437018151
PTA5−0.25830.00600.9692314027

HON/TXNPTDQN−0.08740.26793.27552331644963
PTA0−0.51080.10801.92192761866623
PTA1−0.58410.16252.33782071402838
PTA2−0.19260.20862.309615892462
PTA3−0.16110.15571.710011449263
PTA4−0.12540.22891.63746923046
PTA5−0.15780.19241.1925289019

GE/TXNPTDQN−0.11330.18712.13981721173048
PTA0−0.33480.09671.63982011364421
PTA1−0.16560.10701.63551531011933
PTA2−0.20430.13881.756811768841
PTA3−0.23350.15911.55558939248
PTA4−0.3847−0.13550.6570457038
PTA5−0.3489−0.27300.7218212019

MO/UTXPTDQN−0.52640.08401.2940150883558
PTA0−1.0950−0.02720.62311781025619
PTA1−0.72050.02861.0362125731239
PTA2−0.8361−0.00400.865810551350
PTA3−0.43110.00520.93237924054
PTA4−0.39160.11411.21294812036
PTA5−0.13110.29481.1276143011