Research Article

Application of Multivariate Adaptive Regression Splines (MARSplines) for Predicting Hansen Solubility Parameters Based on 1D and 2D Molecular Descriptors Computed from SMILES String

Table 3

MARSplines HB(25, 2) model regression factors along with their weights.

Factorai ± SDMathematical relationships

F012.6280 ± 0.4535
F15.5560 ± 0.7079max(0; SHBd-0.84757)
F2−10.4070 ± 1.0496max(0; 0.84757-SHBd)
F31.0900 ± 0.1333max(0; 2.3406-CrippenLogP)
F40.0000 ± 0.0000max(0; ECCEN-20.00000)·F2
F50.0000 ± 0.0000max(0; 20.00000-ECCEN)·F2
F6−4.0810 ± 0.4901max(0; GATS2e-0.92565)
F7−4.9500 ± 0.5455max(0; 0.92565-GATS2e)
F8−0.1460 ± 0.0470max(0; WTPT-4-2.32775)
F9−1.5640 ± 0.1466max(0; 2.32775-WTPT-4)
F10−62.8000 ± 7.3785F1·max(0; SIC1-0.59306)
F11−20.6450 ± 5.3855F1·max(0; 0.59306-SIC1)
F1221.0280 ± 3.0488max(0; ETA_dEpsilon_D-0.05394)
F1379.3130 ± 14.5139max(0; 0.05394-ETA_dEpsilon_D)
F14−0.3920 ± 0.0593max(0; VE3_DzZ + 3.00162)·F8
F15−88.4270 ± 13.1857max(0; AATSC1i + 0.83463)·F13
F16−100.3560 ± 19.2748max(0; −0.83463-AATSC1i)·F13
F173.4670 ± 0.5511max(0; AATSC7i-0.42042)
F183.1050 ± 0.6674max(0; 0.42042-AATSC7i)
F190.1370 ± 0.0591max(0; ATSC1v + 23.64635)·F12
F200.2160 ± 0.0470max(0; −23.64635-ATSC1v)·F12
F211.8170 ± 0.7239F2·max(0; AATSC2i + 0.09514)
F226.9340 ± 1.5981F2·max(0; −0.09514-AATSC2i)

Model statistics: fitting criteria: N = 130, R2 = 0.974,  = 0.970, F = 216.6, and LOF = 2.955; internal validation criteria: LMO (30%), Q2loo = 0.960, RMSE = 1.509, and MAE = 1.150.