Research Article

A Novel Molecular Representation Learning for Molecular Property Prediction with a Multiple SMILES-Based Augmentation

Table 1

The description of public molecular datasets, including data type, task type, and the number of compounds before and after augmentation.

DatasetData typeTask typeCompoundCompound after augmentation

ESOLSMILESRegression11276762
LipophilicitySMILESRegression420025200
FreeSolvSMILESRegression6423852
HIVSMILESClassification4112769987
BACESMILESClassification15139078