Performance Evaluation of Frequent Subgraph Discovery Techniques
Table 1
Dataset statistics.
Dataset
Description of the dataset used in the experiments
Chemical compound
340 chemical compounds, 24 different atoms, 66 atom types, and 4 types of bonds [5]
AIDS antiviral screen compound
The dataset contains 43,095 chemical compounds [12]
DTP human tumor cell line screen (CANSO3SD)
It consists of 42,247 molecules. Each molecule corresponds to a graph, where atoms are represented using nodes and the bonds between them are represented by edges