Research Article

Metagenomics Biomarkers Selected for Prediction of Three Different Diseases in Chinese Population

Figure 1

The pipeline of data mining procedures. The whole pipeline of this study consists of preprocessing data (SRA to FASTQ, clinical information available, and discarding samples without complete clinical information), aligning to IGC and constructing the abundance matrix, feature selection and training algorithm, and biological interpretation.