Research Article

[Retracted] Identification of Tumor Tissue of Origin with RNA-Seq Data and Using Gradient Boosting Strategy

Table 1

The disease name and sample number in TCGA data.

DiseaseCodeTumor samplesPercentage

Bladder urothelial carcinomaBLCA3013.9025%
Breast invasive carcinomaBRCA105613.6912%
Cervical squamous cell carcinoma and endocervical adenocarcinomaCESC2583.3450%
Colon adenocarcinomaCOAD4515.8473%
Glioblastoma multiformeGBM1531.9837%
Head and neck squamous cell carcinomaHNSC4806.2233%
Kidney renal clear cell carcinomaKIRC5266.8197%
Kidney renal papillary cell carcinomaKIRP2222.8783%
Acute myeloid leukemiaLAML1732.2430%
Brain lower grade gliomaLGG4395.6917%
Liver hepatocellular carcinomaLIHC2943.8117%
Lung adenocarcinomaLUAD4866.3011%
Lung squamous cell carcinomaLUSC4285.5491%
Ovarian serous cystadenocarcinomaOV2613.3839%
Pancreatic adenocarcinomaPAAD1421.8410%
Prostate adenocarcinomaPRAD3794.9138%
Rectum adenocarcinomaREAD1531.9837%
Skin cutaneous melanomaSKCM801.0372%
Stomach adenocarcinomaSTAD4155.3805%
Thyroid carcinomaTHCA5006.4826%
Uterine corpus endometrial carcinomaUCEC5166.6900%
Total7713