BioMed Research International / 2020 / Article / Fig 1

Research Article

Identification of Latent Oncogenes with a Network Embedding Method and Random Forest

Figure 1

Flow chart to show the procedures of the method for inferring latent oncogenes. Seven protein-protein interaction (PPI) networks are constructed using the PPI information reported in STRING, from which feature vectors of oncogenes and unlabelled genes are extracted via Mashup. Unlabelled genes are divided into 39 parts; each of which combines the oncogenes to comprise a dataset. Each dataset induces a random forest (RF) model. A level value is computed for each unlabelled gene, which is the average of the probabilities yielded by 38 RF models. Unlabelled genes are ranked by the decreasing order of their level values. The same procedures are done for each oncogene, thereby ranking oncogenes. Finally, all genes are sorted in a list and ROC and PR curves are plotted to evaluate the performance of the method.

We are committed to sharing findings related to COVID-19 as quickly as possible. We will be providing unlimited waivers of publication charges for accepted research articles as well as case reports and case series related to COVID-19. Review articles are excluded from this waiver policy. Sign up here as a reviewer to help fast-track new submissions.