Scalable Machine Learning Algorithms in Computational Biology and Biomedicine 2020View this Special Issue
Research Article | Open Access
YuHang Zhang, Tao Zeng, Lei Chen, ShiJian Ding, Tao Huang, Yu-Dong Cai, "Identification of COVID-19 Infection-Related Human Genes Based on a Random Walk Model in a Virus–Human Protein Interaction Network", BioMed Research International, vol. 2020, Article ID 4256301, 7 pages, 2020. https://doi.org/10.1155/2020/4256301
Identification of COVID-19 Infection-Related Human Genes Based on a Random Walk Model in a Virus–Human Protein Interaction Network
Coronaviruses are specific crown-shaped viruses that were first identified in the 1960s, and three typical examples of the most recent coronavirus disease outbreaks include severe acute respiratory syndrome (SARS), Middle East respiratory syndrome (MERS), and COVID-19. Particularly, COVID-19 is currently causing a worldwide pandemic, threatening the health of human beings globally. The identification of viral pathogenic mechanisms is important for further developing effective drugs and targeted clinical treatment methods. The delayed revelation of viral infectious mechanisms is currently one of the technical obstacles in the prevention and treatment of infectious diseases. In this study, we proposed a random walk model to identify the potential pathological mechanisms of COVID-19 on a virus–human protein interaction network, and we effectively identified a group of proteins that have already been determined to be potentially important for COVID-19 infection and for similar SARS infections, which help further developing drugs and targeted therapeutic methods against COVID-19. Moreover, we constructed a standard computational workflow for predicting the pathological biomarkers and related pharmacological targets of infectious diseases.
Coronaviruses are specific crown-shaped viruses that were first identified in the 1960s [1, 2]. In the 1960s, they were first identified as pathogens for zoonotic diseases without a direct and clear origin trace . They are highly transmissible viruses that can be spread via droplets and skin contact [3, 4]. Most coronaviruses are widely spread around the world. They cause simple and mild symptoms that are the same as cold and mild flu symptoms. However, specific coronavirus subtypes cause severe and even deadly symptoms, inducing large-scale pandemics regionally or worldwide. Three typical examples of the most recent coronavirus disease outbreaks include severe acute respiratory syndrome (SARS) [5, 6], Middle East respiratory syndrome (MERS) [7, 8], and COVID-19 [9, 10].
In 2003, the SARS outbreak occurred; it spread to more than 20 countries and regions in the Eurasian continent and resulted in the deaths of 774 people [5, 6]. The typical symptoms of SARS are high fevers in the early stage and severe inflammations in lung tissues after 2–7 days of quick progression . The large-scale SARS pandemic ended in less than half a year as a result of the effective epidemiological control exerted by the government . After a decade, MERS, another deadly and highly transmissible coronavirus, emerged in 2012. MERS has more severe early cellular infection and invasion capacity than SARS . It causes typical symptoms in less than 24 h, whereas SARS causes symptoms in 72 h. Therefore, treating and curing patients infected with MERS are more difficult than treating and curing patients with SARS . At the end of 2019, COVID-19, a new coronavirus subtype, broke out. In accordance with current epidemiological and clinical data, COVID-19 has been confirmed to be a similar coronavirus subtype as MERS and SARS . However, compared with the two other coronavirus subtypes, COVID-19 causes more complicated clinical symptoms  and has higher transmissible capacity  and faster mutational rates . These characteristics make its prevention and treatment difficult. More than 150,000 people have been confirmed to be infected with COVID-19, which has caused 5735 deaths. COVID-19 is currently causing a worldwide pandemic, threatening the health of human beings globally .
The identification of viral pathogenic mechanisms is important for further developing effective drugs and targeted clinical treatment methods. The delayed revelation of viral infectious mechanisms is currently one of the technical obstacles in the prevention and treatment of infectious diseases. Although SARS and MERS have already been finally controlled by regional governments, the pathogenic mechanisms of these diseases have not been fully revealed [12, 13]. SARS was controlled and banished in 2003, but its detailed mechanisms were not finally determined until 2005 [14, 15]. Meanwhile, the pathogenic mechanisms of MERS have still not been fully identified. The investigation of the pathological mechanisms of virulent viral pathogens by using traditional methods remains challenging.
In this study, we proposed a computational method to identify the potential pathological mechanisms of COVID-19, the coronavirus subtype that is now spreading all over the world and causing the 2019–2020 coronavirus pandemic. By using our prediction model, which is based on a random walk algorithm on a virus–human protein interaction network, we effectively identified a group of proteins that have already been determined to be potentially important for COVID-19 infection and for similar SARS infections. Through our newly presented computational methods, we identified a group of potential biomarkers for further developing drugs and targeted therapeutic methods against COVID-19. Moreover, we constructed a standard computational workflow for predicting the pathological biomarkers and related pharmacological targets of infectious diseases.
2. Materials and Methods
2.1. Dataset Construction of Target Human Proteins
Similar to the construction strategy used in a previous work , protein–protein interactions (PPIs) between the virus and its host (human) can be used to determine the course of COVID-19 infection. Whether a human protein interacts with viral proteins can be determined on the basis of functional terms from the Gene Ontology (GO) database . A human protein and COVID-19 protein that shared at least 1 GO term were assumed to interact with each other. The human protein was called the target human protein. Only GO terms at levels below 3 were considered to remove protein pairs sharing generic GO terms. For example, root GO terms (“GO: 0008150: biological process,” “GO: 0005575: cellular component,” and “GO:0003674: molecular function”), their children, and the children of their children terms were ignored in the following analysis. Through this approach, we constructed a dataset of target human proteins. All protein sequences of COVID-19 (Reference Genome MN908947) were downloaded from the NCBI protein database (http://www.ncbi.nlm.nih.gov/) in accordance with a preprinted article of Fast and Chen , and the sequences of the 11 COVID-19 proteins (orf1ab, S, orf3a, orf6, E, M, orf8, N, orf7b, orf7a, and orf10) are listed in Supporting Information S1.
2.2. PPI Data from STRING
Search Tool for the Retrieval of Interacting Genes (STRING) (http://string.embl.de/) is an online database resource. It compiles experimental and predicted PPIs with a confidence score. The PPIs in STRING are derived from several sources, such as (conserved) coexpression, automated text mining, genomic context predictions, high-throughput lab experiments, and previous knowledge in databases. Accordingly, they can widely measure the associations of proteins, which is a great advantage compared with PPIs reported in other databases, such as DIP (Database of Interaction Proteins) database  and BioGRID . Thus, we collected a weighted PPI network from STRING, in which the network nodes represent proteins and the network edges represent interactions between proteins with weights that indicate the significance of shared similar biological functions [21, 22]. Notably, the weight of each interaction edge was assigned with a weight, which was defined as the original confidence score reported in STRING. In this study, we analyzed the network wherein every two proteins in one interaction were in the target human protein dataset. The constructed network contained 19,247 nodes and 4,274,001 edges.
2.3. Random Walk with Restart Algorithm
The random walk with restart (RWR) algorithm, one of the typical network-based feature ranking algorithms [23, 24], can simulate a random walker that starts from one or several seed nodes and walks on a network.
RWR can update the weight (probability) vector of network nodes in an iterative manner in accordance with the following mathematical description: let be the probability vector after the th updating procedure. The next new probability vector will be updated depending on the previous vector as where is the column-wise normalized adjacency matrix of the given network, indicates the probability of the walker returning to the seed nodes, and is an initial probability vector. When the probability vector becomes convergent, the RWR algorithm stops and outputs the final . Each element of the final indicates the probability that the corresponding nodes are associated with the seed nodes.
In this work, 11,419 mapped candidate human proteins shared the same GO functions of COVID-19 were picked up as seed nodes in the RWR algorithm. Initialization vector was constructed. It consisted of 19,247 elements, wherein the elements corresponding to the seed genes were set to 1/11419, the other elements were set to zero, and the probability of returning to seed node was set to 0.8 as suggested in some studies [25–29]. The algorithm termination rule was .
2.4. Permutation Test
Based on the RWR algorithm, each protein in the PPI network was assigned a probability, which can indicate the associations between it and seed nodes. However, this value was determined by not only the seed nodes but also the topological structure of the network. Some special nodes in the network may more easily receive high probabilities than others. To control the influence of such factor, a permutation test was designed. First, we randomly constructed 1000 node sets, each of which contained 11,419 nodes. Second, for each node set, it was picked up as the input of the RWR algorithm; accordingly, each node in the network received a lot of probabilities. Finally, a value was computed for each node , which was defined by where denoted the number of probabilities on the randomly produced node sets that were larger than the probability on the actual seed nodes. Clearly, a node with a low value indicated that it was special for the seed nodes. Considering that 0.05 is a widely used threshold for statistical significance, we selected nodes with values less than 0.05 and their corresponding proteins were picked up for detailed analysis.
3. Results and Discussion
In this study, we designed a computation method to investigate the COVID-19 infection-related human genes. The entire procedures are shown in Figure 1.
3.1. Human Proteins Sharing Common GO Functions with COVID-19 Proteins
We downloaded the sequences of 11 COVID-19 proteins which are listed in Supplementary file 1. We predicted the domains and GO functions of these virus proteins based on their sequences using InterProScan (http://www.ebi.ac.uk/interpro/search/sequence/), and the InterPro results are given in Supplementary file 2. The 20 GO terms with were selected for the following analysis, as listed in Supplementary file 3. Next, we extracted the human proteins shared with the same 20 GO functions and these candidate human proteins are listed in Supplementary file 4. Then, we mapped them into the STRING network and finally obtained 11,419 proteins (Supplementary file 5).
3.2. Results of the RWR Algorithm and Permutation Test
After the above data preparation, we applied the RWR method with these 11,491 proteins as seeds on the STRING network and calculated the RWR probabilities of all proteins on the network. Meanwhile, we randomly selected the same number of seed proteins 1000 times and calculated all proteins’ permuted RWR probabilities. By comparing the actual and 1000 permutated RWR probabilities, we estimated the value of a protein being truly associated with COVID-19. Finally, we captured 486 highly confident human proteins associated with COVID-19 according to their RWR probabilities and permutation values (<0.05), as listed in Supplementary file 6. We analyzed and discussed a few representative candidates as listed in Table 1 in reference to recent reports on their functions.
3.3. Analysis on Some Essential Human Proteins
On the basis of our newly presented computational method, we applied the RWR algorithm to identify potential proteins that might functionally interact with the infectious process of COVID-19, which causes the typical disease coronavirus disease-19. According to recent publications, such proteins may not be only functionally related to the infection process of COVID-19 but may also participate in the infectious process of SARS, another famous infectious disease of the respiratory system. The detailed analyses can be seen below.
The first protein is ubiquitin-like 4A (UBL4A, ENSP00000358674). This protein, which is one of the major functional components of the BAG6/BAT3 complex, has been widely reported to participate in the recognition and delivery of misfolded and hydrophobic patch-containing proteins to proteasomes for degradation [30, 31]. In 2017, this protein was confirmed to participate in the endoplasmic-reticulum-associated protein degradation (ERAD) pathway in the viral infection cycle . In fact, the ERAD pathway has been reported to participate in the infection processes of various well-known viruses, e.g., the ERAD pathway has been confirmed to participate in the pathogenesis of the SARS coronavirus . Given that the infectious mechanism of COVID-19 has also been confirmed to be similar to that of SARS, we can reasonably predict that the ERAD pathway and one of its specific components, UBL4A, contribute to the pathogenesis of COVID-19 infection . Another predicted protein, ubiquitin-like 4b (UBL4B, ENSP00000334044), was identified by our newly presented computational models. UBL4B is also a component of the ERAD pathway and is therefore definitely functionally correlated with the pathogenesis of multiple coronaviruses, including SARS and COVID-19 [33, 34].
The next protein identified was uridine monophosphate synthetase (UMPS, ENSP00000232607), which contributes to the de novo pyrimidine biosynthetic pathway. As a biofunctional enzyme, this protein can help convert orotic acid into orotidine-5-monophosphate and further convert orotidine-5-monophosphate into uridine monophosphate [35, 36]. Uridine monophosphate, the terminal product of UMPS, has been widely reported to participate in coronaviral infection processes [37–39], especially RNA-polymerase-associated processes [38, 39]. In contrast to the infection processes of other coronaviruses, SARS infection exhibits abnormal uridine monophosphate regulation . Although experimental evidence remains lacking, we can still reasonably speculate that UMPS is functionally correlated with the COVID-19 infectious process considering that uridine monophosphate and its regulator UMPS are essential for RNA polymerases in RNA viruses, such as COVID-19.
POTEF (ENSP00000350052) is the next predicted protein that potentially contributes to COVID-19 infection. Similar to UBL4A, POTEF is a specific protein that contributes to the regulation of protein binding [40, 41]. This protein has been identified as A26C1B in various infectious models. In 2010, it was found to participate in the infection of HIV in chimpanzees . Furthermore, POTE, the family of POTEF, contributes to macrophage-mediated antiviral biological processes . Considering that the pathogeneses of SARS and other coronavirus have been confirmed to interact with macrophages and related immune responses  and macrophage infiltration in the lungs has already been widely reported [45, 46], this protein may also participate in COVID-19 pathogenesis.
The next predicted protein is LOC101927789 (ENSP00000310146). This novel identified fusion pseudogene has been identified to be a reactor against chemical exposure and is functionally correlated with another effective protein, MALAT1 . Recent publications confirm that MALAT1 is functionally related to the infectious processes of various viruses, including coronaviruses [48–50]. Although direct evidence showing that our predicted protein contributes to the infection of coronaviruses (SARS or COVID-19) remains lacking, we can reasonably speculate that MALAT1, together with our predicted fusion pseudogene, participates in some conserved biological or pathological processes of coronaviruses.
The final discussed protein is UBBP4 (ENSP00000464265), which has been widely reported to act as a pseudogene and is functionally correlated with psoriasis [51, 52]. Similar to UBL4A and UBL4B, UBBP4 can contribute to the regulation of the ERAD pathway  and therefore may be functionally correlated with COVID-19 infection. This relationship validates the efficacy and accuracy of our prediction.
All the predicted proteins were functionally confirmed to contribute to viral and coronaviral infection processes. Notably, many of the predicted proteins were functionally correlated with protein degradation and RNA metabolism, which are essential for viral infection, implying their potential functional relationships with COVID-19 infection. Our results will help design drugs and targeted therapy against COVID-19.
No data were used to support this study.
Conflicts of Interest
The authors declare that there is no conflict of interest regarding the publication of this paper.
Yu-Hang Zhang and Tao Zeng contributed equally to this work.
This study was supported by the Shanghai Municipal Science and Technology Major Project (2017SHZDZX01), the National Key R&D Program of China (2018YFC0910403), the National Natural Science Foundation of China (31701151), the Natural Science Foundation of Shanghai (17ZR1412500), the Shanghai Sailing Program (16YF1413800), and the Youth Innovation Promotion Association of Chinese Academy of Sciences (CAS) (2016245).
Supplementary File 1: the sequences of COVID-19 proteins. Supplementary File 2: the predicted domains and GO functions of COVID-19 proteins by InterProScan. Supplementary File 3: the GO terms with of COVID-19 proteins. Supplementary File 4: the candidate human proteins shared the same GO functions of COVID-19. Supplementary File 5: the mapped COVID-19 candidate human proteins on STRING network. Supplementary File 6: the highly confident human proteins associated with infection of COVID-19. (Supplementary Materials)
- C. C. Bergmann, T. E. Lane, and S. A. Stohlman, “Coronavirus infection of the central nervous system: host–virus stand-off,” Nature Reviews Microbiology, vol. 4, no. 2, pp. 121–132, 2006.
- W. K. Leung, K. F. To, P. K. S. Chan et al., “Enteric involvement of severe acute respiratory syndrome-associated coronavirus infection,” Gastroenterology, vol. 125, no. 4, pp. 1011–1017, 2003.
- Q. Li, X. Guan, P. Wu et al., “Early transmission dynamics in Wuhan, China, of novel coronavirus–infected pneumonia,” New England Journal of Medicine, vol. 382, no. 13, pp. 1199–1207, 2020.
- C. Drosten, B. Meyer, M. A. Müller et al., “Transmission of MERS-coronavirus in household contacts,” New England Journal of Medicine, vol. 371, no. 9, pp. 828–835, 2014.
- L. J. Stockman, R. Bellamy, and P. Garner, “SARS: systematic review of treatment effects,” PLoS Medicine, vol. 3, no. 9, article e343, 2006.
- L.-F. Wang, Z. Shi, S. Zhang, H. Field, P. Daszak, and B. Eaton, “Review of bats and SARS,” Emerging Infectious Diseases, vol. 12, no. 12, pp. 1834–1840, 2006.
- H. Momattin, K. Mohammed, A. Zumla, Z. A. Memish, and J. A. al-Tawfiq, “Therapeutic options for Middle East respiratory syndrome coronavirus (MERS-CoV)–possible lessons from a systematic review of SARS-CoV therapy,” International Journal of Infectious Diseases, vol. 17, no. 10, pp. e792–e798, 2013.
- J.-E. Park, S. Jung, A. Kim, and J. E. Park, “MERS transmission and risk factors: a systematic review,” BMC Public Health, vol. 18, no. 1, p. 574, 2018.
- C. Huang, Y. Wang, X. Li et al., “Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China,” The Lancet, vol. 395, no. 10223, pp. 497–506, 2020.
- J. T. Wu, K. Leung, and G. M. Leung, “Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study,” The Lancet, vol. 395, no. 10225, pp. 689–697, 2020.
- D. M. Skowronski, C. Astell, R. C. Brunham et al., “Severe acute respiratory syndrome (SARS): a year in review,” Annual Review of Medicine, vol. 56, no. 1, pp. 357–381, 2005.
- Y. Yin and R. G. Wunderink, “MERS, SARS and other coronaviruses as causes of pneumonia,” Respirology, vol. 23, no. 2, pp. 130–137, 2018.
- S. K. P. Lau, C. C. Y. Lau, K. H. Chan et al., “Delayed induction of proinflammatory cytokines and suppression of innate antiviral response by the novel Middle East respiratory syndrome coronavirus: implications for pathogenesis and treatment,” Journal of General Virology, vol. 94, no. 12, pp. 2679–2690, 2013.
- Z. Shi and Z. Hu, “A review of studies on animal reservoirs of the SARS coronavirus,” Virus Research, vol. 133, no. 1, pp. 74–87, 2008.
- D. A. Groneberg, R. Hilgenfeld, and P. Zabel, “Molecular mechanisms of severe acute respiratory syndrome (SARS),” Respiratory Research, vol. 6, no. 1, 2005.
- N. Zhang, M. Jiang, T. Huang, and Y. D. Cai, “Identification of influenza A/H7N9 virus infection-related human genes based on shortest paths in a virus-human protein interaction network,” BioMed Research International, vol. 2014, Article ID 239462, 11 pages, 2014.
- The Gene Ontology Consortium, “Gene ontology consortium: going forward,” Nucleic Acids Research, vol. 43, no. D1, pp. D1049–D1056, 2015.
- E. Fast and B. Chen, Potential T-cell and B-cell epitopes of 2019-nCoV, biorxiv, 2020.
- I. Xenarios, D. W. Rice, L. Salwinski, M. K. Baron, E. M. Marcotte, and D. Eisenberg, “DIP: the database of interacting proteins,” Nucleic Acids Research, vol. 28, no. 1, pp. 289–291, 2000.
- C. Stark, B. J. Breitkreutz, T. Reguly, L. Boucher, A. Breitkreutz, and M. Tyers, “BioGRID: a general repository for interaction datasets,” Nucleic Acids Research, vol. 34, no. 90001, pp. D535–D539, 2006.
- D. Szklarczyk, A. Franceschini, M. Kuhn et al., “The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored,” Nucleic Acids Research, vol. 39, no. Database, pp. D561–D568, 2010.
- D. Szklarczyk, A. L. Gable, D. Lyon et al., “STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets,” Nucleic Acids Research, vol. 47, no. D1, pp. D607–D613, 2019.
- S. Köhler, S. Bauer, D. Horn, and P. N. Robinson, “Walking the interactome for prioritization of candidate disease genes,” The American Journal of Human Genetics, vol. 82, no. 4, pp. 949–958, 2008.
- X. Yu, T. Zeng, and G. Li, “Integrative enrichment analysis: a new computational method to detect dysregulated pathways in heterogeneous samples,” BMC Genomics, vol. 16, no. 1, 2015.
- H. Liang, L. Chen, X. Zhao, and X. Zhang, “Prediction of drug side effects with a refined negative sample selection strategy,” Computational and Mathematical Methods in Medicine, vol. 2020, Article ID 1573543, 16 pages, 2020.
- L. Chen, T. Liu, and X. Zhao, “Inferring anatomical therapeutic chemical (ATC) class of drugs using shortest path and random walk with restart algorithms,” BBA - Molecular Basis of Disease, vol. 1864, no. 6, pp. 2228–2240, 2018.
- J. Zhang, Y. Suo, M. Liu, and X. Xu, “Identification of genes related to proliferative diabetic retinopathy through RWR algorithm based on protein-protein interaction network,” Biochimica et Biophysica Acta (BBA) - Molecular Basis of Disease, vol. 1864, no. 6, pp. 2369–2375, 2018.
- F. Yuan and W. C. Lu, “Prediction of potential drivers connecting different dysfunctional levels in lung adenocarcinoma via a protein-protein interaction network,” Biochimica et Biophysica Acta (BBA) - Molecular Basis of Disease, vol. 1864, no. 6, pp. 2284–2293, 2018.
- Y. Zhang, L. Dai, Y. Liu, Y. H. Zhang, and S. P. Wang, “Identifying novel fruit-related genes in Arabidopsis thaliana based on the random walk with restart algorithm,” PLoS One, vol. 12, no. 5, article e0177017, 2017.
- Y. Xu, M. Cai, Y. Yang, L. Huang, and Y. Ye, “SGTA recognizes a noncanonical ubiquitin-like domain in the Bag6-Ubl4A-Trc35 complex to promote endoplasmic reticulum-associated degradation,” Cell Reports, vol. 2, no. 6, pp. 1633–1644, 2012.
- N. Kuwabara, R. Minami, N. Yokota et al., “Structure of a BAG6 (Bcl-2-associated athanogene 6)-Ubl4a (ubiquitin-like protein 4a) complex reveals a novel binding interface that functions in tail-anchored protein biogenesis,” Journal of Biological Chemistry, vol. 290, no. 15, pp. 9387–9398, 2015.
- T. Wandtke, E. Wędrowska, A. Goede, P. Owczarska, E. Piskorska, and P. Kopiński, “Role of endoplasmic-reticulum-associated protein degradation pathway in the virus infection cycle,” Journal of Education, Health and Sport, vol. 7, no. 8, pp. 607–635, 2017.
- R. Minakshi, K. Padhan, M. Rani, N. Khan, F. Ahmad, and S. Jameel, “The SARS coronavirus 3a protein causes endoplasmic reticulum stress and induces ligand-independent downregulation of the type 1 interferon receptor,” PLoS One, vol. 4, no. 12, article e8342, 2009.
- T. Ahmed, M. Noman, A. Almatroudi et al., Coronavirus Disease 2019 Assosiated pneumonia in China: current 23 status and future prospects, Preprints, 2020.
- M. Griffith, J. C. Mwenifumbo, P. Y. Cheung et al., “Novel mRNA isoforms and mutations of uridine monophosphate synthetase and 5-fluorouracil resistance in colorectal cancer,” The Pharmacogenomics Journal, vol. 13, no. 2, pp. 148–158, 2013.
- B. Schwenger, S. Schöber, and D. Simon, “DUMPS cattle carry a point mutation in the uridine monophosphate synthase gene,” Genomics, vol. 16, no. 1, pp. 241–244, 1993.
- B. W. J. Mahy, S. Siddell, H. Wege, and V. ter Meulen, “RNA-dependent RNA polymerase activity in murine coronavirus-infected cells,” Journal of General Virology, vol. 64, no. 1, pp. 103–111, 1983.
- A. Azzi and S. X. Lin, “Human SARS-coronavirus RNA-dependent RNA polymerase: activity determinants and nucleoside analogue inhibitors,” Proteins: Structure, Function, and Bioinformatics, vol. 57, no. 1, pp. 12–14, 2004.
- G. A. Tannock and J. C. Hierholzer, “Presence of genomic polyadenylate and absence of detectable virion transcriptase in human coronavirus OC-43,” Journal of General Virology, vol. 39, no. 1, pp. 29–39, 1978.
- A. Misawa, K. I. Takayama, T. Fujimura, Y. Homma, Y. Suzuki, and S. Inoue, “Androgen-induced lncRNA POTEF-AS1 regulates apoptosis-related pathway to facilitate cell survival in prostate cancer cells,” Cancer Science, vol. 108, no. 3, pp. 373–379, 2017.
- S.-M. Zhou, L. Cheng, S. J. Guo et al., “Lectin RCA-I specifically binds to metastasis-associated cell surface glycans in triple-negative breast cancer,” Breast Cancer Research, vol. 17, no. 1, 2015.
- O. Fedrigo, L. R. Warner, A. D. Pfefferle, C. C. Babbitt, P. Cruz-Gordillo, and G. A. Wray, “A pipeline to determine RT-QPCR control genes for evolutionary studies: application to primate gene expression across multiple tissues,” PLoS One, vol. 5, no. 9, article e12545, 2010.
- U. Vekariya, R. Saxena, P. Singh et al., “HIV-1 Nef-POTEE; a novel interaction modulates macrophage dissemination via mTORC2 signaling pathway,” Life Sciences, vol. 214, pp. 158–166, 2018.
- M. Yilla, B. H. Harcourt, C. J. Hickman et al., “SARS-coronavirus replication in human peripheral monocytes/macrophages,” Virus Research, vol. 107, no. 1, pp. 93–101, 2005.
- J. Liu, X. Zheng, Q. Tong et al., “Overlapping and discrete aspects of the pathology and pathogenesis of the emerging human pathogenic coronaviruses SARS-CoV, MERS-CoV, and 2019-nCoV,” Journal of Medical Virology, vol. 92, no. 5, pp. 491–494, 2020.
- M. Yang, Cell pyroptosis, a potential pathogenic mechanism of 2019-nCoV infection, 2020, https://ssrn.com/abstract=3527420.
- H. Tani, S. Okuda, K. Nakamura, M. Aoki, and T. Umemura, “Short-lived long non-coding RNAs as surrogate indicators for chemical exposure and LINC00152 and MALAT1 modulate their neighboring genes,” PLoS One, vol. 12, no. 7, article e0181628, 2017.
- S. Saha, S. Murthy, and P. N. Rangarajan, “Identification and characterization of a virus-inducible non-coding RNA in mouse brain,” Journal of General Virology, vol. 87, no. 7, pp. 1991–1995, 2006.
- A. Ramaiah, D. Contreras, V. Gangalapudi, M. S. Padhye, J. Tang, and V. Arumugaswami, Dysregulation of long non-coding RNA (lncRNA) genes and predicted lncRNA-protein interactions during Zika virus infection, no. article 061788, bioRxiv, 2016.
- Y. Xiao, J. Zhang, and L. Deng, “Prediction of lncRNA-protein interactions using HeteSim scores based on heterogeneous networks,” Scientific Reports, vol. 7, no. 1, p. 3664, 2017.
- S. Munir, S. . Rahman, S. Rehman et al., “Association analysis of GWAS and candidate gene loci in a Pakistani population with psoriasis,” Molecular Immunology, vol. 64, no. 1, pp. 190–194, 2015.
- S. J. Elliman, J. Kavanaugh, and L. Couture, Sdc-2 exosome compositions and methods of isolation and use, 2019, US Patent Application 16/070,202.
- A. A. A. Al-Khedhairy, “Characterization of the nucleotide sequence of a polyubiquitin gene (PUBC1) from Arabian camel, Camelus dromedarius,” BMB Reports, vol. 37, no. 2, pp. 144–147, 2004.
Copyright © 2020 Yu Hang Zhang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.