Review Article | Open Access
Magbubah Essack, Adil Salhi, Julijana Stanimirovic, Faroug Tifratene, Arwa Bin Raies, Arnaud Hungler, Mahmut Uludag, Christophe Van Neste, Andreja Trpkovic, Vladan P. Bajic, Vladimir B. Bajic, Esma R. Isenovic, "Literature-Based Enrichment Insights into Redox Control of Vascular Biology", Oxidative Medicine and Cellular Longevity, vol. 2019, Article ID 1769437, 16 pages, 2019. https://doi.org/10.1155/2019/1769437
Literature-Based Enrichment Insights into Redox Control of Vascular Biology
In cellular physiology and signaling, reactive oxygen species (ROS) play one of the most critical roles. ROS overproduction leads to cellular oxidative stress. This may lead to an irrecoverable imbalance of redox (oxidation-reduction reaction) function that deregulates redox homeostasis, which itself could lead to several diseases including neurodegenerative disease, cardiovascular disease, and cancers. In this study, we focus on the redox effects related to vascular systems in mammals. To support research in this domain, we developed an online knowledge base, DES-RedoxVasc, which enables exploration of information contained in the biomedical scientific literature. The DES-RedoxVasc system analyzed 233399 documents consisting of PubMed abstracts and PubMed Central full-text articles related to different aspects of redox biology in vascular systems. It allows researchers to explore enriched concepts from 28 curated thematic dictionaries, as well as literature-derived potential associations of pairs of such enriched concepts, where associations themselves are statistically enriched. For example, the system allows exploration of associations of pathways, diseases, mutations, genes/proteins, miRNAs, long ncRNAs, toxins, drugs, biological processes, molecular functions, etc. that allow for insights about different aspects of redox effects and control of processes related to the vascular system. Moreover, we deliver case studies about some existing or possibly novel knowledge regarding redox of vascular biology demonstrating the usefulness of DES-RedoxVasc. DES-RedoxVasc is the first compiled knowledge base using text mining for the exploration of this topic.
In cellular physiology and signaling, reactive oxygen species (ROS) are involved in various processes including cellular growth, gene expression, activation of signal transduction pathways, and induction of transcription factors in defense against infection [1–3]. In the vascular system, ROS play an important role in regulating endothelial function and vascular tone in physiological condition . However, ROS are also involved in pathophysiological processes such as inflammation, endothelial dysfunction, and vascular remodeling in cardiovascular diseases (CVD), including hypertension [5–8]. ROS are implicated in vascular pathophysiology, leading to atherosclerosis and arterial hypertension. Moreover, ROS-generating systems were found to facilitate diseases which promote vascular pathologies, such as hypercholesterolemia, diabetes mellitus, and obesity . Within the cardiovascular system (CVS), ROS have the role of signaling molecules and facilitate cellular differentiation and growth, cell migration, inactivation of NO, protein phosphorylation, and extracellular matrix production and breakdown. However, many of these effects relate to pathological changes in the vasculature . ROS are produced by endothelial cells (EC), vascular smooth muscle cells (VSMC), and adventitial cells and can be generated by various enzymes .
We are witnessing an enormous increase in the volume of published research material, which makes it infeasible for an individual researcher or a team of researchers to track all important developments even in a specific field. This is very prominent in the biomedical domain where, in addition to the great volume of published scientific reports, the information contained in these documents is itself highly complex. For example, the following query: “(human OR mouse OR rat OR mammal) AND (radical OR peroxide OR “reductive stress” OR ROS OR “reactive oxygen species” OR RNS OR “reactive nitrogen species” OR redox OR “reduction-oxidation reaction” OR oxidative OR nitrosative OR peroxide OR superoxide OR detoxifi OR antioxid OR “polyunsaturated fatty acids” OR “arachidonic acid” OR “linoleic acid” OR hydroperoxide OR “hypochlorous acid” OR peroxynitrit flavoprot OR xanthine oxidase OR “cytochromes P450” OR catalase OR sulfiredoxin OR peroxiredoxin) AND (“angina pectoris” OR anemia OR aneurysm OR angio OR arter OR atrial OR atrioventricular OR aort OR bradycardia OR blood OR brain OR circulati OR clogging OR cardio OR coronary OR edema OR heart OR ishemic OR hemo OR hypertension OR leukemia OR leuko OR macroangiopathy OR microangiopathy OR neovascularization OR occlusion OR pericardi OR sepsis OR “sickle cell” OR tachycardia OR tachyarrhythmia OR thromb OR vaso OR vein OR ventricular OR vascular OR vessel)” was used to retrieve all literature specifically focused on the problems related to redox effects on the cardiovascular system in mammalian organisms. Clarivate Analytics (https://clarivate.com/) has indexed in the Web of Science (All Databases), having 36063 and 169212 scientific articles published in 2017 and in the 2013-2017 period, respectively. This clearly highlights the challenges of analyzing information even in specialized domains.
The problem of how to explore such a voluminous information pool leads to looking for ways to simplify the search for useful information. This problem is not new, and it has been clear that one needs automated systems to support analysis of information contained in published literature. The last three decades have seen numerous attempts devoted to developments in this direction. This problem is addressed through text mining. Different aspects of text mining and a complementary set of techniques for the so-called natural language processing (NLP) have been applied for the exploration of biomedical information from free text [11–23].
Different methods were used for obtaining information from free text [24–33], many based on heavy utilization of ontologies and ontology structures . Also, there have been systematic efforts to combine text mining with other methods to enhance the capacity to extract useful information (for example, [30–32, 34]).
Text mining found applications in different biomedical domains [31, 35–48], for example, dealing with problems of cancers , disease biomarkers , sickle cell disease , tomato species , medicinal herbs , sodium channels , drug repurposing , protein analysis [40, 52], prioritization of cancer genes and pathways , hepatitis C virus , cancer risk assessment , associations of mutations and human diseases , or association of transcription factors .
Research in the utilization of text mining in the biomedical field has resulted in a number of applications that are accessible online, such as [56–79]. These demonstrate the increasing value of applying text mining to the biomedical field.
In this study, to support research in redox biology and its effects on CVS, we developed an online knowledge base (KB), DES-RedoxVasc (http://www.cbrc.kaust.edu.sa/des-rv), which enables exploration of information contained in biomedical scientific literature focused on redox control of vascular systems in mammals. We provide examples of DES-RedoxVasc use.
2. Exploration System
2.1. Server Architecture and Underlying Systems
2.2. The Literature Corpus and Dictionaries Incorporated into DES-RedoxVasc
The MongoDB literature repository contains only documents that are tagged as open access, which means that they are freely amenable to text mining. Thus, to create the literature corpus to be analyzed, the local MongoDB repository, last updated on September 03, 2018, was queried for all topic-specific PubMed and PMC articles. The same query used to query Web of Science (All Databases) above was used to create the literature corpus. The literature index server is designed to match the query to the titles, abstracts, and full-text article when available through the PMC set. The query retrieved 233399 articles.
Also, 28 topic-relevant dictionaries were used in this KB, of which eight dictionaries were newly compiled (see Table 1). The remaining 20 dictionaries were previously used in other KBs developed using the DES framework and in Table 1.
All dictionary concepts (see Table 2 for definitions) are normalized where possible. Normalization of concepts ensures that when concepts can be referred to by different symbols, names, or synonyms, it is always associated to a single entity (using an internal identifier) and it also ensures that concepts can be recognized through universal IDs such as NCBI Taxonomy ID, Entrez Gene ID, and UniProt ID that are regarded as trusted sources. For example, dealing with genes and proteins is frequently problematic in text mining. This is as a consequence of gene/protein names/symbols and their aliases, frequently denoting more than one gene/protein. We combined Entrez Gene (for genes) with UniProt (for proteins) nomenclatures which provide the official names/symbols/aliases routinely used. Then the normalization is applied in the DES system. The normalization of dictionary concepts improves the accuracy of concepts’ enrichment estimates.
Some concepts are relevant to more than one dictionary, for example, enzymes are gene products, and it is expected that nomenclatures of these entity types would have a substantial intersection. The same goes for drugs and chemicals, drugs and antibiotics, gene functions and pathways, etc. It is worth noting that normalization is done at the dictionary level and not across dictionaries because (1) it is the semantically valid approach, as biological entities might be pertinent to, say, both chemicals and drugs, and should be viewed as such depending on the scope of the literature and the user’s interest and (2) these dictionaries are used in a modular fashion independently from each other; it is not redundant to keep a reference to the same entity in two or more dictionaries. For example, a user might be interested only in drugs, and not in the more general collection of chemicals, and as such chooses only drugs for the KB annotation; therefore, they should have access to all drugs that are also part of the chemical dictionary. This also applies when doing dictionary specific searches within the same KB. It is not however acceptable to have redundant concepts within the same dictionary.
The literature corpus and 28 dictionaries were used for concept document mapping. The concept document mapping results were then used to statistically determine enriched concepts and enriched pairs of concepts.
2.2.1. Enriched Concepts
In a KB, concepts could be statistically enriched or not. If they are enriched in the KB, this is based on their abundance in the KB corpus which should be greater than one would expect as compared to the rest of the PubMed/PMC literature. The frequency of the concept across the entire literature is indicative of the expectation of its frequency in any randomly selected sample from the literature. A concept is enriched when its frequency in the KB is significantly higher than the expected frequency. To quantify determination of which concepts are enriched, a concept has to have a in the DES-RedoxVasc corpus when compared to the complete set of PubMed Central and PubMed articles in our local repository; in this manner, concepts most relevant to the KB are identified. The value was calculated based on the Benjamini-Hochberg procedure to correct for multiplicity testing. Note that this value is also known as a false discovery rate (FDR).
2.2.2. Enriched Concept Pairs
Pairs of enriched concepts are considered enriched for association by considering the abundance of their cooccurrence as compared to the individual occurrence of concepts that form the pair. So, for example, if two concepts occur 100 times each and they cooccur 90 times, there is a high chance that they are associated, because they each occurred with the other concept 90% of the time. The situation is of course not typically symmetric, but the example is just for clarification. The resulting enriched pairs of concepts may or may not be directly associated; however, the more a pair is enriched this way, the higher the probability for the association between the two concepts.
Using cooccurrence as a proxy for semantic relatedness, or association, is a well-established, if not the dominant, approach to semantic analysis and association extraction and is by no means particular to DES. PMI (pointwise mutual information) and cosine distance from Word2Vec embeddings are some of the mainstream examples of such an approach. Establishing association between two biomedical entities from the text in a biologically meaningful way (e.g., causality, inhibition, and coexpression) is however a much more challenging task, that is, the subject of much research pertinent to the more general question of NLU (natural language understanding). Focusing on one type of association, with certain simplifying assumptions, can render the task of targeted association extraction more amenable to computation, but this is not the purpose of our explorative system.
The total number of statistically enriched concepts from all 28 dictionaries used is 101938. The number of enriched concepts per dictionary is provided in Table 1. The total number of statistically enriched pairs of concepts that are themselves found statistically enriched is 5631393. The literature corpus, 28 dictionaries, enriched concepts, and enriched pairs of concepts were integrated to create DES-RedoxVasc. The resulting network of concept pairs was also embedded in a high-dimensional semantic space, therefore enabling the computation of semantic similarity between any two concepts within the KB.
2.2.3. Semantic Similarity
This similarity is a metric which establishes the likeness or closeness of two concepts in terms of their potential meaning. Semantic similarity can be the result of semantic relatedness, such as synonymy, antonymy, and hypernymy. For example, tall and short are semantically similar even though they are antonyms because they both share the semantic dimension of “height.” Semantic similarity within DES is calculated as the cosine distance between two concept embeddings (vector representations in a latent semantic space). These embeddings are obtained using a skip-gram Word2Vec model trained on the DES-RedoxVasc literature corpus with normalized concept annotation. Therefore, the underlying assumption for semantic similarity in DES is concept cooccurrence, but not necessarily direct cooccurrence.
3. DES-RedoxVasc Overview and Case Studies
DES-RedoxVasc allows oxidative control and vascular system-related literature to be easily explored using terms and associations that are determined to be statistically enriched in topic-specific publication. Briefly, these enriched terms/concepts can be explored using the “Enriched Concepts” (Enriched Terms) link or via the “Enriched Pairs” (Enriched Term Pairs) link that provides enriched cooccurring concepts. Concepts are regarded as cooccurring based on their cooccurrence in the text within a 200-character distance from each other. However, DES-RedoxVasc only reports the portion of cooccurring concepts (pairs of concepts) where pairs are statistically enriched, thereby increasing the probability that the reported associations could have “biological relevance.” However, “biological relevance” is left to the user to check on by exploring the actual related literature provided through the interface. So, if genes or proteins keep cooccurring with a particular disease or process much more frequently than is statistically expected, then we assume that these genes or proteins are deemed to be important to the disease pathology or process (also refer to Enriched Concept Pairs).
Users can also use the “Column visibility” tab in these links to explore enriched terms using ranking options for the false discovery rate (FDR), density, kb_frequency, and bkg_freq. Also, concepts are color coded to indicate the dictionary from which the concepts are retrieved.
Moreover, each concept is linked to a clickable box through which the “Network” and “Term Co-occurrences” links can be examined. Detailed description is provided in . There is also the “Literature” link that allows users to explore the literature in DES-RedoxVasc (PubMed abstracts and PMC full-text articles) and the “Network” link that allows users to explore and generate networks of enriched concept pairs. This version of DES also provides a new link named “Semantic Similarity.” Users are also provided with a “Software Manual” on the “Home” page of DES-RedoxVasc. Below, we provide several examples wherein a range of biomedical entities are used to develop insights into redox control in vascular systems.
3.1. Example 1: Finding the Relevant Concepts of Different Categories Using “Enriched Concepts” View
One rather simple but useful use of DES-RedoxVasc is a possibility to quickly find some of the most relevant concepts related to redox processes in CVS. For this, one can choose the “Enriched concepts” view button (on the left side). Then the page will show the list of most characteristics concepts from all dictionaries as found by the system. If one wants to see the most enriched concepts from a specific dictionary, this is possible by selecting the dictionary from the dropdown menu from the right side. As the inspection of these most characteristic concepts will show, most of them are very clearly related to the topic that we study. In the following, we examine such singled-out genes/proteins and microRNAs in more details.
Oxidants classified either as ROS [109, 110] or reactive nitrogen species (RNS) [109, 110] are generated through the cells’ normal metabolic processes as well as exogenous factors such as atmospheric pollutants and irradiation. These oxidants play important physiological roles in cell maintenance and are considered not to harm the human body when oxidant-antioxidant levels are relatively in equilibrium . However, in cases where the levels of these oxidants exceed the levels of antioxidants, oxidative stress (OS) is triggered . To counteract this state of oxidative stress, the cells increase antioxidant production to reestablish redox homeostasis [113, 114]. However, in contrast to the oxidative mechanisms, excess levels of antioxidants lead to excess reducing equivalents of glutathione (GSH), NADPH, and NADH that depletes ROS and triggers reductive stress (RS) . This state of chronic reductive stress stimulates an increase in the production of oxidants only to establish an oxidative stress state that is eventually driven back to the reductive stress state. Thus, excess antioxidant agents may also induce prooxidant effects .
These counter mechanisms describe the general processes that govern redox control. Moreover, the lack of redox control in the form of prolonged oxidative or reductive stresses has been linked to several disease states [117–119] including cardiovascular diseases.
Thus, we start exploring the efficacy of DES-RedoxVasc to retrieve established associations through the “Enriched Concepts” link (see Figure 1 and also see the “‘Published Examples” link for a more detailed description of how examples were generated).
3.1.1. Gene/Protein Associations with “Oxidative Stress”
Figure 1 shows that the gene/protein nodes are connected with “Oxidative stress” by a large number of articles. To confirm that the genes/proteins nodes and microRNA have true associations retrieved by DES-RedoxVasc, we checked the literature suggested by DES-RedoxVasc. Li et al. demonstrated that eNOS knockout mice exhibit cardiac aging prematurely and early mortality . In line with this finding, Zanetti et al. used aortae of rats (old and young) to demonstrate that the activated inducible nitric oxide synthase (iNOS), impaired SOD1 activity, and increased OS are associated with vascular aging. They also showed that caloric restriction blunts oxidative stress, reduced iNOS expression, and increased SOD1 activity . They further reported that SIRT1 expression remains unchanged. However, it has been shown that human coronary arterial endothelial cells treated with resveratrol induced SIRT1, as well as upregulated eNOS in a SIRT1-dependent manner . Also, OS induced with SOD1 deficiency triggers oxidatively modified CA2 to accumulate in erythrocytes .
ROS is also produced in normal airway epithelial cells stimulated with human neutrophil elastase (also known as HNE or ELANE) . It was also shown in a large gene set that Nrf2 binds to the antioxidant response element (ARE) (including glutamate-cysteine ligase (GCL), NAD(P)H-quinone oxidoreductase 1 (NQO1), heme oxygenase-1 (HMOX1), which encodes HO-1, and thioredoxin reductase 1 (Txnrd1)) to alleviate oxidative stress . Thus HO-1 was shown to play a key role in oxidative stress-related pathologies such as CVDs and atherosclerosis . OGG1 repairs DNA damage induced by OS, and an OGG1 (rs1052133) polymorphism has been associated with atherosclerosis  and CVD  risk.
All genes/proteins from Figure 1 had an association with “oxidative stress” except LPO. The reason is that LPO in the text was used to refer to lipid peroxide instead of “lactoperoxidase.” Despite ELANE (with one of its synonyms being HNE) and CA2 being associated with “Oxidative stress,” in most of the articles that putatively linked these concepts to “Oxidative stress,” HNE refers to the peroxidation by-product 4-hydroxy-2-nonenal instead of the human neutrophil elastase gene or product and CA2 refers to calcium. These examples illustrate a limitation of text mining caused by multiple meanings of the same symbol.
3.1.2. MicroRNA Associations with “Oxidative Stress”
On the other hand, if we look at nodes that are connected by a small number of articles such as the nodes for microRNAs in Figure 1, Step 3, we find “MIR23A” , “MIR34A” , “MIR155” , “MIR210” , and “MIR106B”  being associated in our KB with “oxidative stress” via “6,” “4,” “3,” “2,” and “1” articles, respectively.
The literature focused on “MIR23A-” (miR-23a-) revealed areas of research that may increase our insight of miR-23a-related redox control in various diseases. Dubois-Deruy et al. demonstrated that SOD2 is increased in the left ventricle after heart failure in rats, as well as miRNAs (miR-222-3p, miR-23a-3p, and miR-21-5p) targeting SOD2 . They further demonstrated that left ventricular remodeling postmyocardial infarction in REVE-2 patients  exhibits high levels of these SOD2-targeting miRNAs. In line with this finding, it was demonstrated that inhibiting oxidative stress-induced miR-23a (MIR23A) reduces degeneration of retinal pigment epithelium (RPE) cells . They further demonstrated that glutaminase (GLS) is a direct target of miR-23a and oxidative stress in miR-23a-overexpressed RPE cells is alleviated by GLS expression. This is interesting as GLS converts glutamine to glutamate, the precursor needed for synthesis of the antioxidant glutathione (see Figure 2).
Figure 2 depicts an overview of how redox control contributes to maintaining a healthy state and how redox dysfunction contributes to different disease states. When an increase in OS is coupled with the inhibition of the oxidative stress-induced microRNA, antioxidant synthesis is increased which reduces the oxidative stress back to the redox “homeostasis” state. Conversely, redox dysregulation in the form of increased expression of microRNAs inhibits antioxidant synthesis possibly leading to a disease state. This contradicts Lin et al. who, instead of inhibition, suggested the expression of miR-23a is required for the maintenance of healthy RPE cells .
3.2. Example 2: Hypotheses and Potentially New Insights Derived through the Use of DES-RedoxVasc
3.2.1. Hypothesis 1: Heart Failure May Occur in Response to Oxidative Stress
On the page “Enriched pairs,” “Oxidative stress response” in column 1 is linked to a number of miRNAs (see column 2 when the “Human miRNAs” dictionary is selected), among which there is “MIR4639” (hsa-miR-4639). We checked the FARNA database  for hsa-miR-4639 and found that this miRNA is expressed in the heart . Furthermore, FARNA suggests that hsa-miR-4639 is implicated in heart failure. On the other hand, Chen et al. demonstrated that increased levels of hsa-miR-4639 in plasma leads to downregulation of the DJ-1 protein activity in patients with Parkinson’s disease . Moreover, they demonstrated that miR-4639-5p directly binds the DJ-1 transcript at its 3UTR that results in the downregulation of the DJ-1 protein activity. This is interesting, as oxidative stress activates DJ-1 and DJ-1 is shown to inhibit alpha-synuclein aggregate formation that leads to Parkinson’s disease . The relationship between miR-4639 and oxidative stress is via DJ-1, as the Nrf2-regulated antioxidant defense mechanism is impaired when levels of DJ-1 are decreased . DJ-1 has also been shown to protect the heart against oxidative damage. That is, Billia et al. demonstrated that DJ-1 (with synonym PARK7) protects murine hearts against oxidative damage . DJ-1 was also shown to protect the heart from ischemia-reperfusion injury [142, 143]. Moreover, the work of Li et al. shows that miR-4639 is almost 3-fold overexpressed in chronic heart failure patients compared to the control group . All this leads us to the following hypothesis (see Figure 3): “overexpression of miR-4639 in the heart downregulates DJ-1 that protects the heart from oxidative damage, which may be one of the causes leading to heart failure.”
3.2.2. Hypothesis 2: Vascularization Redox Is Relevant to Alzheimer’s Disease
In search of novel insights, it is also useful to look at the concepts from different dictionaries that are associated with each other. For this analysis, we looked at all connections/association found between concepts in DES-RedoxVasc. Figure 4 shows the interconnectedness of the dictionaries with themselves and with the other dictionaries based on the cooccurring concept pairs, in the form of a heatmap. As shown in Figure 4, after normalization, the concepts from the ADO dictionary have the most connections to concepts from other dictionaries. This might seem surprising, but within the field of Alzheimer’s disease research, vascularization is intensely researched as a mechanism for the disease development, with some researchers proposing that it is primarily a vascular disorder rather than a neurodegenerative disease . However, since this link is based on the analysis of literature focused on redox effects to CVS, this implicitly suggests that redox-related vascular disorders may link to Alzheimer’s disease. We take this observation based on Figure 4 cautiously, as the number of concepts included in different dictionaries varies as well as the coverage of a particular domain by these concepts. So, it also could be that the quality of the ontologies from which we derived some of our dictionaries is affecting the heatmap in Figure 1. In any case, it was interesting to observe potential support for the hypothesis on a link of vascularization to Alzheimer’s disease.
3.2.3. Hypothesis 3: ZFAS1 May Play a Role in the Fine-Tuning of the Oxidative Stress-Responsive miR-27B
In search of novel insights, we also looked at the associations of concepts based on semantic similarity using the “Semantic Similarity” link (see Figure 5 and also see the “Published Examples” link for a more detailed description of how examples were generated). One of the semantic similarities () established by DES-RedoxVasc is between miR-27b and long non-coding RNA, ZFAS1. Xu et al. demonstrated that collagenase-induced intracerebral hemorrhage (ICH) in the rat brain reduces the expression of the oxidative stress-responsive miR-27b. It was also shown that overexpression of miR-27b reduced expression of Nrf2, SOD1, Hmox1, and Nqo1 and that miR-27b targets Nrf2 mRNA directly. They further demonstrated that miR-27b inhibition promotes the opposite effects, such as activation of the Nrf2/ARE pathway and reduced OS; these effects are blocked by Nrf2 knockdown . Thus, miR-27b is reduced to reestablish redox homeostasis. The dysfunction of this mechanism leads to vascular diseases. That is, it was demonstrated that when miR-27b overexpresses, it induces cardiac dysfunction and hypertrophy in mice . Also, Signorelli et al. demonstrated that the levels of miR-27b, miR-130a, and miR-210 are increased in patients with peripheral artery disease when compared to healthy controls . However, miR-27b has not been linked to ZFAS1. Despite that, this link may be correct as ZFAS1 is predicted to bind hsa-miR-27b-3p using the DIANA tool, LncBase Predicted v.2 .
Current research to a certain extent supports this hypothesis, as Pan et al. reported overexpression of ZFAS1 in gastric cancer (GC) serum and tissue samples and demonstrated that ZFAS1 knockdown inhibits the proliferation and migration of GC cells by suppressing cell cycle progression and apoptosis , while Chen et al. demonstrated that miR-27b is downregulated in GC and show miR-27b to be a potential GC biomarker. Moreover, they show that miR-27b functions as a tumor suppressor in GC by targeting VEGFC . This shows a possible inverse relationship between ZFAS1 and miR-27b. Moreover, Shin et al. report the risk of ischemic stroke and coronary heart disease incidence in GC patients . ZFAS1 was also determined to be a potential biomarker for coronary artery disease/acute myocardial infarction . Lyu et al. also showed ZFAS1 to be upregulated in rats with traumatic brain injury . This shows that miR-27b has been linked to OS and vascular disease and that ZFAS1 has been linked to vascular disease but its possible role in the fine-tuning of miR-27b in these pathologies have not been explored.
4. Discussion and Concluding Remarks
DES-RedoxVasc allows for exploration of numerous associations between different concepts as they are found in the analyzed literature. Over 5.6 million such associations have been identified by DES-RedoxVasc. These potential concept associations are based on the cooccurrence of the concepts in the text placed relatively close to each other (up to a 200-character distance). Moreover, these associations are found statistically enriched in the analyzed literature with and are made of concepts that themselves are statistically enriched in the same document set with , compared to documents in the background. Users can evaluate if such association found is meaningful by inspecting the text from where the association is derived. Another set of associations is between any of the individually enriched concepts and statistically enriched concepts that are semantically similar to them. In total, there are over 10 billion such associations found in the analyzed documents. Usually, when similarity between concepts is high, i.e., >0.75, such associations appear mostly meaningful, which reduces the number of concept pairs to an estimated 50 million.
Being primarily based on the text mining approach, DES-RedoxVasc carries all shortcomings of text mining. As we used dictionaries of terms related to different categories of concepts, the quality and completeness of these dictionaries affect the results. If a term that represent a synonym of a concept or the concept itself is not present in the dictionary, the system will not be able to identify it in the text. Also, some terms are “promiscuous” as they are very common and thus do not convey significant information. That is, promiscuous terms are terms which have very high connectivity in the knowledge graph. This is in turn due to their high frequency, because the more frequent a term is, the greater the probability for it to cooccur with more concepts. Usually, promiscuous terms have a broad semantic coverage like “function” or “disease.” Term ambiguity can also result in term promiscuity, such as the use of the term HAND or PDF as a gene symbol. Promiscuous terms might have thousands of edges, where every single edge might refer to thousands of cooccurrence hits within the annotation. Consequently, they inflate the index and the knowledge graph and therefore pose more demands on computation. More importantly, they affect the quality of extracted information and any inferences thereof, because they affect the very topology of the knowledge graph and act as high centrality hubs, creating short paths between concepts which are not otherwise associated. For example, the term “disease” can potentially link most disease concepts which are not necessarily linked, the same for pathological mutations, pathological microorganisms, etc., which are all related to the concept of disease. Removing promiscuous terms restores the intended topology of the knowledge graph. Pair enrichment provides another corrective layer for cases where promiscuous or irrelevant concepts seeped through the dictionary cleaning phase.
Computationally, to understand the improvements gained by removing these terms, we refer to the concept of term frequency distribution and in particular to Zipf’s law, which establishes that a term frequency and its rank (within a descending frequency-ordered list of terms within a corpus) obey a simple power law. The main consequence of this law is that a very small proportion of top-frequency-ranked terms (usually promiscuous in a biological context) account for a substantial amount of the text (in our case, the annotation and the knowledge graph). In our latest dictionary cleaning process, the removal of 0.1% of such high-frequency terms resulted in reducing the annotation size by a third.
An additional observation is that the Cardiovascular Disease Ontology (CVDO) on the other hand does not seem to resonate well within the knowledge base, having relatively few connections, despite being conceptually of central importance. Compared to CVDO, the Heart Failure Ontology (HFO) is much better connected to the other ontologies that we used. It is possible that this is the consequence of relatively incomplete CVDO that may need some improvements if it is to show the full usefulness in text mining tasks.
Despite these limitations, the examples provided hereby as “case studies” demonstrate that the KB can be useful and that the user-friendly interface allows users to easily navigate and explore information in the KB. The DES-RedoxVasc KB literature and dictionaries will be updated biannually, and the KB will be updated accordingly.
|CVDO:||Cardiovascular Disease Ontology|
|FDR:||False discovery rate|
|GCL:||Glutamate cysteine ligase|
|HFO:||Heart Failure Ontology|
|lncRNA:||Long non-coding RNA|
|NLP:||Natural language processing|
|Nrf2:||Nuclear erythroid 2-related factor 2|
|PMI:||Pointwise mutual information|
|RNS:||Reactive nitrogen species|
|ROS:||Reactive oxygen species|
|VSMC:||Vascular smooth muscle cells.|
This work is part of a collaboration between the Laboratory of Radiobiology and Molecular Genetics, Institute of Nuclear Sciences, Vinca, University of Belgrade, Belgrade, Serbia, and King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Center (CBRC), Thuwal, Saudi Arabia.
Conflicts of Interest
The authors confirm that this article content has no conflict of interest.
ME, AS, JS, FT, ABR, AH, MU, CVN, AT, VPB, VBB, and ERI wrote the paper; ERI and VBB designed, supervised, and critically revised the paper. Magbubah Essack and Adil Salhi are co-first authors.
This work has been supported by grant no. 173033 (ERI) and no. 173034 (BSP) from the Ministry of Education, Science and Technological Development, Republic of Serbia. VBB has been supported by the King Abdullah University of Science and Technology (KAUST) Base Research Fund (BAS/1/1606-01-01), and ME has been supported by KAUST Office of Sponsored Research (OSR) Award no. FCC/1/1976-24-01.
- K. K. Griendling, D. Sorescu, B. Lassègue, and M. Ushio-Fukai, “Modulation of protein kinase activity and gene expression by reactive oxygen species and their role in vascular physiology and pathophysiology,” Arteriosclerosis, Thrombosis, and Vascular Biology, vol. 20, no. 10, pp. 2175–2183, 2000.
- H. M. Lander, “An essential role for free radicals and derived species in signal transduction,” The FASEB Journal, vol. 11, no. 2, pp. 118–124, 1997.
- W. Droge, “Free radicals in the physiological control of cell function,” Physiological Reviews, vol. 82, no. 1, pp. 47–95, 2002.
- H. Kalwa, J. L. Sartoretto, S. M. Sartoretto, and T. Michel, “Angiotensin-II and MARCKS: a hydrogen peroxide- and RAC1-dependent signaling pathway in vascular endothelium,” Journal of Biological Chemistry, vol. 287, no. 34, pp. 29147–29158, 2012.
- I. Al Ghouleh, N. K. H. Khoo, U. G. Knaus et al., “Oxidases and peroxidases in cardiovascular and lung disease: new concepts in reactive oxygen species signaling,” Free Radical Biology & Medicine, vol. 51, no. 7, pp. 1271–1288, 2011.
- S. C. Bir, G. K. Kolluru, K. Fang, and C. G. Kevil, “Redox balance dynamically regulates vascular growth and remodeling,” Seminars in Cell & Developmental Biology, vol. 23, no. 7, pp. 745–757, 2012.
- F. Tabet, E. L. Schiffrin, G. E. Callera et al., “Redox-sensitive signaling by angiotensin II involves oxidative inactivation and blunted phosphorylation of protein tyrosine phosphatase SHP-2 in vascular smooth muscle cells from SHR,” Circulation Research, vol. 103, no. 2, pp. 149–158, 2008.
- M. Ushio-Fukai, R. W. Alexander, M. Akers, and K. K. Griendling, “p38 Mitogen-activated protein kinase is a critical component of the redox-sensitive signaling pathways activated by angiotensin II Role in vascular smooth muscle cell hypertrophy,” Journal of Biological Chemistry, vol. 273, no. 24, pp. 15022–15029, 1998.
- N. S. Zinkevich and D. D. Gutterman, “ROS-induced ROS release in vascular biology: redox-redox signaling,” American Journal of Physiology-Heart and Circulatory Physiology, vol. 301, no. 3, pp. H647–H653, 2011.
- T. Paravicini and R. Touyz, “Redox signaling in hypertension,” Cardiovascular Research, vol. 71, no. 2, pp. 247–258, 2006.
- D. Rebholz-Schuhmann, A. Oellrich, and R. Hoehndorf, “Text-mining solutions for biomedical research: enabling integrative biology,” Nature Reviews Genetics, vol. 13, no. 12, pp. 829–839, 2012.
- P. F. Anderson, C. Shannon, S. Bickett et al., “Systematic reviews and tech mining: a methodological comparison with case study,” Research Synthesis Methods, vol. 9, no. 4, pp. 540–550, 2018.
- H. Kilicoglu, “Biomedical text mining for research rigor and integrity: tasks, challenges, directions,” Briefings in Bioinformatics, vol. 19, no. 6, pp. 1400–1414, 2018.
- C. C. Huang and Z. Lu, “Community challenges in biomedical text mining over 10 years: success, failure and the future,” Briefings in Bioinformatics, vol. 17, no. 1, pp. 132–144, 2016.
- J. Jovanovic and E. Bagheri, “Semantic annotation in biomedicine: the current landscape,” Journal of Biomedical Semantics, vol. 8, no. 1, p. 44, 2017.
- M. Krallinger, O. Rabal, A. Lourenço, J. Oyarzabal, and A. Valencia, “Information retrieval and text mining technologies for chemistry,” Chemical Reviews, vol. 117, no. 12, pp. 7673–7761, 2017.
- R. Mishra, J. Bian, M. Fiszman et al., “Text summarization in the biomedical domain: a systematic review of recent research,” Journal of Biomedical Informatics, vol. 52, pp. 457–467, 2014.
- R. Rodriguez-Esteban and M. Bundschus, “Text mining patents for biomedical knowledge,” Drug Discovery Today, vol. 21, no. 6, pp. 997–1002, 2016.
- Z. Zeng, H. Shi, Y. Wu, and Z. Hong, “Survey of natural language processing techniques in bioinformatics,” Computational and Mathematical Methods in Medicine, vol. 2015, Article ID 674296, 10 pages, 2015.
- J. D. Saffer and V. L. Burnett, “Introduction to biomedical literature text mining: context and objectives,” in Biomedical Literature Mining, V. Kumar and H. Tipney, Eds., vol. 1159 of Methods in Molecular Biology (Methods and Protocols), pp. 1–7, Humana Press, New York, NY, USA, 2014.
- J. Fluck and M. Hofmann-Apitius, “Text mining for systems biology,” Drug Discovery Today, vol. 19, no. 2, pp. 140–144, 2014.
- A. M. Cohen and W. R. Hersh, “A survey of current work in biomedical text mining,” Briefings in Bioinformatics, vol. 6, no. 1, pp. 57–71, 2005.
- H. Shatkay and R. Feldman, “Mining the biomedical literature in the genomic era: an overview,” Journal of Computational Biology, vol. 10, no. 6, pp. 821–855, 2003.
- A. Bin Raies, H. Mansour, R. Incitti, and V. B. Bajic, “Combining position weight matrices and document-term matrix for efficient extraction of associations of methylated genes and diseases from free text,” PLoS One, vol. 8, no. 10, article e77848, 2013.
- R. Hoehndorf, L. Slater, P. N. Schofield, and G. V. Gkoutos, “Aber-OWL: a framework for ontology-based data access in biology,” BMC Bioinformatics, vol. 16, no. 1, p. 26, 2015.
- M. A. Rodríguez-García and R. Hoehndorf, “Inferring ontology graph structures using OWL reasoning,” BMC Bioinformatics, vol. 19, no. 1, p. 7, 2018.
- R. Hoehndorf, M. Dumontier, and G. V. Gkoutos, “Identifying aberrant pathways through integrated analysis of knowledge in pharmacogenomics,” Bioinformatics, vol. 28, no. 16, pp. 2169–2175, 2012.
- R. Hoehndorf, M. Dumontier, A. Oellrich et al., “A common layer of interoperability for biomedical ontologies based on OWL EL,” Bioinformatics, vol. 27, no. 7, pp. 1001–1008, 2011.
- M. Alshahrani, M. A. Khan, O. Maddouri, A. R. Kinjo, N. Queralt-Rosinach, and R. Hoehndorf, “Neuro-symbolic representation learning on biological knowledge graphs,” Bioinformatics, vol. 33, no. 17, pp. 2723–2730, 2017.
- P. Ruch, “Text mining to support gene ontology curation and vice versa,” Methods in Molecular Biology, vol. 1446, pp. 69–84, 2017.
- O. S. Kwon, J. Kim, K. H. Choi, Y. Ryu, and J. E. Park, “Trends in deqi research: a text mining and network analysis,” Integrative Medicine Research, vol. 7, no. 3, pp. 231–237, 2018.
- M. Bada, “Mapping of biomedical text to concepts of lexicons, terminologies, and ontologies,” Methods in Molecular Biology, vol. 1159, pp. 33–45, 2014.
- D. Chung, A. Lawson, and W. J. Zheng, “A statistical framework for biomedical literature mining,” Statistics in Medicine, vol. 36, no. 22, pp. 3461–3474, 2017.
- N. Tiffin, J. F. Kelso, A. R. Powell, H. Pan, V. B. Bajic, and W. A. Hide, “Integration of text- and data-mining using ontologies successfully selects disease gene candidates,” Nucleic Acids Research, vol. 33, no. 5, pp. 1544–1552, 2005.
- S. H. Park, M. S. Hwang, H. J. Park, H. K. Shin, J. U. Baek, and B. T. Choi, “Herbal prescriptions and medicinal herbs for Parkinson-related rigidity in Korean medicine: identification of candidates using text mining,” The Journal of Alternative and Complementary Medicine, vol. 24, no. 7, pp. 733–740, 2018.
- F. Xiao, C. Li, J. Sun, and L. Zhang, “Knowledge domain and emerging trends in organic photovoltaic technology: a scientometric review based on CiteSpace analysis,” Frontiers in Chemistry, vol. 5, p. 67, 2017.
- H. T. Yang, J. H. Ju, Y. T. Wong, I. Shmulevich, and J. H. Chiang, “Literature-based discovery of new candidates for drug repurposing,” Briefings in Bioinformatics, vol. 18, no. 3, pp. 488–497, 2017.
- A. S. Carvalho, M. S. Rodriguez, and R. Matthiesen, “Review and literature mining on proteostasis factors and cancer,” Methods in Molecular Biology, vol. 1449, pp. 71–84, 2016.
- A. Abbe, C. Grouin, P. Zweigenbaum, and B. Falissard, “Text mining applications in psychiatry: a systematic literature review,” International Journal of Methods in Psychiatric Research, vol. 25, no. 2, pp. 86–100, 2016.
- H. Shatkay, S. Brady, and A. Wong, “Text as data: using text-based features for proteins representation and for computational prediction of their characteristics,” Methods, vol. 74, pp. 54–64, 2015.
- Y. Luo, G. Riedlinger, and P. Szolovits, “Text mining in cancer gene and pathway prioritization,” Cancer Informatics, vol. 13s1, 2014.
- I. Spasić, J. Livsey, J. A. Keane, and G. Nenadić, “Text mining of cancer-related information: review of current status and future directions,” International Journal of Medical Informatics, vol. 83, no. 9, pp. 605–623, 2014.
- À. Bravo, M. Cases, N. Queralt-Rosinach, F. Sanz, and L. I. Furlong, “A knowledge-driven approach to extract disease-related biomarkers from the literature,” BioMed Research International, vol. 2014, Article ID 253128, 11 pages, 2014.
- L. B. Tari and J. H. Patel, “Systematic drug repurposing through text mining,” in Biomedical Literature Mining, V. Kumar and H. Tipney, Eds., vol. 1159 of Methods in Molecular Biology (Methods and Protocols), pp. 253–267, Humana Press, New York, NY, USA, 2014.
- K. M. Verspoor, “Roles for text mining in protein function prediction,” in Biomedical Literature Mining, V. Kumar and H. Tipney, Eds., vol. 1159 of Methods in Molecular Biology (Methods and Protocols), pp. 95–108, Humana Press, New York, NY, USA, 2014.
- D. Piedra, A. Ferrer, and J. Gea, “Text mining and medicine: usefulness in respiratory diseases,” Archivos de Bronconeumología, vol. 50, no. 3, pp. 113–119, 2014.
- J. M. Izarzugaza, M. Krallinger, and A. Valencia, “Interpretation of the consequences of mutations in protein kinases: combined use of bioinformatics and text mining,” Frontiers in Physiology, vol. 3, p. 323, 2012.
- A. Korhonen, D. Ó Séaghdha, I. Silins, L. Sun, J. Högberg, and U. Stenius, “Text mining for literature review and knowledge discovery in cancer risk assessment and research,” PLoS One, vol. 7, no. 4, article e33427, 2012.
- M. Essack, A. Radovanovic, and V. B. Bajic, “Information exploration system for sickle cell disease and repurposing of hydroxyfasudil,” PLoS One, vol. 8, no. 6, article e65190, 2013.
- A. Salhi, S. Negrão, M. Essack et al., “DES-TOMATO: a knowledge exploration system focused on tomato species,” Scientific Reports, vol. 7, no. 1, p. 5968, 2017.
- S. Sagar, M. Kaur, A. Dawe et al., “DDESC: Dragon database for exploration of sodium channels in human,” BMC Genomics, vol. 9, no. 1, p. 622, 2008.
- R. Chowdhary, J. Zhang, S. L. Tan, D. E. Osborne, V. B. Bajic, and J. S. Liu, “PIMiner: a web tool for extraction of protein interactions from biomedical literature,” International Journal of Data Mining and Bioinformatics, vol. 7, no. 4, pp. 450–462, 2013.
- S. K. Kwofie, A. Radovanovic, V. S. Sundararajan, M. Maqungo, A. Christoffels, and V. B. Bajic, “Dragon exploratory system on hepatitis C virus (DESHCV),” Infection, Genetics and Evolution, vol. 11, no. 4, pp. 734–739, 2011.
- V. Kordopati, A. Salhi, R. Razali et al., “DES-mutation: system for exploring links of mutations and diseases,” Scientific Reports, vol. 8, no. 1, article 13359, 2018.
- H. Pan, L. Zuo, V. Choudhary et al., “Dragon TF Association Miner: a system for exploring transcription factor associations through text-mining,” Nucleic Acids Research, vol. 32, Supplement 2, pp. W230–W234, 2004.
- Y. Liu, Y. Liang, and D. Wishart, “PolySearch2: a significantly improved text-mining system for discovering associations between human diseases, genes, drugs, metabolites, toxins and more,” Nucleic Acids Research, vol. 43, no. W1, pp. W535–W542, 2015.
- D. Cheng, C. Knox, N. Young, P. Stothard, S. Damaraju, and D. S. Wishart, “PolySearch: a web-based text mining system for extracting relationships between human diseases, genes, mutations, drugs and metabolites,” Nucleic Acids Research, vol. 36, Supplement 2, pp. W399–W405, 2008.
- A. Salhi, M. Essack, T. Alam et al., “DES-ncRNA: a knowledgebase for exploring information about human micro and long noncoding RNAs based on literature-mining,” RNA Biology, vol. 14, no. 7, pp. 963–971, 2017.
- M. Neves and U. Leser, “A survey on annotation tools for the biomedical literature,” Briefings in Bioinformatics, vol. 15, no. 2, pp. 327–340, 2014.
- K. Kreiner, D. Hayn, and G. Schreier, “Twister: a tool for reducing screening time in systematic literature reviews,” Studies in Health Technology and Informatics, vol. 255, pp. 5–9, 2018.
- R. Paynter, L. L. Bañez, E. Berliner et al., EPC Methods: An Exploration of the Use of Text-Mining Software in Systematic Reviews, Agency for Healthcare Research and Quality, 2016.
- M. Essack, A. Radovanovic, U. Schaefer et al., “DDEC: Dragon database of genes implicated in esophageal cancer,” BMC Cancer, vol. 9, no. 1, p. 219, 2009.
- S. Baker, I. Ali, I. Silins et al., “Cancer Hallmarks Analytics Tool (CHAT): a text mining approach to organize and evaluate scientific literature on cancer,” Bioinformatics, vol. 33, no. 24, pp. 3973–3981, 2017.
- B. E. Howard, J. Phillips, K. Miller et al., “SWIFT-Review: a text-mining workbench for systematic review,” Systematic Reviews, vol. 5, no. 1, p. 87, 2016.
- S. K. Kwofie, U. Schaefer, V. S. Sundararajan, V. B. Bajic, and A. Christoffels, “HCVpro: hepatitis C virus protein interaction database,” Infection, Genetics and Evolution, vol. 11, no. 8, pp. 1971–1977, 2011.
- L. French, P. Liu, O. Marais et al., “Text mining for neuroanatomy using WhiteText with an updated corpus and a new web application,” Frontiers in Neuroinformatics, vol. 9, p. 13, 2015.
- J. A. Bachman, B. M. Gyori, and P. K. Sorger, “FamPlex: a resource for entity recognition and relationship resolution of human protein families and complexes in biomedical text mining,” BMC Bioinformatics, vol. 19, no. 1, p. 248, 2018.
- M. Maqungo, M. Kaur, S. K. Kwofie et al., “DDPC: Dragon Database of Genes associated with Prostate Cancer,” Nucleic Acids Research, vol. 39, pp. D980–D985, 2011.
- Z. Ye, A. P. Tafti, K. Y. He, K. Wang, and M. M. He, “SparkText: biomedical text mining on big data framework,” PLoS One, vol. 11, no. 9, article e0162721, 2016.
- S. Sagar, M. Kaur, A. Radovanovic, and V. B. Bajic, “Dragon exploration system on marine sponge compounds interactions,” Journal of Cheminformatics, vol. 5, no. 1, p. 11, 2013.
- A. G. Jácome, F. Fdez-Riverola, and A. Lourenço, “BIOMedical Search Engine Framework: lightweight and customized implementation of domain-specific biomedical search engines,” Computer Methods and Programs in Biomedicine, vol. 131, pp. 63–77, 2016.
- A. Salhi, M. Essack, A. Radovanovic et al., “DESM: portal for microbial knowledge exploration systems,” Nucleic Acids Research, vol. 44, no. D1, pp. D624–D633, 2016.
- R. Khare, C. H. Wei, Y. Mao, R. Leaman, and Z. Lu, “tmBioC: improving interoperability of text-mining tools with BioC,” Database, vol. 2014, no. 10, article bau073, 2014.
- K. Raja, S. Subramani, and J. Natarajan, “PPInterFinder—a mining tool for extracting causal relations on human proteins from literature,” Database, vol. 2013, article bas052, 2013.
- R. M. Dohmen, “Cell lineage in molluscan development,” Microscopy Research & Technique, vol. 22, no. 1, pp. 75–102, 1992.
- H. Liu, T. Christiansen, W. A. Baumgartner, and K. Verspoor, “BioLemmatizer: a lemmatization tool for morphological processing of biomedical text,” Journal of Biomedical Semantics, vol. 3, no. 1, p. 3, 2012.
- C. Roeder, C. Jonquet, N. H. Shah, W. A. Baumgartner, K. Verspoor, and L. Hunter, “A UIMA wrapper for the NCBO annotator,” Bioinformatics, vol. 26, no. 14, pp. 1800-1801, 2010.
- M. Kaur, A. Radovanovic, M. Essack et al., “Database for exploration of functional context of genes implicated in ovarian cancer,” Nucleic Acids Research, vol. 37, pp. D820–D823, 2009.
- J. H. Chiang, H. C. Yu, and H. J. Hsu, “GIS: a biomedical text-mining system for gene information discovery,” Bioinformatics, vol. 20, no. 1, pp. 120-121, 2004.
- A. B. Raies, H. Mansour, R. Incitti, and V. B. Bajic, “DDMGD: the database of text-mined associations between genes methylated in diseases from different species,” Nucleic Acids Research, vol. 43, no. D1, pp. D879–D886, 2015.
- A. S. Dawe, A. Radovanovic, M. Kaur et al., “DESTAF: a database of text-mined associations for reproductive toxins potentially affecting human fertility,” Reproductive Toxicology, vol. 33, no. 1, pp. 99–105, 2012.
- V. B. Bajic, M. Veronika, P. S. Veladandi et al., “Dragon Plant Biology Explorer. A text-mining tool for integrating associations between genetic and biochemical entities with genome annotation and biochemical terms lists,” Plant Physiology, vol. 138, no. 4, pp. 1914–1925, 2005.
- R. Chowdhary, S. L. Tan, J. Zhang, S. Karnik, V. B. Bajic, and J. S. Liu, “Context-specific protein network miner – an online system for exploring context-specific protein interaction networks from the literature,” PLoS One, vol. 7, no. 4, article e34480, 2012.
- J. Hastings, P. de Matos, A. Dekker et al., “The ChEBI reference database and ontology for biologically relevant chemistry: enhancements for 2013,” Nucleic Acids Research, vol. 41, no. D1, pp. D456–D463, 2013.
- D. Wishart, D. Arndt, A. Pon et al., “T3DB: the toxic exposome database,” Nucleic Acids Research, vol. 43, no. D1, pp. D928–D934, 2015.
- D. Cotter, A. Maer, C. Guda, B. Saunders, and S. Subramaniam, “LMPD: LIPID MAPS proteome database,” Nucleic Acids Research, vol. 34, no. 90001, pp. D507–D510, 2006.
- M. Sud, E. Fahy, D. Cotter et al., “LMSD: LIPID MAPS structure database,” Nucleic Acids Research, vol. 35, pp. D527–D532, 2007.
- The Gene Ontology Consortium, “Gene ontology consortium: going forward,” Nucleic Acids Research, vol. 43, no. D1, pp. D1049–D1056, 2015.
- H. Ogata, S. Goto, K. Sato, W. Fujibuchi, H. Bono, and M. Kanehisa, “KEGG: Kyoto Encyclopedia of Genes and Genomes,” Nucleic Acids Research, vol. 27, no. 1, pp. 29–34, 1999.
- A. Fabregat, K. Sidiropoulos, P. Garapati et al., “The Reactome pathway Knowledgebase,” Nucleic Acids Research, vol. 44, no. D1, pp. D481–D487, 2016.
- A. Morgat, E. Coissac, E. Coudert et al., “UniPathway: a resource for the exploration and annotation of metabolic pathways,” Nucleic Acids Research, vol. 40, no. D1, pp. D761–D769, 2012.
- H. Mi, B. Lazareva-Ulitsky, R. Loo et al., “The PANTHER database of protein families, subfamilies, functions and pathways,” Nucleic Acids Research, vol. 33, pp. D284–D288, 2005.
- W. A. Kibbe, C. Arze, V. Felix et al., “Disease Ontology 2015 update: an expanded and updated database of human diseases for linking biomedical knowledge through disease data,” Nucleic Acids Research, vol. 43, no. D1, pp. D1071–D1078, 2015.
- A. Malhotra, E. Younesi, M. Gündel, B. Müller, M. T. Heneka, and M. Hofmann-Apitius, “ADO: a disease ontology representing the domain knowledge specific to Alzheimer’s disease,” Alzheimer's & Dementia, vol. 10, no. 2, pp. 238–246, 2014.
- S. El-Sappagh, D. Kwak, F. Ali, and K.-S. Kwak, “DMTO: a realistic ontology for standard diabetes mellitus treatment,” Journal of Biomedical Semantics, vol. 9, no. 1, p. 8, 2018.
- L. Wang, B. E. Bray, J. Shi, G. del Fiol, and P. J. Haug, “A method for the development of disease-specific reference standards vocabularies from textual biomedical literature resources,” Artificial Intelligence in Medicine, vol. 68, pp. 47–57, 2016.
- M. Arguello Casteleiro, G. Demetriou, W. Read et al., “Deep learning meets ontologies: experiments to anchor the cardiovascular disease ontology in the biomedical literature,” Journal of Biomedical Semantics, vol. 9, no. 1, p. 13, 2018.
- S. Köhler, N. A. Vasilevsky, M. Engelstad et al., “The Human Phenotype Ontology in 2017,” Nucleic Acids Research, vol. 45, no. D1, pp. D865–D876, 2017.
- C. J. Mungall, C. Torniai, G. V. Gkoutos, S. E. Lewis, and M. A. Haendel, “Uberon, an integrative multi-species anatomy ontology,” Genome Biology, vol. 13, no. 1, article R5, 2012.
- WHO, International statistical classification of diseases and related health problems, World Health Organization, Geneva, Switzerland, 10th edition, 2010.
- D. S. Wishart, Y. D. Feunang, A. C. Guo et al., “DrugBank 5.0: a major update to the DrugBank database for 2018,” Nucleic Acids Research, vol. 46, no. D1, pp. D1074–D1082, 2018.
- L. Chen, W. M. Zeng, Y. D. Cai, K. Y. Feng, and K. C. Chou, “Predicting Anatomical Therapeutic Chemical (ATC) classification of drugs by integrating chemical-chemical interactions and similarities,” PLoS One, vol. 7, no. 4, article e35254, 2012.
- M. Kuhn, I. Letunic, L. J. Jensen, and P. Bork, “The SIDER database of drugs and side effects,” Nucleic Acids Research, vol. 44, no. D1, pp. D1075–D1079, 2016.
- D. Maglott, J. Ostell, K. D. Pruitt, and T. Tatusova, “Entrez Gene: gene-centered information at NCBI,” Nucleic Acids Research, vol. 39, pp. D52–D57, 2011.
- S. Schmeier, T. Alam, M. Essack, and V. B. Bajic, “TcoF-DB v2: update of the database of human and mouse transcription co-factors and transcription factor interactions,” Nucleic Acids Research, vol. 45, no. D1, pp. D145–D150, 2017.
- B. Yates, B. Braschi, K. A. Gray, R. L. Seal, S. Tweedie, and E. A. Bruford, “Genenames.org: the HGNC and VGNC resources in 2017,” Nucleic Acids Research, vol. 45, no. D1, pp. D619–D625, 2017.
- C. H. Wei, B. R. Harris, H. Y. Kao, and Z. Lu, “tmVar: a text mining approach for extracting sequence variants in biomedical literature,” Bioinformatics, vol. 29, no. 11, pp. 1433–1439, 2013.
- J. Huang, J. Dang, G. M. Borchert et al., “OMIT: dynamic, semi-automated ontology development for the microRNA domain,” PLoS One, vol. 9, no. 7, article e100855, 2014.
- H. J. Forman, J. M. Fukuto, and M. Torres, “Redox signaling: thiol chemistry defines which reactive oxygen and nitrogen species can act as second messengers,” American Journal of Physiology-Cell Physiology, vol. 287, no. 2, pp. C246–C256, 2004.
- L. Packer, S. U. Weber, and G. Rimbach, “Molecular aspects of α-tocotrienol antioxidant action and cell signalling,” The Journal of Nutrition, vol. 131, no. 2, pp. 369S–373S, 2001.
- J. M. McCord, “The evolution of free radicals and oxidative stress,” The American Journal of Medicine, vol. 108, no. 8, pp. 652–659, 2000.
- D. Harman, “Aging: a theory based on free radical and radiation chemistry,” Journal of Gerontology, vol. 11, no. 3, pp. 298–300, 1956.
- B. Halliwell, “Free radicals, proteins and DNA: oxidative damage versus redox regulation,” Biochemical Society Transactions, vol. 24, no. 4, pp. 1023–1027, 1996.
- B. Halliwell, “How to characterize a biological antioxidant,” Free Radical Research Communications, vol. 9, no. 1, pp. 1–32, 1990.
- M. Narasimhan and N. S. Rajasekaran, “Reductive potential — a savior turns stressor in protein aggregation cardiomyopathy,” Biochimica et Biophysica Acta (BBA) - Molecular Basis of Disease, vol. 1852, no. 1, pp. 53–60, 2015.
- I. Perez-Torres, V. Guarner-Lans, and M. E. Rubio-Ruiz, “Reductive stress in inflammation-associated diseases and the pro-oxidant effect of antioxidant agents,” International Journal of Molecular Sciences, vol. 18, no. 10, p. 2098, 2017.
- N. D. Vaziri and B. Rodriguez-Iturbe, “Mechanisms of disease: oxidative stress and inflammation in the pathogenesis of hypertension,” Nature Clinical Practice Nephrology, vol. 2, no. 10, pp. 582–593, 2006.
- S. Moretti, S. Mrakic-Sposta, L. Roncoroni et al., “Oxidative stress as a biomarker for monitoring treated celiac disease,” Clinical and Translational Gastroenterology, vol. 9, no. 6, article e157, 2018.
- U. Forstermann, “Oxidative stress in vascular disease: causes, defense mechanisms and potential therapies,” Nature Clinical Practice Cardiovascular Medicine, vol. 5, no. 6, pp. 338–349, 2008.
- W. Li, S. Mital, C. Ojaimi, A. Csiszar, G. Kaley, and T. H. Hintze, “Premature death and age-related cardiac dysfunction in male eNOS-knockout mice,” Journal of Molecular and Cellular Cardiology, vol. 37, no. 3, pp. 671–680, 2004.
- M. Zanetti, G. G. Cappellari, I. Burekovic, R. Barazzoni, M. Stebel, and G. Guarnieri, “Caloric restriction improves endothelial dysfunction during vascular aging: effects on nitric oxide synthase isoforms and oxidative stress in rat aorta,” Experimental Gerontology, vol. 45, no. 11, pp. 848–855, 2010.
- A. Csiszar, N. Labinskyy, J. T. Pinto et al., “Resveratrol induces mitochondrial biogenesis in endothelial cells,” American Journal of Physiology-Heart and Circulatory Physiology, vol. 297, no. 1, pp. H13–H20, 2009.
- Y. Iuchi, F. Okada, K. Onuma et al., “Elevated oxidative stress in erythrocytes due to a SOD1 deficiency causes anaemia and triggers autoantibody production,” Biochemical Journal, vol. 402, no. 2, pp. 219–227, 2007.
- K. Aoshiba, K. Yasuda, S. Yasui, J. Tamaoki, and A. Nagai, “Serine proteases increase oxidative stress in lung cells,” American Journal of Physiology-Lung Cellular and Molecular Physiology, vol. 281, no. 3, pp. L556–L564, 2001.
- A. Loboda, M. Damulewicz, E. Pyza, A. Jozkowicz, and J. Dulak, “Role of Nrf2/HO-1 system in development, oxidative stress response and diseases: an evolutionarily conserved mechanism,” Cellular and Molecular Life Sciences, vol. 73, no. 17, pp. 3221–3247, 2016.
- S. W. Ryter, J. Alam, and A. M. K. Choi, “Heme oxygenase-1/carbon monoxide: from basic science to therapeutic applications,” Physiological Reviews, vol. 86, no. 2, pp. 583–650, 2006.
- A. Izzotti, A. Piana, G. Minniti, M. Vercelli, L. Perrone, and S. de Flora, “Survival of atherosclerotic patients as related to oxidative stress and gene polymorphisms,” Mutation Research/Fundamental and Molecular Mechanisms of Mutagenesis, vol. 621, no. 1-2, pp. 119–128, 2007.
- C. Gokkusu, B. Cakmakoglu, S. Dasdemir et al., “Association between genetic variants of DNA repair genes and coronary artery disease,” Genetic Testing and Molecular Biomarkers, vol. 17, no. 4, pp. 307–313, 2013.
- E. Dubois-Deruy, M. Cuvelliez, J. Fiedler et al., “MicroRNAs regulating superoxide dismutase 2 are new circulating biomarkers of heart failure,” Scientific Reports, vol. 7, no. 1, article 14747, 2017.
- Y. Wan, R. Cui, J. Gu et al., “Identification of four oxidative stress-responsive microRNAs, miR-34a-5p, miR-1915-3p, miR-638, and miR-150-3p, in Hepatocellular Carcinoma,” Oxidative Medicine and Cellular Longevity, vol. 2017, Article ID 5189138, 12 pages, 2017.
- Y. Yokoyama, N. Mise, Y. Suzuki et al., “MicroRNAs as potential mediators for cigarette smoking induced atherosclerosis,” International Journal of Molecular Sciences, vol. 19, no. 4, p. 1097, 2018.
- L. Ayaz and E. Dinc, “Evaluation of microRNA responses in ARPE-19 cells against the oxidative stress,” Cutaneous and Ocular Toxicology, vol. 37, no. 2, pp. 121–126, 2018.
- P. Berber, F. Grassmann, C. Kiel, and B. H. F. Weber, “An eye on age-related macular degeneration: the role of microRNAs in disease pathology,” Molecular Diagnosis & Therapy, vol. 21, no. 1, pp. 31–43, 2017.
- M. Fertin, B. Hennache, M. Hamon et al., “Usefulness of serial assessment of B-type natriuretic peptide, troponin I, and C-reactive protein to predict left ventricular remodeling after acute myocardial infarction (from the REVE-2 study),” The American Journal of Cardiology, vol. 106, no. 10, pp. 1410–1416, 2010.
- D. D. Li, B. W. Zhong, H. X. Zhang et al., “Inhibition of the oxidative stress-induced miR-23a protects the human retinal pigment epithelium (RPE) cells from apoptosis through the upregulation of glutaminase and glutamine uptake,” Molecular Biology Reports, vol. 43, no. 10, pp. 1079–1087, 2016.
- H. Lin, J. Qian, A. C. Castillo et al., “Effect of miR-23 on oxidant-induced injury in human retinal pigment epithelial cells,” Investigative Opthalmology & Visual Science, vol. 52, no. 9, pp. 6308–6314, 2011.
- T. Alam, M. Uludag, M. Essack et al., “FARNA: knowledgebase of inferred functions of non-coding RNA transcripts,” Nucleic Acids Research, vol. 45, no. 5, pp. 2838–2848, 2017.
- Y. Chen, C. Gao, Q. Sun et al., “MicroRNA-4639 is a regulator of DJ-1 expression and a potential early diagnostic marker for Parkinson’s disease,” Frontiers in Aging Neuroscience, vol. 9, p. 232, 2017.
- S. Shendelman, A. Jonason, C. Martinat, T. Leete, and A. Abeliovich, “DJ-1 is a redox-dependent molecular chaperone that inhibits α-synuclein aggregate formation,” PLoS Biology, vol. 2, no. 11, article e362, 2004.
- C. Liu, Y. Chen, I. E. Kochevar, and U. V. Jurkunas, “Decreased DJ-1 leads to impaired Nrf2-regulated antioxidant defense and increased UV-A–induced apoptosis in corneal endothelial cells,” Investigative Opthalmology & Visual Science, vol. 55, no. 9, pp. 5551–5560, 2014.
- F. Billia, L. Hauck, D. Grothe et al., “Parkinson-susceptibility gene DJ-1/PARK7 protects the murine heart from oxidative damage in vivo,” Proceedings of the National Academy of Sciences of the United States of America, vol. 110, no. 15, pp. 6085–6090, 2013.
- R. K. Dongworth, U. A. Mukherjee, A. R. Hall et al., “DJ-1 protects against cell death following acute cardiac ischemia–reperfusion injury,” Cell Death & Disease, vol. 5, no. 2, article e1082, 2014.
- Y. Shimizu, J. P. Lambert, C. K. Nicholson et al., “DJ-1 protects the heart against ischemia–reperfusion injury by regulating mitochondrial fission,” Journal of Molecular and Cellular Cardiology, vol. 97, pp. 56–66, 2016.
- H. Li, J. Fan, Z. Yin, F. Wang, C. Chen, and D. W. Wang, “Identification of cardiac-related circulating microRNA profile in human chronic heart failure,” Oncotarget, vol. 7, no. 1, pp. 33–45, 2016.
- J. C. de la Torre, “Is Alzheimer’s disease a neurodegenerative or a vascular disorder? Data, dogma, and dialectics,” The Lancet Neurology, vol. 3, no. 3, pp. 184–190, 2004.
- W. Xu, F. Li, Z. Liu et al., “MicroRNA-27b inhibition promotes Nrf2/ARE pathway activation and alleviates intracerebral hemorrhage-induced brain injury,” Oncotarget, vol. 8, no. 41, pp. 70669–70684, 2017.
- J. Wang, Y. Song, Y. Zhang et al., “Cardiomyocyte overexpression of miR-27b induces cardiac hypertrophy and dysfunction in mice,” Cell Research, vol. 22, no. 3, pp. 516–527, 2012.
- S. S. Signorelli, G. L. Volsi, A. Pitruzzella et al., “Circulating miR-130a, miR-27b, and miR-210 in patients with peripheral artery disease and their potential relationship with oxidative stress: a pilot study,” Angiology, vol. 67, no. 10, pp. 945–950, 2016.
- M. D. Paraskevopoulou, I. S. Vlachos, D. Karagkouni et al., “DIANA-LncBase v2: indexing microRNA targets on non-coding transcripts,” Nucleic Acids Research, vol. 44, no. D1, pp. D231–D238, 2016.
- L. Pan, W. Liang, M. Fu et al., “Exosomes-mediated transfer of long noncoding RNA ZFAS1 promotes gastric cancer progression,” Journal of Cancer Research and Clinical Oncology, vol. 143, no. 6, pp. 991–1004, 2017.
- X. Chen, Y. Cui, X. Xie, Y. Xing, Z. Yuan, and Y. Wei, “Functional role of miR-27b in the development of gastric cancer,” Molecular Medicine Reports, vol. 17, no. 4, pp. 5081–5087, 2018.
- D. W. Shin, B. Suh, Y. Park et al., “Risk of coronary heart disease and ischemic stroke incidence in gastric cancer survivors: a nationwide study in Korea,” Annals of Surgical Oncology, vol. 25, no. 11, pp. 3248–3256, 2018.
- M. G. Andreassi, “Non-coding RNA in cardiovascular disease: a general overview on microRNAs, long non-coding RNAs and circular RNAs,” Non-coding RNA Investigation, vol. 2, p. 63, 2018.
- Q. Lyu, Z. B. Zhang, S. J. Fu, L. L. Xiong, J. Liu, and T. H. Wang, “Microarray expression profile of lncRNAs and mRNAs in rats with traumatic brain injury after A2B5+ cell transplantation,” Cell Transplantation, vol. 26, no. 10, pp. 1622–1635, 2017.
Copyright © 2019 Magbubah Essack et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.