Novel Bioinformatics Approaches for Analysis of High-Throughput Biological DataView this Special Issue
Research Article | Open Access
Evolution of Network Biomarkers from Early to Late Stage Bladder Cancer Samples
We use a systems biology approach to construct protein-protein interaction networks (PPINs) for early and late stage bladder cancer. By comparing the networks of these two stages, we find that both networks showed very significantly different mechanisms. To obtain the differential network structures between cancer and noncancer PPINs, we constructed cancer PPIN and noncancer PPIN network structures for the two bladder cancer stages using microarray data from cancer cells and their adjacent noncancer cells, respectively. With their carcinogenesis relevance values (CRVs), we identified 152 and 50 significant proteins and their PPI networks (network markers) for early and late stage bladder cancer by statistical assessment. To investigate the evolution of network biomarkers in the carcinogenesis process, primary pathway analysis showed that the significant pathways of early stage bladder cancer are related to ordinary cancer mechanisms, while the ribosome pathway and spliceosome pathway are most important for late stage bladder cancer. Their only intersection is the ubiquitin mediated proteolysis pathway in the whole stage of bladder cancer. The evolution of network biomarkers from early to late stage can reveal the carcinogenesis of bladder cancer. The findings in this study are new clues specific to this study and give us a direction for targeted cancer therapy, and it should be validated in vivo or in vitro in the future.
Cancer is the leading cause of death worldwide and its etiology occurs at the DNA, RNA, or protein level. It is a very complex disease involving cascades of spatial and temporal changes in the genetic network and metabolic pathways . Various research studies have revealed that cancers are caused by multiple factors and intertwined events. Thus, in cancer therapy, it is important to dissect the diverse molecular mechanisms of cancer to identify potential cancers. Bladder cancer is amongst the 10 most common carcinomas in the USA, with 72,570 newly diagnosed cases, and it was the cause of 15,120 deaths in 2013 . In particular, Kaufman et al. pointed out to it as the second most common form of cancer in 2008 . In this study, we compared the early and late stages of bladder cancer to reveal additional mechanisms of bladder cancer development .
Biomarker discovery of various cancers is one of the key topic areas of cancer research. It can aid investigations into carcinogenesis and novel drug designs for cancer therapy. Several bioinformatics methods have been developed and applied to compare normal tissue with cancerous tissue to determine what cancer driving genes can act as cancer biomarkers [5–12].
Genes and proteins function cooperatively to regulate common biological cell processes by coregulating each other . Generally, molecular regulation and interaction proceed with time and vary in different tissues. There must exist great differences in these variations between cancer and normal tissue. Proteins mutually interact with each other in the cell, and they form the PPI networks (PPINs). Currently, a lot of the research has focused on the relationship between PPINs and cancer development. For example, analysis of the cancer-related PPINs of apoptosis has unraveled the molecular mechanisms of cancer, which has helped to identify potential novel drug targets . Our previous work  had successfully identified the network markers of lung cancer. In this study, we modified our previous method and applied the novel concept to study the evolution of network markers from early to late stage bladder cancer.
Based on their PPI information and the gene expression profiles from cancer and surrounding normal samples, two PPI networks with quantitative protein association abilities for each cancer stage (early stage and late stage) and the surrounding noncancerous tissue are constructed, respectively. For each stage, the network structure and protein association abilities of the cancer and noncancer PPI networks are then compared to obtain sets of significant proteins which play important roles in the carcinogenesis process of bladder cancer.
Recently, PPI targets seem to have become a paradigm for the drug discovery of cancer therapy and precision medicine . Unlike conventional drug design focusing on the inhibition of a single protein, usually an enzyme or receptor, small-molecule inhibition of direct PPIs that mediate many important biological processes is an emerging and challenging concept in drug design, especially for cancer. Extensive biological and clinical investigations have led to the identification of PPI hubs and nodes that have been critical for the acquisition and maintenance of characteristics for cell transformation in cancer. Such cancer-enabling PPIs will become promising therapeutic targets in anticancer strategies as the technologies in PPI modulator discovery and validating agents in the clinical setting advance in the future .
Therefore, future research directed at PPI target discovery, PPI interface characterization, and PPI-focused chemical libraries are expected to accelerate the development of the next generation of PPI-based anticancer agents. However, the PPI networks of cancer are very complex and quite differ between early and late stage cancer. In such circumstances, we will focus on the PPI network markers with their significant carcinogenesis relevance value (CRV) to exploit the important targets and their PPI interface for early and late stage cancer characterization. Then, we will not only gain insight into the crucial common pathways involved in bladder carcinogenesis, but we will also obtain a highly promising PPI target for bladder cancers. If we are then able to develop various combined anticancer strategies to target PPIs in the early and late stage network markers in the future, it may provide emerging opportunities for anticancer therapeutic approaches.
Chen et al. developed a dynamical network biomarker (DNB) that can serve as a general early warning signal to indicate an imminent bifurcation or sudden deterioration before the critical transition occurs; that means it can identify predisease state by time series microarray data. We use different approach from their methods by sample microarray data from bladder cancer patients of different stages. Our approach could also be extended to predict some similar results as their research. That is, in this study, we simply divided the cancer into early and late stages, but there are more stages of cancer, such as stages I, II, III, and IV. If we could observe the time evolution of the cancer biomarkers at these more different stages, we could also predict the predisease state by comparing it with these cancer biomarkers at different stages [16–18].
2. Materials and Methods
2.1. Overview of the Bladder Cancer Network Markers Construction Process
A flowchart representing the construction of network biomarkers for early and late stage bladder cancer is shown in Figure 1. We combined two data sources: microarray data of bladder cancer and noncancer samples from the GEO database, while the cancer samples were divided into two groups: early stage and late stage bladder cancer. The PPI database was required to construct the PPINs for bladder cancer. This data was used for PPI pool selection and the selected PPIs and the microarray data were then used for PPI network (PPIN) construction. Through regression modeling and the maximum likelihood parameter estimation method, a cancer PPIN (CPPIN) and a noncancer PPIN (NPPIN) was then obtained. The two constructed cancer and noncancer PPINs were compared to obtain the sets of significant proteins for bladder cancer based on the carcinogenesis relevance value (CRV) for each protein and the statistical assessment. The significant proteins and PPIs within these proteins were used to construct network markers at early and late stage bladder cancer.
2.2. Data Selection and Preprocessing
The microarray gene expression dataset of bladder cancer was obtained from the NCBI gene expression omnibus (GEO) . In this study, we chose GSE13507  and its corresponding platform GPL6012 as our research object. The same dataset contained the early and late stage bladder cancer and noncancer samples. We only used the data derived from nonprocessed primary biopsies to avoid the discrepancies in gene expression that are intrinsic to cell culture and fixation. Therefore, the dataset utilized contained primary tumor samples of both stages from patients and adjacent nontumor tissue samples from the same cancer patients, which could be considered as control samples. To describe the extent of a patient’s cancer, the cancers were classified into four stages according to their degree of invasion and migration using the TNM staging system, as defined by the American Joint Committee on Cancer (AJCC) and the International Union against Cancer (UICC). We then divided the cancer samples into two groups. In general, stages I and II described early stage cancers that have higher curability rates with medical treatment, while stages III and IV described the late stages. However, there were no corresponding noncancer samples in the surrounding area for each stage and we had only one group of surrounding noncancer samples (Table 1). We built CPPIN and NPPIN for both early and late stage bladder cancer in this study. We obtained 37 and 106 samples for the early and late stage cancer, respectively, and 58 noncancer samples. To avoid overfitting in network construction, the maximum degree of the proteins in the PPI network should be less than the cancer/noncancer sample number . In this dataset, we had a greater number of cancer and noncancer samples to overcome the sample size restriction on the size of the network. Prior to further analysis, the gene expression value, , was normalized to -transformed scores, , for each gene, i, and then the normalized expression value resulting had a mean and standard deviation over sample [11, 14].
Cases are grouped by cancer and surrounding normal tissues came from human patients of early stage and late bladder cancer.|
The PPI data for Homo sapiens were extracted from the Biological General Repository for Interaction Database (BioGRID, downloaded in October 2012). BioGRID is an open-access archive of genetic and protein interactions that are curated from the primary biomedical literature of all major model organisms. As of September 2012, BioGRID houses more than 500,000 manually annotated interactions from more than 30 model organisms . The above two databases were mined for bladder cancer and noncancer PPI networks using their corresponding microarray data. These early and late stage bladder cancer and noncancer PPI networks were then compared to obtain network markers.
2.3. Selection of Protein Pool and Identification of the Protein-Protein Interaction Networks (PPINs) for Cancerous and Noncancerous Cells
To integrate gene expression with PPI data to construct the corresponding CPPINs and NPPINs, we set up a protein pool containing differentially expressed proteins. The gene expression values were reasonably assumed to correlate with protein expression levels. We used one-way analysis of variance (ANOVA) to analyze the expression of each protein and select for proteins with differential expression levels. This method allowed determination of significant differences between cancer and noncancer datasets. The null hypothesis (Ho) was based on the assumption that the mean protein expression levels of cancer and noncancer sets are the same. Bonferroni adjustment , a type of multiple testing, was used to detect and correct proteins with discrepancy. Proteins with a value of less than 0.01 were included in the protein pool. However, if the proteins in the protein pool did not have PPI information, they were eliminated. In addition, proteins that were not already in the protein pool were included if their PPI information could determine that they had a tight relationship with proteins already in the pool. As a result, the protein pool contained proteins that had certain differences in expression levels and proteins that had tight relationships with the aforementioned proteins. In this case, the protein pool in bladder cancer consisted of 2,245 proteins in the early stage and 1,101 proteins in the late stage.
On the strength of the significant pool and PPI information, candidate PPI networks for early and late stage bladder cancer were constructed for bladder cancer and noncancer by linking the proteins that interacted with each other. In other words, the proteins that had PPI information through the pool were linked together, resulting in candidate PPI networks.
As the candidate PPIN included all possible PPIs under various environments, different organisms, and experimental conditions, the candidate PPIN needed to be further confirmed by microarray data to identify appropriate PPIs according to the biological processes that are relevant to cancer. To remove false positive PPIs from each candidate PPIN for different biological conditions, we used both a PPI model and a model order detection method to prune each candidate PPIN using the corresponding microarray data to approach the actual PPIN. Here, the PPIs of a target protein in the candidate PPIN can be depicted by the following protein association model: where represents the expression levels of the target protein for the sample ; represents the expression level of the th protein interacting with the target protein for the sample ; denotes the association interaction ability between the target protein and its th interactive protein; represents the number of proteins interacting with the target protein ; and represents the stochastic noise due to other factors or model uncertainty. The biological meaning of (1) is that the expression levels of the target protein are associated with the expression levels of the proteins interacting with it. Consequently, a protein association (interaction) model for each protein in the protein pool can be built as (1).
After constructing (1) for the PPI model of each protein in the candidate PPIN, we used the maximum likelihood estimation method  to identify the association parameters in (1) by microarray data as follows (see Supplementary Materials S.1 available online at http://dx.doi.org/10.1155/2014/159078): where is identified using microarray data in accordance with the maximum likelihood estimation method (see Supplementary Materials).
Once the association parameters for all proteins in the candidate PPI network were identified for each protein, the significant protein associations were determined using the interaction model order detection method based on the estimated association abilities. The Akaike information criterion (AIC)  and Student’s -test  were employed for both model order selection and significance determination of the protein associations in (see Supplementary Materials S.2).
2.4. Determination of Significant Proteins and Their Network Structures in the Carcinogenesis of Four Types of Cancers
After values were determined using the AIC order detection and Student’s -test, spurious false positive PPIs in (2) were pruned away and only the significant PPIs that remained were refined as follows: where denotes the number of significant PPIs of PPIN, with the target protein . In other words, a number of (or false positives) are pruned in the PPIs of target protein . One protein by one protein (i.e., for all proteins in the refined PPIN in (3)) results in the following refined PPIN: where the interaction matrix denotes the PPIs.
If there is no PPI between proteins and or it is pruned away by AIC order detection due to insignificance in the refined PPIN then . In general, , but if this is not the case, the larger one will be chosen as to avoid the situation where . The above PPIN construction method was employed to construct the refined PPINs for each stage of bladder cancer (early and late) and noncancer cells. The interaction matrices of the refined PPINs in (4) for cancer and noncancer cells of both the early and late stages of bladder cancer were constructed, respectively, as follows: where = early and late stage bladder cancer; and denote the interaction matrices of refined PPIN of the th cancer and noncancer, respectively; is the number of proteins in the refined PPIN. Therefore, the protein association model for CPPIN and NPPIN in the th stage bladder cancer and noncancer can be represented by the following equations according to (4) and (5): where = early and late stage bladder cancer; denote the vectors of expression levels; and and indicate the noise vectors of PPINs in the th cancer and noncancer cells, respectively.
The different matrix of the differential PPI network between CPPIN and NPPIN in the th cancer is defined as follows: where = early and late stage bladder cancer; denotes the protein association ability difference between CPPIN and NPPIN in the th stage bladder cancer; and the matrix indicates the difference in network structure between CPPIN and NPPIN in the th stage bladder cancer. In order to investigate carcinogenesis from the difference matrix between CPPIN and NPPIN of the th stage bladder cancer in (8), a score, which we named the carcinogenesis relevance value (CRV), was presented to quantify the correlation of each protein in with the significance of carcinogenesis as follows : where , and k = early and late stage bladder cancer.
The in (9) quantifies the differential extent of protein associations of the th protein (the absolute sum of the th row of in (8)) and the can differentiate CPPIN from NPPIN in the th stage bladder cancer. In other words, the in (9) could represent the network structure difference of the th protein between the cancer and noncancer networks in the th stage bladder cancer.
In order to investigate what proteins are more likely involved in the th stage bladder cancer, we needed to calculate the corresponding empirical value to determine the statistical significance of . To determine the observed value of each , we repeatedly permuted the network structure of the candidate PPIN of the th stage bladder cancer as a random network of the th stage bladder cancer. Each protein in the random network of the th stage bladder cancer will have its own CRV to generate a distribution of for = early and late stage bladder cancer. Although there was random disarrangement of the network structure, the linkages of each protein were maintained. In other words, the proteins with which a particular protein interacted were permuted without changing the total number of protein interactions. This procedure was repeated 100,000 times and the corresponding value was calculated as the fraction of random network structure in which the is at least as large as the CRV of the real network structure. According to the distributions of the of the random networks, the in (9) with a value of less than or equal to 0.01 was regarded as a significant CRV and the corresponding protein was determined to be a significant protein in the carcinogenesis of the th stage bladder cancer: a protein with a value greater than 0.01 was removed from the list of significant proteins in carcinogenesis (in other words, if the value of was greater than 0.01, then the th protein was removed from the in (9) and the remainder in the with values of CRVs less than 0.01 were considered significant proteins of the th stage bladder cancer).
Based on the value of the CRVs for all proteins () and the two stages of bladder cancer ( = early and late stage bladder cancer), we generated two lists of significant proteins for each of the two stages according to the CRV and the statistical assessment of each significant protein in in (9). We found 152 significant proteins in early stage bladder cancer and 50 significant proteins in late stage bladder cancer. These proteins showed significant changes between the CPPIN and NPPIN in the carcinogenic process according to their corresponding stage of cancer and we suspected that these changes might play important roles in the carcinogenesis process of bladder cancer. These findings warrant further investigation.
The intersections of these significant proteins in the early and late stages of bladder cancer and their PPIs are known as the core network markers appearing in all stages of bladder cancer. In contrast, the unique significant proteins and their PPIs in each stage of bladder cancers are known as the specific network markers for each stage of cancer. We found that there were 18 significant proteins that could be classified as a core network marker in the whole carcinogenesis process of bladder cancer. We also found 134 significant proteins in the specific network marker of early stage bladder cancer and 32 significant proteins in the specific network marker of late stage bladder cancer.
2.5. Pathway Analysis
Much valuable cellular information can be found in the known pathways, which are useful for describing most “normal” biological phenomena. All of these known pathways are the result of repeated testing and verification and the entire pathway network has given definitions for most links. Therefore, the proteins we identified to be significant in the above network markers were mapped onto the known pathway networks (e.g., the KEGG or PANTHER pathway) to investigate significant pathways with the network marker and to explore the relationships between these pathways and the carcinogenesis of bladder cancer. This approach supports the view that systems biology can help identify significant network biomarkers in both normal and cancerous pathways to their roles in the pathogenesis of cancer.
Together with comprehensive pathway databases such as the Kyoto Encyclopedia of Genes and Genomes (KEGG), we used a series of bioinformatics pathway analysis tools to identify biologically relevant pathway networks . KEGG includes manually curated biological pathways that cover three main categories: systems information (e.g., human diseases and drugs), genomics information (e.g., gene catalogs and sequence similarities), and chemical information (e.g., metabolites and biochemical reactions). At present, KEGG contains 134,511 distinct pathways generated from 391 original reference pathways . Therefore, to investigate the pathways involved in carcinogenesis, the bioinformatics database DAVID [27, 28], which generates automatic outputs of the results from KEGG pathway analysis , was used for the pathway analysis of significant proteins identified in network markers to determine their roles in the pathogenesis of early and late stage bladder cancer. Our methodology does not contain the pathway analysis and gene set enrichment analysis. To complete our research results, we used the NOA software to do the pathway analysis and gene set enrichment analysis on biological processes, cellular components, and molecular functions [19, 29].
2.6. The Contribution of Protein Interaction Network Will Affect the Results of Biomarkers and the Evolution of Network Biomarkers
Our cancer PPI model is constructed from the differential expression of cancer and noncancer microarray data and data mining of PPI information from BioGRID database. So, the early and late stage bladder cancer CPPINs (cancer PPI networks) and NPPINs (noncancer PPI networks) are the results of our systems biology model using the original microarray data and PPI databases. There are three key factors that will affect the final results.(i)The effect of different microarray data: we know that the microarray data has the shortage of irreproducible. That means even in the same case the microarray data does not promise to produce the same result as the previous ones. Also, for the same cancers, patients of different ethnics, different age, or different sex will give the different microarray data. This is the first factor to affect the final results.(ii)The effect of different original PPI databases: we know that PPI databases, such as BioGRID and MIPS, are constructed from putative and validated by wet-lab experiments. Due to the advances of many high-throughput experimental skills, the original PPI databases are evolved with time growing. The new updated original PPI databases are the second factor to affect the final results.(iii)The effect of systems biology model: microarray data, PPI databases, and PPI interaction model in (1) are employed to construct the PPI networks of normal and cancer cells by the maximum likelihood parameter estimation method (see Supplementary Material S.1). The AIC system order detection method (Supplementary Materials S.2) is employed to prune the false positive PPIs to obtain the real PPI networks of normal and cancer cells; that is, we use the so-called reverse engineering method to construct PPI networks of normal and cancer cells. Then the differential PPI network between cancer PPI network and normal PPI network is obtained in (8) to investigate PPI variations of each protein in the differential PPI network due to the carcinogenesis. Finally, the carcinogenesis value (CRV) based on PPI variations is also proposed to evaluate the significance of carcinogenesis for each protein of differential PPI network. Proteins with significant CRV ( value < 0.01) are considered as significant proteins of the cancer. The significant proteins in Table 3 are these significant proteins of early and late stage bladder cancers, and these proteins and their PPIs construct the interaction network in Figure 2. Finally, from the early to late stage bladder cancer network markers, we investigate the mechanism of carcinogenesis process with the help of databases (e.g., GO database, DAVID, and KEGG pathway database) and try to find multiple network target therapy of cancer. Unlike the conventional theoretical methods, which always give a single mathematical model for cancer network for a more detailed theoretical analysis, this study is to introduce a systems biology approach to cancer network markers based on real microarray data through the so-called reverse engineering, theoretical statistical method and data mining method in combination with big databases. These are the novelty and significance of our paper. Although we described the novelty of our systems biology model, we have validated our results by literature surveying in the research. In the future, our results will be validated by other researchers’ wet-lab experiments, and we will modify our mathematical model again and again. This is the third key factor to affect the results. Although not directly, it will also have the influence on protein interaction network.
We also know that the biosystems are evolved with time. It is obvious that the early stage and late stage patients have very different symptoms; they are the key features for us to classify early and late stage bladder cancers. Since the two stage bladder cancer patients have great different symptoms, it is undoubted that the microarray data of these two stage patients will show to be quite different. As described above, the protein expression from microarray data is one of the key factors of our systems biology model to give the final CPPINs and NPPINs. And the CPPINs and NPPINs give the final network biomarkers from our systems biology model. So, the most important thing for the network biomarkers evolving is due to the evolution of microarray data at both stages of bladder cancer, which is inherent in the exhibition of cancer-related genes due to DNA mutations in the carcinogenesis process.
3. Results and Discussion
3.1. Time Evolution of the Network Biomarker from Early to Late Stage Bladder Cancer
In the first instance, we built the CPPIN and NPPIN for early and late stage bladder cancer (Figure 2). From the differential networks between CPPIN and NPPIN of early stage and late stage bladder cancer, we then calculated the CRV of each protein in the network structure. Screening in accordance with the value of CRV, we determined the significant proteins of network markers for the two stages of bladder cancer. In the following, we will discuss the significant proteins identified in both stages and their intersection to reveal the carcinogenesis mechanisms from early to late stage bladder cancer.
3.2. Network Marker of Early and Late Stage Bladder Cancer
After value (0.01) screening, we found that there were 152 and 50 significant proteins for early and late stage bladder cancer, respectively. In addition, their corresponding CRV values ranged between 4.1 and 158.5 and 3.4–29.9, respectively. These significant proteins and their PPIs were used to construct the network markers at early and late stage bladder cancer. The intersection network marker of both stages was a core feature that contained 18 significant proteins in carcinogenesis. We listed the 18 significant proteins and their corresponding CRV and value in both stages of bladder cancer (Table 2). From this, we separately identified the 10 most significant proteins in early and late stage bladder cancer (Table 3). The full list of the 152 and 50 significant proteins for the two stages of bladder cancer is detailed in supplementary tables (Tables S1 and S2).
3.3. Pathway Analysis of Early Stage Bladder Cancer
We analyzed the pathway of early stage bladder cancer using the DAVID database. Our initial observation revealed that several cancer pathways were hit by the 152 key proteins, including 11 genes in hsa05200: pathways in cancer (Figure 3(a)), 7 genes involved in prostate cancer, 6 genes involved in chronic myeloid leukemia, 5 genes involved in small cell lung cancer, 4 genes involved in bladder cancer, and 3 genes involved in thyroid cancer, respectively (Table 3). The four genes of hsa05219 involved in bladder cancer (TP53, MDM2, RN1, and MYC) are principal genes altered in urothelial carcinoma, which is highly related to metastatic bladder cancer and are significant targets of metastatic bladder cancer therapies  (Figure 3(b)). Thus, we now note that the 152 candidate proteins are not only related to bladder cancer, but also to other cancers and chronic myeloid leukemia. This would mean that common mechanisms exist between the development of the different cancers in the early stage of carcinogenesis.
(a) The proteins in the early stage bladder cancer network marker are enriched in “hsa05200:Pathways in cancer” (Rank 2 in Table 4)
(b) The proteins in the early stage bladder cancer network marker are enriched in “hsa05219:Bladder cancer” (Rank 7 in Table 4)
(c) The proteins in the early stage bladder cancer network marker are enriched in “hsa04110:Cell cycle” (Rank 1 in Table 4)
(d) The proteins in the early stage bladder cancer network marker are enriched in “hsa04110:Wnt signaling pathway” (Rank 5 in Table 4)
(e) The proteins in the early stage bladder cancer network marker are enriched in “hsa04120:Ubiquitin mediated proteolysis pathway” (Rank 13 in Table 4)
Next, we proceeded to analyze the important pathways related to early stage bladder cancer (Table 4). Firstly, the cell cycle is composed of two consecutive periods (Figure 3(c)) characterized by DNA replication, sequential differentiation, and segregation of replicated chromosomes into two separate daughter cells. Both positive-acting and negative-acting proteins control the cells’ entry and advancement through the cell cycle, which is composed of four distinct phases: G1 (Gap 1), S (synthesis), G2 (Gap 2), and M (mitosis) . The G1 phase, where the cell grows in size, acts as a quality control check to determine whether the cell is ready to divide. The S phase is where the cell copies its DNA. The G2 phase involves cell checking as to whether all of its DNA has been correctly copied. The M phase is the cell division phase where the cell divides in two. Find out more about how cells prepare to divide and then share out their DNA and split in two. There are many reported discussions in regards to the cell cycle regulators and checkpoint functions involved in bladder cancer [32, 33]. Dysregulation of the cell cycle governs deviant cell proliferation in cancer. Losing the ability to control cell cycle checkpoints induces abnormal genetic instability. This may be due to the activation of tumorigenic mutations, which have been recognized in various tumors at different levels in the mitogenic signal transduction pathways: ligands and receptors (receptor mutations of HER2/neu [ErB2] or the amplification of the HER2 gene), downstream signal transduction networks (Raf/Ras/MAPK or PI3K-AKT-mTOR), and regulatory genes of the cell cycle (cyclin D1/CDK4, CDK6, and cyclin E/CDK2) . Increasing evidence convincingly implicates aberrant expression of cell cycle regulators in multiple cancers. Especially the restriction point (R) is the so-called G1 checkpoint. It separates the cell cycle into a mitogen-dependent phase and a growth factor-independent phase from the commitment to enter S phase. The G1 checkpoint commitment process integrates various and complex extracellular and intracellular signal transduction into the cell nucleus. Any malfunction of the G1 checkpoint may result in uncontrolled cell proliferation or genetic instability, possibly the origin of cancer or other diseases development .
The significant pathways via DAVID Bioinformatics database are selected for the 152 significant proteins in carcinogenesis. Black background indicates value > 0.05.|
|: number of genes in reference set.|
: number of genes in test set.
: number of genes annotated by given term in reference set.
: number of genes annotated by given term in test set.
The Wnt/β-catenin signaling pathways (Figure 3(d)) are composed of many functional networks, including a bundle of signaling pathways consisting of various proteins that transduce signals from the outside of a cell through the receptors on the cell surface and into the cell interior. They contribute significantly to the developmental process, particularly to direct cell attachment and proliferation. They are one of the most powerful signaling pathways and play critical roles in human development by controlling the genetic programs of embryonic development and adult homeostasis . Under normal conditions, the Wnt signaling pathway is critical for healthy and normal development, while in adult cells, a dysregulated Wnt signaling pathway can lead to tumorigenesis. For this purpose, cancer cells must have the ability to switch from quiescent mode to proliferation mode, as well as switching between cell proliferation and cell invasion modes. Therefore, the Wnt signaling pathway participates in each of the stages of malignant cancer development and clearly contributes to human tumor progression. Much research has been reported on the relationship between Wnt signaling pathways and urological cancers (including bladder cancer) [37, 38].
Other pathways identified in early stage bladder cancer, such as the Notch signaling pathway, adherens junctions, the TGF-β signaling pathway, ubiquitin-mediated proteolysis (Figures 3(e) and 4(c)), and the p53 signaling pathway are also associated with cancer [39–43].
(a) The proteins in the late stage bladder cancer network marker are enriched in “hsa03010:Ribosome” (Rank 1 in Table 5)
(b) The proteins in the late stage bladder cancer network marker are enriched in “hsa03040:Ribosome” (Rank 2 in Table 5)
(c) The proteins in the early stage bladder cancer network marker are enriched in “hsa04120:Ubiquitin mediated proteolysis pathway” (Rank 3 in Table 5)
The NOA analysis results of the pathway and gene enrichment analysis of the early stage bladder cancer is shown in Table 4(b): (1) Biological processes Cellular components Molecular functions. We saw that most of the biological processes are related to the metabolic processes. Second, about the cellular components, there are three of them related to the ribosome. Finally, about the molecular functions, there are RNA binding, heparin binding and cyclin binding, which are very different from the late stage bladder cancer.
3.4. Pathway Analysis of Late Stage Bladder Cancer
The most important results in this study as compared to our previous work are that we reveal related pathways of late stage bladder cancer in comparison to early stage cancer to reveal the evolution of network biomarkers in the carcinogenesis process. From Table 5, we observed that only three pathways, ribosome, spliceosome, and ubiquitin-mediated proteolysis pathways, were hit by the 50 candidate proteins identified in late stage bladder cancer. This is indicative of the evolution of cancer mechanisms from early stage bladder cancer.
The significant pathways via DAVID Bioinformatics database are selected for the 50 significant proteins in carcinogenesis.|
|: number of genes in reference set.|
: number of genes in test set.
: number of genes annotated by given term in reference set.
: number of genes annotated by given term in test set.
The nucleolus is the site of ribosome biogenesis (Figure 4(a)). Due to the higher concentration of both RNA and proteins in the nucleolus than in the nucleoplasm, the nucleolus is easily detected by microscopy in living cells. From electron microscopy images, three major components were constantly exhibited by mammalian cells. They include fibrillar centers (FCs), which appear as surrounding structures of various sizes, with a very low electron opacity; the dense fibrillar component (DFC), which always constitutes a rim intimately accompanied with the fibrillar centers, composed of densely packed fibrils; and the granular component (GC), which is composed of granules that surround the fibrillar components. There is evidence that changes in nucleolar morphology and function may depend on both the rate and status of ribosome biogenesis and on the proliferative activity of cycling cells . In cancer cells the upregulated ribosome biogenesis leads to an increased demand of ribosomal proteins for rRNA binding. In this way, after ribosome biogenesis alterations, cycling cells can activate the p53 pathway to ensure cell cycle arrest or alternatively to start the apoptotic program . According to our analysis, there were eight significant proteins in the late stage cancer to hit the ribosome pathway.
Alternative splicing is a modification of the premessenger RNA (pre-mRNA) transcript in which internal noncoding regions of pre-mRNA (introns) are removed and then the remaining segments (exons) are joined (Figure 4(b)). The formation of mature messenger RNA (mRNA) is subsequently capped at its 5′ end and polyadenylated at its 3′ end, and transported out of the nucleus to be translated into protein in the cytoplasm. Most genes use alternative splicing to generate multiple spliced transcripts. These transcripts contain various combinations of exons resulting from different mRNA variants and then are synthesized as protein isoforms. The exons are always around 50–250 base pairs, whereas introns could be as long as several thousands of base pairs. For nuclear encoded genes, splicing takes place within the nucleus after or simultaneously with transcription. Splicing is necessary for the eukaryotic messenger RNA (mRNA) before it can be translated into a correct protein. The spliceosome is a dynamic intracellular macromolecular complex of multiple proteins and ribonucleoproteins (snRNPs). For many eukaryotic introns, the spliceosome carries out the two main functions of alternative splicing. First, it recognizes the intron-exon boundaries and second it catalyzes the cut-and-paste reactions that remove introns and concatenate exons. The various spliceosomal machinery complex is formed from 5 ribonucleo-protein (RNP) subunits, termed uridine-rich (U-rich) small nuclear RNP (snRNP), transiently associated with more than 760 non-snRNPs splicing factors (RNA helicases, SR splicing factors, etc.) [46, 47]. Each spliceosomal snRNP (U1, U2, U4, U5, and U6) consists of a uridine-rich small nuclear RNA (snRNA) complexed with a set of seven proteins known as canonical Sm core or SNRP proteins. The seven Sm proteins (B/B′, D1, D2, D3, E, F, and G) form a core ring structure that surrounds the RNA. All Sm proteins contain a conserved sequence motif in two segments (Sm1 and Sm2) that are responsible for the assembly and ordering of the snRNAs. They form the Sm core of the spliceosomal snRNPs  and process the pre-mRNA . Spliceosomes not only catalyze splicing by a series of reactions, but they are also the main cellular machinery that guides splicing. Recently, scientists have found two natural compounds that can interfere with spliceosome function that also display anticancer activity in vitro and in vivo [50, 51]. Therefore, it is believable that inhibiting the spliceosome could act as a new target for anticancer drug development , and it should be validated in vivo or in vitro in the future.
The NOA analysis results of the pathway and gene enrichment analysis of the late stage bladder cancer is shown in Table 5(b): Biological processes Cellular components Molecular functions. We saw most of the biological processes are related to cell cycle, which are different from the metabolic processes of early stage. Second, about the cellular components, there are complex evolution behaviors of the network compared with the early stage bladder cancer; there is only one intersection of these two stages that is ribonucleoprotein complex. It gives us many clues to develop evolutionary strategies for cancer target therapy. Finally, about the molecular functions, there are enzyme binding, protein binding and nucleotide binding, which are very different from the early stage bladder cancer. All the evolutionary behaviors from early to late stage bladder cancer let us reveal more hidden carcinogenesis mechanism.
3.5. Pathway Analysis of Both Early and Late Stage Bladder Cancer
The only pathway to intersect between early and late stage bladder cancer is the ubiquitin-mediated proteolysis pathway (Table 6). This means it is the only housekeeping pathway for bladder cancer and that the mechanisms of early and late stage bladder cancer are completely different. We hypothesize that this may be a novel concept for target therapy. Various other researches have never built a model in accordance with the network markers at the different stages of cancer. Our results show that the network markers of early stage hit common mechanisms and fundamental pathways, such as cell cycle, cell proliferation, and Wnt signaling, among others, which are implicated in various cancers. These provide clues in that early stage bladder cancer is active in many related pathways and we can assume that it is an active process to change the cell. In contrast, in the late stage of bladder cancer, the cells were inactive and close to silence. This may mean that the cells are close to death. Should we attempt to save these cells, we should aim to focus on the ribosome and spliceosome pathways. Of course ubiquitin-mediated proteolysis pathways are both active in early and late stage cancer.
Bladder cancer is among the 10 most common forms of carcinoma in the USA and worldwide. It is a lethal disease like other cancers and understanding the carcinogenesis mechanism can help to develop new therapeutic strategy. Identifying the PPI interface to develop small molecule inhibitors has become a new direction for targeted cancer therapy. This study, which follows from our prior work, analyzes the carcinogenesis mechanism from early to late stage bladder cancer using a network-based biomarker evolution approach. Other research studies do not distinguish network markers between these two stages of bladder cancer. Thus, our approach is advantageous in that it can provide added insight into the significant network marker evolution of the carcinogenesis process of bladder cancer. The network markers and their related pathways identified in early stage bladder cancer are mostly related to ordinary cancer mechanisms, which just show a highly active state of the early stage and cannot reveal additional novel results. All of these results should be validated in vivo or in vitro in the future. However, from the two specific and significant pathways identified in late stage bladder cancer, ribosome pathway and spliceosome pathway, we identified a novel result, which has potential to become a target for cancer therapy. The only core pathway in these two stages is the ubiquitin-mediated proteolysis pathway, which is a significant cue of carcinogenesis from early to late stage bladder cancer. Applying our method to study more cancers and more classification groups (such as stage, age, ethics, and sex) will give us further insight into the various pathogenesis mechanisms.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
The authors are grateful for the support provided by the Ministry of Science and Technology (NSC-102-2745-E-007-001-ASP).
There are two parts of the supplementary materials. The first one: S.1 is the Parameter Identification of Regression Model in Equation (1) by Maximum Likelihood Method; and the second part: S.2 is the Determination of significant protein associations by AIC and Student’s t-test.
- T. N. Seyfried and L. M. Shelton, “Cancer as a metabolic disease,” Nutrition & Metabolism, vol. 7, article 7, 2010.
- D. S. Kaufman, W. U. Shipley, and A. S. Feldman, “Bladder cancer,” The Lancet, vol. 374, no. 9685, pp. 239–249, 2009.
- A. M. Donaldson, J. G. Gonzalez, M. K. B. Parmar, and N. Donaldson, “The MRC superficial bladder cancer trial of intravesical mytomicin-c after complete surgical resection. Sequential statistical methods applied to survival data from a randomised clinical trial,” International Journal of Surgery, vol. 7, no. 5, pp. 441–445, 2009.
- V. H. Cowling and M. D. Cole, “Mechanism of transcriptional activation by the Myc oncoproteins,” Seminars in Cancer Biology, vol. 16, no. 4, pp. 242–252, 2006.
- M. Pacal and R. Bremner, “Insights from animal models on the origins and progression of retinoblastoma,” Current Molecular Medicine, vol. 6, no. 7, pp. 759–781, 2006.
- J. N. Contessa, J. Hampton, G. Lammering et al., “Ionizing radiation activates Erb-B receptor dependent Akt and p70 S6 kinase signaling in carcinoma cells,” Oncogene, vol. 21, no. 25, pp. 4032–4041, 2002.
- A. M. Codegoni, M. I. Nicoletti, G. Buraggi et al., “Molecular characterisation of a panel of human ovarian carcinoma xenografts,” European Journal of Cancer, vol. 34, no. 9, pp. 1432–1438, 1998.
- T. R. Golub, D. K. Slonim, P. Tamayo et al., “Molecular classification of cancer: class discovery and class prediction by gene expression monitoring,” Science, vol. 286, no. 5439, pp. 531–527, 1999.
- H. Han, D. J. Bearss, L. W. Browne, and et al, “Identification of differentially expressed genes in pancreatic cancer cells using cDNA microarray,” Cancer Research, vol. 62, pp. 2890–2896, 2002.
- K.-Q. Liu, Z.-P. Liu, J.-K. Hao, L. Chen, and X.-M. Zhao, “Identifying dysregulated pathways in cancers from pathway interaction networks,” BMC Bioinformatics, vol. 13, no. 1, article 126, 2012.
- H. Uramoto, K. Sugio, T. Oyama et al., “Expression of the p53 family in lung cancer,” Anticancer Research, vol. 26, no. 3, pp. 1785–1790, 2006.
- E. A. Horvat, J. D. Zhang, U. Stefan, O. Sahin, and K. A. .Zweig, “A network-based method to assess the statistical significance of mild co-regulation effects,” PLoS ONE, vol. 8, Article ID e73413, 2013.
- Y.-C. Wang and B.-S. Chen, “A network-based biomarker approach for molecular investigation and diagnosis of lung cancer,” BMC Medical Genomics, vol. 4, article 2, 2011.
- A. A. Ivanov, F. R. Khuri, and H. Fu, “Targeting protein-protein interactions as an anticancer strategy,” Trends in Pharmacological Sciences, vol. 34, no. 7, pp. 393–400, 2013.
- L. Chen, R. Liu, Z.-P. Liu, M. Li, and K. Aihara, “Detecting early-warning signals for sudden deterioration of complex diseases by dynamical network biomarkers,” Scientific Reports, vol. 2, article 342, 2012.
- R. Liu, X. Wang, K. Aihara, and L. Chen, “Early diagnosis of complex diseases by molecular biomarkers, network biomarkers, and dynamical network biomarkers,” Medicinal Research Reviews, vol. 34, pp. 455–478, 2014.
- R. Liu, X. Yu, X. Liu, D. Xu, K. Aihara, and L. Chen, “Identifying critical transitions of complex diseases based on a single sample,” Bioinformatics, vol. 30, no. 11, pp. 1579–1586, 2014.
- C. Zhang, J. Wang, K. Hanspers, D. Xu, L. Chen, and A. R. Pico, “NOA: a cytoscape plugin for network ontology analysis,” Bioinformatics, vol. 29, no. 16, pp. 2066–2067, 2013.
- W.-J. Kim, E.-J. Kim, S.-K. Kim et al., “Predictive value of progression-related gene classifier in primary non-muscle invasive bladder cancer,” Molecular Cancer, vol. 9, article 3, 2010.
- A. Chatr-Aryamontri, B.-J. Breitkreutz, S. Heinicke et al., “The BioGRID interaction database: 2013 update,” Nucleic Acids Research, vol. 41, no. 1, pp. D816–D823, 2013.
- J. M. Bland and D. G. Altman, “Multiple significance tests: the Bonferroni method,” British Medical Journal, vol. 310, article 170, no. 6973, 1995.
- R. Johansson, System Modeling and Identification, Prentice Hall, 1993.
- M. Pagano and K. Gauvreau, Principles of Biostatistics, 2000.
- M. Kanehisa, “Molecular network analysis of diseases and drugs in KEGG,” Methods in Molecular Biology, vol. 939, pp. 263–275, 2013.
- J.-I. Satoh, “Molecular network of microRNA targets in Alzheimer's disease brains,” Experimental Neurology, vol. 235, no. 2, pp. 436–446, 2012.
- D. W. Huang, B. T. Sherman, and R. A. Lempicki, “Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources,” Nature Protocols, vol. 4, no. 1, pp. 44–57, 2009.
- D. W. Huang, B. T. Sherman, and R. A. Lempicki, “Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists,” Nucleic Acids Research, vol. 37, no. 1, pp. 1–13, 2009.
- J. Wang, Q. Huang, Z.-P. Liu et al., “NOA: a novel Network Ontology Analysis method,” Nucleic Acids Research, vol. 39, no. 13, p. e87, 2011.
- M. Fassan, E. J. Trabulsi, L. G. Gomella, and R. Baffa, “Targeted therapies in the management of metastatic bladder cancer,” Biologics, vol. 1, no. 4, pp. 393–406, 2007.
- J. Camacho, Molecular Ontology: Principles and Recent Advances, 2012.
- P. Korkolopoulou, P. Christodoulou, A.-E. Konstantinidou, E. Thomas-Tsagli, P. Kapralos, and P. Davaris, “Cell cycle regulators in bladder cancer: a multivariate survival study with emphasis on p27Kip1,” Human Pathology, vol. 31, no. 6, pp. 751–760, 2000.
- S. C. Doherty, S. R. McKeown, C. Stephen Downes et al., “Cell cycle checkpoint function in bladder cancer,” Journal of the National Cancer Institute, vol. 95, no. 24, pp. 1859–1868, 2003.
- G. H. Williams and K. Stoeber, “The cell cycle and cancer,” Journal of Pathology, vol. 226, no. 2, pp. 352–364, 2012.
- J. Lukas, J. Bartkova, and J. Bartek, “Convergence of mitogenic signalling cascades from diverse classes of receptors at the cyclin D-cyclin-dependent kinase-pRb-controlled G1 checkpoint,” Molecular & Cellular Biology, vol. 16, no. 12, pp. 6917–6925, 1996.
- T. Grigoryan, P. Wend, A. Klaus, and W. Birchmeier, “Deciphering the function of canonical Wnt signals in development and disease: conditional loss- and gain-of-function mutations of β-catenin in mice,” Genes and Development, vol. 22, no. 17, pp. 2308–2341, 2008.
- S. Majid, S. Saini, and R. Dahiya, “Wnt signaling pathways in urological cancers: past decades and still growing,” Molecular Cancer, vol. 11, article 7, 2012.
- I. Ahmad, The role of Wnt signalling in urothelial cell carcinoma [Ph.D. thesis], University of Glasgow, 2011.
- H. Lodish, Molecular Cell Biology, W.H. Freeman, 7th edition, 2013.
- E. J. Allenspach, I. Maillard, J. C. Aster, and W. S. Pear, “Notch signaling in cancer,” Cancer Biology and Therapy, vol. 1, pp. 466–476, 2002.
- V. Bolos, J. Grego-Bessa, and J. L. de la Pompa, “Notch signaling in development and cancer,” Endocrine Reviews, vol. 28, no. 3, pp. 339–363, 2007.
- J. Schneikert and J. Behrens, “The canonical Wnt signalling pathway and its APC partner in colon cancer development,” Gut, vol. 56, no. 3, pp. 417–425, 2007.
- R. Derynck, R. J. Akhurst, and A. Balmain, “TGF-β signaling in tumor suppression and cancer progression (Nature Genetics (2001) 29 (117–129)),” Nature Genetics, vol. 29, no. 3, p. 351, 2001.
- L. Montanaro, D. Treré, and M. Derenzini, “Nucleolus, ribosomes, and cancer,” The American Journal of Pathology, vol. 173, no. 2, pp. 301–310, 2008.
- L. Montanaro, D. Treré, and M. Derenzini, “Changes in ribosome biogenesis may induce cancer by down-regulating the cell tumor suppressor potential,” Biochimica et Biophysica Acta—Reviews on Cancer, vol. 1825, no. 1, pp. 101–110, 2012.
- M. S. Jurica and M. J. Moore, “Pre-mRNA splicing: awash in a sea of proteins,” Molecular Cell, vol. 12, no. 1, pp. 5–14, 2003.
- Z. Zhou, L. J. Licklider, S. P. Gygi, and R. Reed, “Comprehensive proteomic analysis of the human spliceosome,” Nature, vol. 419, no. 6903, pp. 182–185, 2002.
- R. S. Pillai, M. Grimmler, G. Meister et al., “Unique Sm core structure of U7 snRNPs: assembly by a specialized SMN complex and the role of a new component, Lsm11, in histone RNA processing,” Genes and Development, vol. 17, no. 18, pp. 2321–2333, 2003.
- L. Agranat-Tamir, N. Shomron, J. Sperling, and R. Sperling, “Interplay between pre-mRNA splicing and microRNA biogenesis within the supraspliceosome,” Nucleic Acids Research, vol. 42, no. 7, pp. 4640–4651, 2014.
- D. Kaida, H. Motoyoshi, E. Tashiro et al., “Spliceostatin A targets SF3b and inhibits both splicing and nuclear retention of pre-mRNA,” Nature Chemical Biology, vol. 3, no. 9, pp. 576–583, 2007.
- Y. Kotake, K. Sagane, T. Owa et al., “Splicing factor SF3b as a target of the antitumor natural product pladienolide,” Nature Chemical Biology, vol. 3, no. 9, pp. 570–575, 2007.
- R. J. van Alphen, E. A. C. Wiemer, H. Burger, and F. A. L. M. Eskens, “The spliceosome as target for anticancer treatment,” British Journal of Cancer, vol. 100, no. 2, pp. 228–232, 2009.
Copyright © 2014 Yung-Hao Wong et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.