International Journal of Proteomics

International Journal of Proteomics / 2013 / Article

Research Article | Open Access

Volume 2013 |Article ID 760208 |

Holger Husi, Janice B. Barr, Richard J. E. Skipworth, Nathan A. Stephens, Carolyn A. Greig, Henning Wackerhage, Rona Barron, Kenneth C. H. Fearon, James A. Ross, "The Human Urinary Proteome Fingerprint Database UPdb", International Journal of Proteomics, vol. 2013, Article ID 760208, 7 pages, 2013.

The Human Urinary Proteome Fingerprint Database UPdb

Academic Editor: Andrew J. Link
Received03 Jun 2013
Accepted29 Aug 2013
Published09 Oct 2013


The use of human urine as a diagnostic tool has many advantages, such as ease of sample acquisition and noninvasiveness. However, the discovery of novel biomarkers, as well as biomarker patterns, in urine is hindered mainly by a lack of comparable datasets. To fill this gap, we assembled a new urinary fingerprint database. Here, we report the establishment of a human urinary proteomic fingerprint database using urine from 200 individuals analysed by SELDI-TOF (surface enhanced laser desorption ionisation-time of flight) mass spectrometry (MS) on several chip surfaces (SEND, HP50, NP20, Q10, CM10, and IMAC30). The database currently lists 2490 unique peaks/ion species from 1172 nonredundant SELDI analyses in the mass range of 1500 to 150000. All unprocessed mass spectrometric scans are available as “.xml” data files. Additionally, 1384 peaks were included from external studies using CE (capillary electrophoresis)-MS, MALDI (matrix assisted laser desorption/ionisation), and CE-MALDI hybrids. We propose to use this platform as a global resource to share and exchange primary data derived from MS analyses in urinary research.

1. Introduction

Screening of human tissues and biofluids for disease biomarkers is an important task in healthcare and disease prevention but is often hindered by the complexity of the system studied, for example, plasma. A substantially less complex system such as urine, which contains approximately 3000 proteins [1, 2], would be a preferred medium to screen for protein or peptide biomarkers as sampling is both simple and noninvasive, and unrestricted quantities are obtainable. Urine is relatively stable in terms of protein/peptide composition and fragmentation state compared with other body fluids such as serum, where proteolytic degradation by endogenous proteases has been shown to occur during or after sample collection [3]. Several investigations have been published describing the urinary peptidome and proteome [4, 5], including biomarker discoveries for several disease processes [610]. These studies have used methodologies ranging from traditional 2D gel electrophoresis alone [11] or coupled with mass spectrometry (2-DE-MS) [12], immunohistochemistry [13], liquid chromatography mass spectrometry (LC-MS) [14], and surface enhanced laser desorption ionisation-time of flight mass spectrometry (SELDI-TOF-MS) [1517].

In complex disease processes, the identification of biomarkers is key to developing novel therapeutic target molecules. Identification of the most robust urinary biomarkers will be enhanced by collating and correlating data from other published and current studies. Currently there are a number of urinary databases available. The majority consists of lists of identified proteins derived from tryptic digests analysed by LC-MS/MS, such as MAPU [18] and Sys-BodyFluid [19] and does not cover naturally occurring mass-centric molecular entities. More recently, a urinary database, combining chromatographic reverse-phase retention times and m/z values, has been established [20]. The Mosaiques database [21, 22] consists of naturally occurring protein and peptide patterns detected by capillary electrophoresis MS (CE-MS) from more than 3600 individuals, covering mainly an m/z range of 800 to 3000. However, databases that give access to unprocessed data files are not available but would be the most useful resource with which to compare and validate novel datasets.

It is also prudent, especially in urinary proteome research, to remember that any peak in any MS scan profile might be derived from the same molecule (differing only in either its fragmentation or posttranslational modifications). This differentiation might be lost in an MS/MS screen, where proteolytic processing of the samples might alter the original protein/peptide signatures and intensities. Additionally, such fragmentation steps are also time consuming and decrease the sensitivity of the analysis. Other technologies such as ESI (electrospray ionization) methods require off-line fractionation and sample clean-up steps, which can be avoided using LC-MS as a platform. However, the limitation of the inline LC step, usually employing a reverse-phase resin as a solid matrix, narrows the general usability of this method. Alternatives which allow a suitable range of inline fractionation steps using various resins is SELDI, and a novel emerging alternative termed material-enhanced laser desorption/ionization (MELDI) [23, 24], where biomolecules are absorbed onto a solid phase resin and directly used for mass analysis using MALDI.

We chose the high-throughput SELDI-TOF-MS technology as our platform for biomarker pattern screening. The main advantages of the SELDI technology are its ease of use including little or no sample preparation, high reproducibility, high volume throughput in a minimum of time, with proven methodology over time for the numerous diseases studied, whereas MELDI might require further development before it can be generally applied. The main limitations of the SELDI technology lie with the instrumentation where poor resolution on older instruments led to difficult reproducibility and sometimes questionable results. However, we have chosen a more modern technology (see Section 2). A number of reviews list the issues and compares the various MS-based methods in urinary research [2527].

Utilizing data from both our own and published studies, we have established the urinary proteome fingerprint database UPdb, which will be publically available as a repository for SELDI-MS data and as a reference for scientists to probe the urinary proteome for proteins implicated in disease processes.

2. Materials and Methods

2.1. Urine Samples

Urine samples were obtained from 86 cancer patients, 93 noncancer controls, and 21 patients with a previous history of cancer but were diagnosed as cancer-free 6 to 18 months after resectional surgery. Summary participant demographics are shown in Table 1, and full details are provided as part of the database. The cancer sample urines were collected just prior to surgery. One-third of the cancer patients were diagnosed with pancreatic tumours, approximately one-third had oesophageal cancer, approximately one-sixth had malignancies of the oesophagogastric junction (OGJ), and approximately one-sixth suffered from gastric cancer. All procedures were approved by the local research ethics committee. Written informed consent was obtained. The study conformed to the standards set by the Declaration of Helsinki. All urine samples were stored at −40°C.

Control Cancer FollowupTotal OesophagusOGJPancreasGastricPancreas/DuodenumDuodenumSmall bowel

Average age6265686466626474605471
Male 726112145221216911
Female 2125955511261
Total 93862120027132815111


0.1 mL human urine was applied directly to preconditioned SELDI ProteinChip arrays (Bio-Rad Laboratories Inc.) (NP20, H50, SEND, Q10, CM10 and IMAC30), as recommended by the manufacturer, in a ProteinChip bioprocessor and incubated with 0.1 mL binding buffer where appropriate. The chip-spots were washed with 0.2 mL binding buffer three times and air-dried, followed by application of emitter matrix (alpha-cyano-4-hydroxycinnamic acid (CHCA) or sinapinic acid (SPA)). The arrays were read twice, one at low laser settings (focused on 100–50,000 Da m/z) and one at high laser settings (focused on 1000–200,000 Da m/z), on a ProteinChip Enterprise System PCS4000 (BioRad Laboratories Inc.), SELDI-TOF instrument, and spectral data collected over an average of 588 shots per spot using ProteinChip Data Manager software. Files were exported in “.xml” format. All spectra were processed using the expression difference mapping (EDM) wizard in the ProteinChip Data Manager software (BioRad Laboratories Inc.) with a signal-to-noise-ratio cutoff of 5%, 3% valley depth, and a cluster mass window of 0.2% m/z.

3. Results and Discussion

SELDI-MS analysis of human urine samples has been reported to show little intra- and interchip variation, as well as low intraindividual day-to-day variation [19] and has been established as a key emerging technology to discover new biomarkers for a variety of diseases. We chose to establish a repository for urinary SELDI data to be made available for the scientific community in order to enable an open exchange of research findings and data sharing.

We analysed the 200 urine specimens using the SELDI-MS platform on various chip types, ranging from small sized screens of 21 samples on NP20 and HP50 surfaces, medium-sized screens of 63 samples on SEND and Q10 surfaces, and full screens of all 200 samples on CM10 and IMAC30 chip-types (Table 2). The selection of the appropriate chip-surface for a screening purpose depends on many factors, such as peak intensities, distribution, and the number of clearly identifiable ion species (Figure 1). However, under certain conditions a nonoptimal chip type might resolve potential biomarkers and biomarker patterns better than another one. We chose to evaluate all commonly used chip surfaces.

Chip-typeChip specificityNumber of spectra recorded in the low mass range (1500–25000)Number of spectra recorded in the high mass range (20000–150000)Demographic distribution (number of patient samples analysed which are healthy/cancer pre-op/disease-free cancer post-op)Number of peaks above threshold in all samplesNumber of common peaks above threshold in 10% of all samplesNumber of common peaks above threshold in 20% of all samples

SENDReversed phase636320/43/02182514
NP20None (silicon oxide)21217/14/037116670
Q10Anion exchanger636320/43/039312062
CM10Cation exchanger20020093/86/21559202141
IMAC30Metal binding (Cu2+)20020093/86/21587280186

Both CM10 and IMAC30 (Cu2+) gave the best results in terms of signal intensities, peak resolution, and the number of observable peaks. A similar finding has been reported previously using a single urine specimen [16]. Figure 1 shows the SELDI-MS scans of two samples on the six surfaces tested. We also observed that urines from different individuals display a certain degree of heterogeneity, which is easily overcome by increasing the number of analysed samples. Using a 20% threshold for peaks commonly found in any sample, 31.7% of all molecules are present using the IMAC30 (Cu2+) chip-type, 25.2% using CM10, and 23.5% using HP50 surfaces. These low numbers are partially due to the various disease states and are higher by comparing samples from healthy control specimens.

Normalising on total ion count and aligning all spectra from individual chip-types resulted in the catalog of 2490 detected peaks, which are fully listed in the database (Figure 2). The database structure also allows the storage and retrieval of information relating to the MS environment, pre- and subfractionation methods, chromatography setups, studied diseases, and other data. Peak-specific data, such as identified biomarker, statistical information, and, if known, identified proteins, are provided. The database covers the mass range of 1500 to 150000 for SELDI spectra and consists of averaged and median m/z, intensities and measurement specific data. All 1172 spectra (raw data files) are available for download in “.xml” format from the PADB website at

Initial literature data mining led to the identification of 29 additional urinary datasets, which were incorporated into our database (Table 3). These sets are based on several MS platforms, ranging from SELDI and MALDI to CE-MS and CE-MALDI. The median mass of each individual MS technology, based on the identified peaks per technique, shows that both MALDI and CE-MS favor smaller compounds and peptides, whereas SELDI has an advantage in the higher mass range, albeit with a lower resolution of measured peaks. In total, the database covers a mass range of 800 to 200000 m/z or Da since most peaks using these technologies will have a charge of one. Currently, of these 3924 peaks, 39 are associated with identified proteins. This number should continue to rise over time. Additionally, the UPdb database is part of the Proteomic Analysis DataBase (PADB) initiative, and a full integration, as well as development of specific analysis and retrieval tools, is envisaged.

MS platformNumber of peaksMass range m/z Median m/z Disease areaNumber of identified proteinsNumber of external studies

SELDI27041500–19900018330Lupus nephritis, renal allograft nephropathy, cancer, nephritic syndrome, proteinuria, transplant rejection, systemic lupus erythematosus, diabetes, and radiocontrast exposure2716
MALDI451220–1140003212Cancer 63
CE-MS1125803–160002057Diabetes, IgA nephropathy, membranous glomerulonephritis, neonatal ureteropelvic junction obstruction, renal damage, renal disease, transplant rejection, and cancer69
CE-MALDI50890–61902000Rejection, sepsis, and transplant rejection01

4. Conclusions

UPdb is accessible and downloadable through the PADB initiative at http://www. This platform should be used as a global resource to share and exchange primary data derived from SELDI-, MALDI-, MELDI-, CE-, LC-, and other TOF-MS analyses in urinary research. We encourage other laboratories to contribute to UPdb by submitting high quality MS spectra from human urine samples. We envisage providing full linkage of the identified m/z species to the large-scale screening resource (LSSR) database (in preparation), which will list molecules identified by MS or other large-scale proteomic methods by their protein or gene names and will also contain a substantial database of identified peptide sequences relating to the proteins listed.


2-DE: 2 Dimensional electrophoresis
CE: Capillary electrophoresis
ESI: Electrospray ionisation
LC: Liquid chromatography
MALDI: Matrix assisted laser desorption/ionisation
MELDI: Material-enhanced laser desorption/ionization
MS: Mass spectroscopy
SELDI: Surface enhanced laser desorption ionisation
TOF: Time of flight.

Conflict of Interests

All authors declare that they have no competing interests.


  1. J. Adachi, C. Kumar, Y. Zhang, J. V. Olsen, and M. Mann, “The human urinary proteome contains more than 1500 proteins, including a large proportion of membrane proteins,” Genome Biology, vol. 7, no. 9, pp. R80.1–R80.16, 2006. View at: Publisher Site | Google Scholar
  2. H. Husi, N. A. Stephens, A. Cronshaw et al., “Proteomic analysis of urinary upper gastrointestinal cancer markers,” Proteomics, vol. 5, no. 5-6, pp. 289–299, 2011. View at: Publisher Site | Google Scholar
  3. D. M. Good, V. Thongboonkerd, J. Novak et al., “Body fluid proteomics for biomarker discovery: lessons from the past hold the key to success in the future,” Journal of Proteome Research, vol. 6, no. 12, pp. 4549–4555, 2007. View at: Publisher Site | Google Scholar
  4. T. Pisitkun, R.-F. Shen, and M. A. Knepper, “Identification and proteomic profiling of exosomes in human urine,” Proceedings of the National Academy of Sciences of the United States of America, vol. 101, no. 36, pp. 13368–13373, 2004. View at: Publisher Site | Google Scholar
  5. V. Thongboonkerd, K. R. McLeish, J. M. Arthur, and J. B. Klein, “Proteomic analysis of normal human urinary proteins isolated by acetone precipitation or ultracentrifugation,” Kidney International, vol. 62, no. 4, pp. 1461–1469, 2002. View at: Publisher Site | Google Scholar
  6. J. Wu, N. Wang, J. Wang et al., “Identification of a uromodulin fragment for diagnosis of IgA nephropathy,” Rapid Communications in Mass Spectrometry, vol. 24, no. 14, pp. 1971–1978, 2010. View at: Publisher Site | Google Scholar
  7. S. Schaub, D. Rush, J. Wilkins et al., “Proteomic-based detection of urine proteins associated with acute renal allograft rejection,” Journal of the American Society of Nephrology, vol. 15, no. 1, pp. 219–227, 2004. View at: Publisher Site | Google Scholar
  8. Z. K. Shihabi, J. C. Konen, and M. L. O'Connor, “Albuminuria vs urinary total protein for detecting chronic renal disorders,” Clinical Chemistry, vol. 37, no. 5, pp. 621–624, 1991. View at: Google Scholar
  9. J. S. Yudkin, R. D. Forrest, and C. A. Jackson, “Microalbuminuria as predictor of vascular disease in non-diabetic subjects. Islington Diabetes Survey,” The Lancet, vol. 2, no. 8610, pp. 530–533, 1988. View at: Google Scholar
  10. H. H.-Y. Ngai, W.-H. Sit, P.-P. Jiang, R.-J. Xu, J. M.-F. Wan, and V. Thongboonkerd, “Serial changes in urinary proteome profile of membranous nephropathy: implications for pathophysiology and biomarker discovery,” Journal of Proteome Research, vol. 5, no. 11, pp. 3038–3047, 2006. View at: Publisher Site | Google Scholar
  11. T. Marshall and K. Williams, “Two-dimensional electrophoresis of human urinary proteins following concentration by dye precipitation,” Electrophoresis, vol. 17, no. 7, pp. 1265–1272, 1996. View at: Publisher Site | Google Scholar
  12. R. Pieper, C. L. Gatlin, A. M. McGrath et al., “Characterization of the human urinary proteome: a method for high-resolution display of urinary proteins on two-dimensional electrophoresis gels with a yield of nearly 1400 distinct protein spots,” Proteomics, vol. 4, no. 4, pp. 1159–1174, 2004. View at: Publisher Site | Google Scholar
  13. M. R. Bueler, F. Wiederkehr, and D. J. Vonderschmitt, “Electrophoretic, chromatographic and immunological studies of human urinary proteins,” Electrophoresis, vol. 16, no. 1, pp. 124–134, 1995. View at: Publisher Site | Google Scholar
  14. C. Spahr, M. Davis, M. D. McGinley et al., “Towards defining the urinary proteome using liquid chromatography-tandem mass spectrometry I. Profiling an unfractionated tryptic digest,” Proteomics, vol. 1, no. 1, pp. 93–107, 2001. View at: Google Scholar
  15. P. A. Cadieux, D. T. Beiko, J. D. Watterson et al., “Surface-Enhanced Laser Desorption/Ionization-Time of Flight-Mass Spectrometry (SELDI-TOF-MS): a new proteomic urinary test for patients with urolithiasis,” Journal of Clinical Laboratory Analysis, vol. 18, no. 3, pp. 170–175, 2004. View at: Publisher Site | Google Scholar
  16. H. Roelofsen, G. Alvarez-Llamas, M. Schepers, K. Landman, and R. J. Vonk, “Proteomics profiling of urine with surface enhanced laser desorption/ionization time of flight mass spectrometry,” Proteome Science, vol. 5, article 2, 2007. View at: Publisher Site | Google Scholar
  17. K. J. A. Vanhoutte, C. Laarakkers, E. Marchiori et al., “Biomarker discovery with SELDI-TOF MS in human urine associated with early renal injury: evaluation with computational analytical tools,” Nephrology Dialysis Transplantation, vol. 22, no. 10, pp. 2932–2943, 2007. View at: Publisher Site | Google Scholar
  18. Y. Zhang, Y. Zhang, J. Adachi et al., “MAPU: max-planck unified database of organellar, cellular, tissue and body fluid proteomes,” Nucleic Acids Research, vol. 35, no. 1, pp. D771–D779, 2007. View at: Publisher Site | Google Scholar
  19. S.-J. Li, M. Peng, H. Li et al., “Sys-BodyFluid: a systematical database for human body fluid proteome research,” Nucleic Acids Research, vol. 37, no. 1, pp. D907–D912, 2009. View at: Publisher Site | Google Scholar
  20. I. A. Agron, D. M. Avtonomov, A. S. Kononikhin, I. A. Popov, S. A. Moshkovskii, and E. N. Nikolaev, “Accurate mass tag retention time database for urine proteome analysis by chromatography-mass spectrometry,” Biochemistry, vol. 75, no. 5, pp. 636–641, 2010. View at: Publisher Site | Google Scholar
  21. J. J. Coon, P. Zürbig, M. Dakna et al., “CE-MS analysis of the human urinary proteome for biomarker discovery and disease diagnostics,” Proteomics, vol. 2, no. 7-8, pp. 964–973, 2008. View at: Publisher Site | Google Scholar
  22. D. M. Good, P. Zürbig, À. Argilés et al., “Naturally occurring human urinary peptides for use in diagnosis of chronic kidney disease,” Molecular and Cellular Proteomics, vol. 9, no. 11, pp. 2424–2437, 2010. View at: Publisher Site | Google Scholar
  23. R. Bakry, M. Rainer, C. W. Huck, and G. K. Bonn, “Protein profiling for cancer biomarker discovery using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry and infrared imaging: a review,” Analytica Chimica Acta, vol. 690, no. 1, pp. 26–34, 2011. View at: Publisher Site | Google Scholar
  24. M. Rainer, C. Sajdik, and G. K. Bonn, “Mass spectrometric profiling of low-molecular-weight proteins,” Methods in Molecular Biology, vol. 1023, pp. 83–95, 2013. View at: Google Scholar
  25. L. Chen, S. Fatima, J. Peng, and X. Leng, “SELDI protein chip technology for the detection of serum biomarkers for liver disease,” Protein and Peptide Letters, vol. 16, no. 5, pp. 467–472, 2009. View at: Publisher Site | Google Scholar
  26. T. Gemoll, U. J. Roblick, G. Auer, H. Jörnvall, and J. K. Habermann, “SELDI-TOF serum proteomics and colorectal cancer: a current overview,” Archives of Physiology and Biochemistry, vol. 116, no. 4-5, pp. 188–196, 2010. View at: Publisher Site | Google Scholar
  27. A. K. Callesen, O. Mogensen, A. K. Jensen et al., “Reproducibility of mass spectrometry based protein profiles for diagnosis of ovarian cancer across clinical studies: a systematic review,” Journal of Proteomics, vol. 75, no. 10, pp. 2758–2772, 2012. View at: Publisher Site | Google Scholar

Copyright © 2013 Holger Husi et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

More related articles

 PDF Download Citation Citation
 Download other formatsMore
 Order printed copiesOrder

Related articles

Article of the Year Award: Outstanding research contributions of 2020, as selected by our Chief Editors. Read the winning articles.