Table of Contents Author Guidelines Submit a Manuscript
BioMed Research International
Volume 2015 (2015), Article ID 621690, 10 pages
http://dx.doi.org/10.1155/2015/621690
Research Article

The Impact of Normalization Methods on RNA-Seq Data Analysis

1Department of Mathematical and Statistical Methods, Poznan University of Life Sciences, 60-637 Poznan, Poland
2Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
3Department of Hematology and Bone Marrow Transplantation, Poznan University of Medical Sciences, 60-569 Poznan, Poland

Received 20 March 2015; Revised 17 May 2015; Accepted 18 May 2015

Academic Editor: Ernesto Picardi

Copyright © 2015 J. Zyprych-Walczak et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. S. Zhao, W.-P. Fung-Leung, A. Bittner, K. Ngo, and X. Liu, “Comparison of RNA-Seq and microarray in transcriptome profiling of activated T cells,” PLoS ONE, vol. 9, no. 1, Article ID e78644, 2014. View at Publisher · View at Google Scholar · View at Scopus
  2. E. T. Wang, R. Sandberg, S. Luo et al., “Alternative isoform regulation in human tissue transcriptomes,” Nature, vol. 456, no. 7221, pp. 470–476, 2008. View at Publisher · View at Google Scholar · View at Scopus
  3. Q. Pan, O. Shai, L. J. Lee, B. J. Frey, and B. J. Blencowe, “Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing,” Nature Genetics, vol. 40, no. 12, pp. 1413–1415, 2008. View at Publisher · View at Google Scholar · View at Scopus
  4. W. M. Landau and P. Liu, “Dispersion estimation and its effect on test performance in RNA-seq data analysis: a simulation-based comparison of methods,” PLoS ONE, vol. 8, no. 12, Article ID e81415, 2013. View at Publisher · View at Google Scholar · View at Scopus
  5. A. Mortazavi, B. A. Williams, K. McCue, L. Schaeffer, and B. Wold, “Mapping and quantifying mammalian transcriptomes by RNA-Seq,” Nature Methods, vol. 5, no. 7, pp. 621–628, 2008. View at Publisher · View at Google Scholar · View at Scopus
  6. A. Oshlack and M. J. Wakefield, “Transcript length bias in RNA-seq data confounds systems biology,” Biology Direct, vol. 4, article 14, 2009. View at Publisher · View at Google Scholar · View at Scopus
  7. J. K. Pickrell, J. C. Marioni, A. A. Pai et al., “Understanding mechanisms underlying human gene expression variation with RNA sequencing,” Nature, vol. 464, no. 7289, pp. 768–772, 2010. View at Publisher · View at Google Scholar · View at Scopus
  8. D. Risso, K. Schwartz, G. Sherlock et al., “GC-content normalization for RNA-seq,” Tech. Rep. 291, Division of Biostatistics, University of California, Berkeley, 2011. View at Google Scholar
  9. J. T. Leek, R. B. Scharpf, H. C. Bravo et al., “Tackling the widespread and critical impact of batch effects in high-throughput data,” Nature Reviews Genetics, vol. 11, no. 10, pp. 733–739, 2010. View at Publisher · View at Google Scholar · View at Scopus
  10. J. H. Bullard, E. Purdom, K. D. Hansen, and S. Dudoit, “Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments,” BMC Bioinformatics, vol. 11, article 94, 2010. View at Publisher · View at Google Scholar · View at Scopus
  11. M. D. Robinson and A. Oshlack, “A scaling normalization method for differential expression analysis of RNA-seq data,” Genome Biology, vol. 11, no. 3, article r25, 2010. View at Publisher · View at Google Scholar · View at Scopus
  12. M. D. Robinson, D. J. McCarthy, and G. K. Smyth, “EdgeR: a bioconductor package for differential expression analysis of digital gene expression data,” Bioinformatics, vol. 26, no. 1, pp. 139–140, 2010. View at Publisher · View at Google Scholar · View at Scopus
  13. S. Anders and W. Huber, “Differential expression analysis for sequence count data,” Genome Biology, vol. 11, no. 10, article R106, 2010. View at Publisher · View at Google Scholar · View at Scopus
  14. N. Leng, J. Dawson, J. Thomson et al., “EBSeq: an empirical bayes hierarchical model for inference in RNA-seq experiments,” Tech. Rep. 226, University of Wisconsin, 2012. View at Google Scholar
  15. J. Li, D. M. Witten, I. M. Johnstone, and R. Tibshirani, “Normalization, testing, and false discovery rate estimation for RNA-sequencing data,” Biostatistics, vol. 13, no. 3, pp. 523–538, 2012. View at Publisher · View at Google Scholar · View at Scopus
  16. Y. W. Asmann, B. M. Necela, K. R. Kalari et al., “Detection of redundant fusion transcripts as biomarkers or disease-specific therapeutic targets in breast cancer,” Cancer Research, vol. 72, no. 8, pp. 1921–1928, 2012. View at Publisher · View at Google Scholar · View at Scopus
  17. V. G. Cheung, R. R. Nayak, I. X. Wang et al., “Polymorphic cis- and Trans-Regulation of human gene expression,” PLoS Biology, vol. 8, no. 9, Article ID e1000480, 2010. View at Publisher · View at Google Scholar · View at Scopus
  18. A. C. Frazee, B. Langmead, and J. T. Leek, “Recount: a multi-experiment resource of analysis-ready RNA-seq gene count datasets,” BMC Bioinformatics, vol. 12, article 449, 2011. View at Publisher · View at Google Scholar · View at Scopus
  19. C. Argyropoulos, A. A. Chatziioannou, G. Nikiforidis, A. Moustakas, G. Kollias, and V. Aidinis, “Operational criteria for selecting a cDNA microarray data normalization algorithm,” Oncology Reports, vol. 15, no. 4, pp. 983–996, 2006. View at Google Scholar · View at Scopus
  20. B. Uszczynska, J. Zyprych-Walczak, L. Handschuh et al., “Analysis of boutique arrays: a universal method for the selection of the optimal data normalization procedure,” International Journal of Molecular Medicine, vol. 32, no. 3, pp. 668–684, 2013. View at Publisher · View at Google Scholar · View at Scopus
  21. D. Chen, Z. Liu, X. Ma, and D. Hua, “Selecting genes by test statistics,” Journal of Biomedicine and Biotechnology, vol. 2005, no. 2, pp. 132–138, 2005. View at Publisher · View at Google Scholar · View at Scopus
  22. R Development Core Team, R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria, 2011.
  23. M. Goswami, N. Hensel, B. D. Smith et al., “Expression of putative targets of immunotherapy in acute myeloid leukemia and healthy tissues,” Leukemia, vol. 28, no. 5, pp. 1167–1170, 2014. View at Publisher · View at Google Scholar · View at Scopus
  24. E. Eisenberg and E. Y. Levanon, “Human housekeeping genes, revisited,” Trends in Genetics, vol. 29, no. 10, pp. 569–574, 2013. View at Publisher · View at Google Scholar · View at Scopus
  25. F. Seyednasrollah, A. Laiho, and L. L. Elo, “Comparison of software packages for detecting differential expression in RNA-seq studies,” Briefings in Bioinformatics, vol. 16, no. 1, pp. 59–70, 2015. View at Publisher · View at Google Scholar
  26. M.-A. Dillies, A. Rau, J. Aubert et al., “A comprehensive evaluation of normalization methods for Illumina high-throughput RNA sequencing data analysis,” Briefings in Bioinformatics, vol. 14, no. 6, pp. 671–683, 2013. View at Publisher · View at Google Scholar · View at Scopus
  27. E. Maza, P. Frasse, P. Senin, M. Bouzayen, and M. Zouine, “Comparison of normalization methods for differential gene expression analysis in RNA-Seq experiments: a matter of relative size of studied transcriptomes,” Communicative & Integrative Biology, vol. 6, no. 6, Article ID e25849, 2013. View at Publisher · View at Google Scholar · View at Scopus