Table of Contents Author Guidelines Submit a Manuscript
BioMed Research International
Volume 2015, Article ID 918954, 10 pages
http://dx.doi.org/10.1155/2015/918954
Research Article

-Profiles: A Nonlinear Clustering Method for Pattern Detection in High Dimensional Data

1Department of Mathematics and Computer Science, Emory University, Atlanta, GA 30322, USA
2School of Software Engineering, Tongji University, Shanghai 200092, China
3The Advanced Institute of Translational Medicine and Department of Gastroenterology, Shanghai Tenth People’s Hospital, Tongji University, Shanghai 200092, China
4Department of Biostatistics and Bioinformatics, Emory University, Atlanta, GA 30322, USA

Received 5 November 2014; Accepted 18 December 2014

Academic Editor: Fang-Xiang Wu

Copyright © 2015 Kai Wang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. J. Rung and A. Brazma, “Reuse of public genome-wide gene expression data,” Nature Reviews Genetics, vol. 14, no. 2, pp. 89–99, 2013. View at Publisher · View at Google Scholar · View at Scopus
  2. G. J. Patti, O. Yanes, and G. Siuzdak, “Innovation: metabolomics: the apogee of the omics trilogy,” Nature Reviews Molecular Cell Biology, vol. 13, no. 4, pp. 263–269, 2012. View at Publisher · View at Google Scholar · View at Scopus
  3. T. Yu, “An exploratory data analysis method to reveal modular latent structures in high-throughput data,” BMC Bioinformatics, vol. 11, article 440, 2010. View at Publisher · View at Google Scholar · View at Scopus
  4. Y. Zhao, J. Kang, and T. Yu, “A Bayesian nonparametric mixture model for selecting genes and gene subnetworks,” The Annals of Applied Statistics, vol. 8, no. 2, pp. 999–1021, 2014. View at Publisher · View at Google Scholar · View at MathSciNet
  5. A. K. H. Tung, X. Xu, and B. C. Ooi, “CURLER: finding and visualizing nonlinear correlation clusters,” in Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 467–478, ACM, New York, NY, USA, June 2005. View at Publisher · View at Google Scholar · View at Scopus
  6. D. Jiang, C. Tang, and A. Zhang, “Cluster analysis for gene expression data: a survey,” IEEE Transactions on Knowledge and Data Engineering, vol. 16, no. 11, pp. 1370–1386, 2004. View at Publisher · View at Google Scholar · View at Scopus
  7. T. Hastie, R. Tibshirani, and J. H. Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Springer Series in Statistics, Springer, New York, NY, USA, 2nd edition, 2009. View at MathSciNet
  8. G. Gan, C. Ma, and J. Wu, Data Clustering: Theory, Algorithms, and Applications, ASA-SIAM Series on Statistics and Applied Probability, SIAM American Statistical Association, Philadelphia, Pa, USA, 2007. View at Publisher · View at Google Scholar · View at MathSciNet
  9. K.-C. Li, C.-T. Liu, W. Sun, S. Yuan, and T. Yu, “A system for enhancing genome-wide coexpression dynamics study,” Proceedings of the National Academy of Sciences of the United States of America, vol. 101, no. 44, pp. 15561–15566, 2004. View at Publisher · View at Google Scholar · View at Scopus
  10. H. K. Solvang, O. C. Lingjærde, A. Frigessi, A.-L. Børresen-Dale, and V. N. Kristensen, “Linear and non-linear dependencies between copy number aberrations and mRNA expression reveal distinct molecular pathways in breast cancer,” BMC Bioinformatics, vol. 12, article 197, 2011. View at Publisher · View at Google Scholar · View at Scopus
  11. A. Jian, Z. Zhang, and E. Chang, “Adaptive non-linear clustering in data streams,” in Proceedings of the 15th ACM International Conference on Information and Knowledge Management (CIKM '06), pp. 122–131, Arlington, Va, USA, 2006. View at Publisher · View at Google Scholar
  12. M. Ehler, V. N. Rajapakse, B. R. Zeeberg et al., “Nonlinear gene cluster analysis with labeling for microarray gene expression data in organ development,” BMC Proceedings, vol. 5, supplement 2, article S3, 2011. View at Publisher · View at Google Scholar · View at Scopus
  13. B. Kegl, A. Krzyzak, T. Linder, and K. Zeger, “Learning and design of principal curves,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 22, no. 3, pp. 281–297, 2000. View at Publisher · View at Google Scholar
  14. P. Delicado and M. Smrekar, “Measuring non-linear dependence for two random variables distributed along a curve,” Statistics & Computing, vol. 19, no. 3, pp. 255–269, 2009. View at Publisher · View at Google Scholar · View at Scopus
  15. T. Yu and H. Peng, “Hierarchical clustering of high-throughput expression data based on general dependences,” IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 10, no. 4, pp. 1080–1085, 2013. View at Publisher · View at Google Scholar · View at Scopus
  16. T. Yu, H. Peng, and W. Sun, “Incorporating nonlinear relationships in microarray missing value imputation,” IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 8, no. 3, pp. 723–731, 2011. View at Publisher · View at Google Scholar · View at Scopus
  17. G. Gutin and A. P. Punnen, The Traveling Salesman Problem and Its Variations, Combinatorial Optimization, Kluwer Academic Publishers, Dordrecht, The Netherlands, 2002. View at MathSciNet
  18. P. Langfelder, B. Zhang, and S. Horvath, “Defining clusters from a hierarchical cluster tree: the dynamic tree cut package for R,” Bioinformatics, vol. 24, no. 5, pp. 719–720, 2008. View at Publisher · View at Google Scholar · View at Scopus
  19. D. Applegate, W. Cook, and A. Rohe, “Chained Lin-Kernighan for large traveling salesman problems,” INFORMS Journal on Computing, vol. 15, no. 1, pp. 82–92, 2003. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  20. L. Hubert and P. Arabie, “Comparing partitions,” Journal of Classification, vol. 2, no. 1, pp. 193–218, 1985. View at Publisher · View at Google Scholar · View at Scopus
  21. P. T. Spellman, G. Sherlock, M. Q. Zhang et al., “Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization,” Molecular Biology of the Cell, vol. 9, no. 12, pp. 3273–3297, 1998. View at Publisher · View at Google Scholar · View at Scopus
  22. K.-C. Li, M. Yan, and S. Yuan, “A simple statistical model for depicting the cdc15-synchronized yeast cell-cycle regulated gene expression data,” Statistica Sinica, vol. 12, no. 1, pp. 141–158, 2002. View at Google Scholar · View at MathSciNet · View at Scopus
  23. M. Ashburner, C. A. Ball, J. A. Blake et al., “Gene ontology: tool for the unification of biology. The Gene Ontology Consortium,” Nature Genetics, vol. 25, no. 1, pp. 25–29, 2000. View at Publisher · View at Google Scholar
  24. J. M. Cherry, E. L. Hong, C. Amundsen et al., “Saccharomyces Genome Database: the genomics resource of budding yeast,” Nucleic Acids Research, vol. 40, no. 1, pp. D700–D705, 2012. View at Publisher · View at Google Scholar · View at Scopus
  25. S. Falcon and R. Gentleman, “Using GOstats to test gene lists for GO term association,” Bioinformatics, vol. 23, no. 2, pp. 257–258, 2007. View at Publisher · View at Google Scholar · View at Scopus
  26. T. Yu and Y. Bai, “Improving gene expression data interpretation by finding latent factors that co-regulate gene modules with clinical factors,” BMC Genomics, vol. 12, article 563, 2011. View at Publisher · View at Google Scholar · View at Scopus
  27. T. Yu and Y. Bai, “Capturing changes in gene expression dynamics by gene set differential coordination analysis,” Genomics, vol. 98, no. 6, pp. 469–477, 2011. View at Publisher · View at Google Scholar · View at Scopus