Finding Groups in Gene Expression Data

Hand, David J.; Heard, Nicholas A.

doi:https://doi.org/10.1155/JBB.2005.215

BioMed Research International

On this page

Abstract Copyright Related Articles

Special Issue

Data Mining in Genomics and Proteomics

View this Special Issue

Review article | Open Access

Volume 2005 | Article ID 638324 | https://doi.org/10.1155/JBB.2005.215

Finding Groups in Gene Expression Data

David J. Hand¹and Nicholas A. Heard¹

Received11 Jun 2004

Revised24 Aug 2004

Accepted24 Aug 2004

Abstract

The vast potential of the genomic insight offered by microarray technologies has led to their widespread use since they were introduced a decade ago. Application areas include gene function discovery, disease diagnosis, and inferring regulatory networks. Microarray experiments enable large-scale, high-throughput investigations of gene activity and have thus provided the data analyst with a distinctive, high-dimensional field of study. Many questions in this field relate to finding subgroups of data profiles which are very similar. A popular type of exploratory tool for finding subgroups is cluster analysis, and many different flavors of algorithms have been used and indeed tailored for microarray data. Cluster analysis, however, implies a partitioning of the entire data set, and this does not always match the objective. Sometimes pattern discovery or bump hunting tools are more appropriate. This paper reviews these various tools for finding interesting subgroups.

Copyright

Copyright © 2005 Hindawi Publishing Corporation. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation Order printed copies

Views

297

Downloads

948

Citations