Table of Contents Author Guidelines Submit a Manuscript
International Journal of Genomics
Volume 2017, Article ID 6202567, 14 pages
Research Article

Common Expression Quantitative Trait Loci Shared by Histone Genes

Department of Bioinformatics and Life Science, Soongsil University, Seoul, Republic of Korea

Correspondence should be addressed to Chaeyoung Lee;

Received 29 May 2017; Revised 26 July 2017; Accepted 2 August 2017; Published 27 August 2017

Academic Editor: Marco Gerdol

Copyright © 2017 Hanseol Kim et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


A genome-wide association study (GWAS) was conducted to examine expression quantitative trait loci (eQTLs) for histone genes. We examined common eQTLs for multiple histone genes in 373 European lymphoblastoid cell lines (LCLs). A linear regression model was employed to identify single-nucleotide polymorphisms (SNPs) associated with expression of the histone genes, and the number of eQTLs was determined by linkage disequilibrium analysis. Additional associations of the identified eQTLs with other genes were also examined. We identified 31 eQTLs for 29 histone genes through genome-wide analysis using 29 histone genes (). Among them, 12 eQTLs were associated with the expression of multiple histone genes. Transcriptome-wide association analysis using the identified eQTLs showed their associations with additional 80 genes (). In particular, expression of RPPH1, SCARNA2, and SCARNA7 genes was associated with 26, 25, and 23 eQTLs, respectively. This study suggests that histone genes shared 12 common eQTLs that might regulate cell cycle-dependent transcription of histone and other genes. Further investigations are needed to elucidate the transcriptional mechanisms of these genes.

1. Introduction

Histone mRNA transcripts and proteins are important for packing DNA into chromatin and are thus tightly regulated in most human cells [1]. In humans, the genes encoding histones are gathered on chromosomes 1 and 6. It has been suspected that the clustered structure of genes can provide a manageable unit for coordinating transcription [1]. Recently, genome-wide chromatin interaction analysis with paired-end-tag sequencing (ChIA-PET) has shown that some histone genes can share promoters [2].

While many efforts have been made to understand the mechanisms for the transcription of histone genes, they have not yet been well defined. Nuclear protein of the ataxia-telangiectasia-mutated locus (NPAT), which promotes the transcription of histone genes, is located near the Cajal body [1]. Clusters of histone genes are also located near the Cajal body [3]. The positions of histone gene clusters near the Cajal body have been observed between the restriction point (R-point) and the G1/S transition (S-point) during the cell cycle [4]. The objective of this study was to select simultaneously expressed histone genes, identify their expression quantitative trait loci (eQTLs), and examine the functions of those eQTLs.

2. Material and Methods

2.1. Subjects and Data

The subjects of this study were 373 Europeans including 95 Finnish in Finland, 94 British in England and Scotland, 93 Tuscans from Italy, and 91 Utahn residents with Northern and Western European ancestry from the CEPH collection. Their genotypic data were derived from the phase 1 dataset produced by the 1000 Genomes Project [5] ( This study utilized genotypic data at 5,796,145 SNPs after filtering out the SNPs with minor allele frequency < 0.05, with missing rate > 0.05, or in Hardy-Weinberg disequilibrium with .

Transcriptional data on 10,518 human genes were obtained in lymphoblastoid cells of the subjects by the Geuvadis RNA sequencing project ( The unit used for the mRNA expression level was reads per kilobase per million mapped reads (RPKM). Outliers were removed based on sample similarity, which was estimated by the Spearman rank correlation between RPKMs and the exon counts of the samples [6]. Sample swaps or contaminated samples were excluded based on allele-specific expression analysis [6]. For details on the quality control process, see t Hoen et al. [7].

2.2. Statistical Methods

We selected histone genes that were expressed simultaneously. Pairwise gene expression relationships were estimated using Pearson’s correlation coefficient (r). The significance of the correlation was determined by .

We investigated genome-wide associations of the expression of the selected histone genes. A regression model was employed to identify SNPs associated with expressions of histone genes using PLINK [8]. The Bonferroni correction was applied as a multiple testing, and the significance was determined by .

Linkage disequilibrium (LD) between the identified SNPs was estimated using the HaploView program [9]. The LD block was determined according to the 95% confidence interval of the value for pairwise LD between the nucleotide variants with minor allele frequency > 0.05 [10].

The identified eQTLs were further analyzed for their associations with the expression of nonhistone genes throughout the genome. The Bonferroni multiple testing based on t-statistic was also applied with a significance threshold value of .

The functions of identified SNPs were examined using the Ensembl Variant Effect Predictor program [11] and RegulomeDB [12] (e.g., the motif of DNA footprinting assay, chromatin structure by DNA-seq, and protein binding by ChIP-seq).

3. Results

We observed numerous correlations amid the expression of the histone genes investigated in the current study (Figure 1). In particular, the expression of 29 genes showed correlations significantly (). Genome-wide association analysis showed that 74 SNPs were associated with the expression of the 29 histone genes (; Table 1). Among them, 26 SNPs were simultaneously associated with the expression of multiple histone genes, and 5 out of 26 SNPs were associated with the expression of more than 10 histone genes (Figure 2). Thirty-one LD blocks were constructed covering the identified SNPs (Figure 3). The eQTLs corresponded to functional sites provided by various functional search sites. The rs79335804 had the most probable function with a RegulomeDB score of 2b (Table S available online at

Figure 1: Pearson’s correlation (r) between expressions of histone genes using the ellipse visualization method. The correlation estimate without significance () is marked with “X.”
Table 1: Genome-wide associations of SNPs with expression of histone genes.
Figure 2: Number of histone genes with expression associated with SNPs. The figure in parentheses following the SNPs indicates the chromosomal number.
Figure 3: Linkage disequilibrium (LD) blocks of SNPs associated with the expression of histone genes. (a) 4 LD blocks in chromosome 2 (48891009-48924921, 141415941-141454893, 141710041-141714131, and 205737518-205744540), (b) 1 LD block in chromosome 5 (14080602-14101222), and (c) 1 LD block in chromosome 11 (1062030-1064929). No LD blocks were observed in chromosomes 6 and 7.

Transcriptome-wide association analysis revealed 80 additional genes associated with the 31 identified eQTLs (; Table 2). The genes encoding ribonuclease P RNA component H1 (RPPH1) and some small Cajal body-specific RNAs (scaRNAs), in particular, were associated with more than half of the eQTLs (>15 eQTLs; Table 2).

Table 2: Associations of the identified eQTLs with expression of histone and nonhistone genes all over the human genome.

4. Discussion

We analyzed the eQTLs for simultaneously expressed histone genes. We found significant correlations amid the expression of 29 histone genes, which were all clustered in chromosome 1 or 6. This clustered structure of the genes may serve to control simultaneous transcription, and this is supported by the observation that the expression of other histone genes not located on chromosome 1 or 6, including H1FX and H2A family members, was not correlated with those of the 29 selected genes. Furthermore, correlation estimates showed two subgroups nested within the large group (one with 21 genes and the other with 10 genes; with strong correlation coefficients of ), which likely provide a manageable unit for coordinating transcription.

The genome-wide eQTL analysis revealed that 12 loci were associated with the expression of multiple histone genes. The eQTLs were located on chromosomes 2, 7, and 11. Since 29 histone genes were all located on chromosome 1 or 6, we suspect that the identified eQTLs were transacting. This suggests that many histone genes are simultaneously transcribed by remote regulators.

Functional analysis of the identified eQTLs suggests that they are very important for transcription. For example, rs79335804, an SNP within an eQTL on chromosome 2, was the binding motif for Kruppel-like factor 4 (Klf4) protein in various cells including LCLs. Klf4 was associated with chromosomal aberrations and can prevent cell proliferation by acting as a transcription factor [13]. The aberrant chromatin formation could be caused by overproduction of a histone dimer set (H2A-H2B or H3-H4) [14]. Thus, we suspect that there is an association between the chromosomal aberrations by Klf4 and histone gene mRNA expression. Rs849578 within another eQTL on chromosome 2 was associated with autism in the Chinese Han population [15]. It is located in an intron of neuropilin 2 (NRP2) which may be an effector of apoptosis, proliferation, and neuronal development [16]. Histones are known to be related to developmental regulation [17], but additional study is required to elucidate underlying mechanism of the relationship between histones and NRP2.

Transcriptome-wide association analysis revealed that many nonhistone genes were also associated with the identified eQTLs. In particular, some genes were associated with 23 or more eQTLs. One was RPPH1, an RNA component of RNase P, which may assist in the cell cycle-dependent transcription of ribosomal RNAs (rRNAs) by associating with chromatin [18]. The expression of rRNAs increased from G1 to S and peaked at G2 [18]. The transcription of histone genes rapidly increased before the S phase of the cell cycle and decreased shortly thereafter [1]. Thus, many eQLTs identified in this study might be involved in the cell cycle-dependent expression of both RPPH1 and histone genes. Such a regulation of the eQTLs would be one of the key factors to solve their underlying mechanisms. The others identified with many eQTLs were the genes encoding scaRNAs (SCARNA2 and SCARNA7) located in the Cajal body, similar to the pre-mRNAs of histones, which move to the Cajal body for mRNA processing [3, 19].

Interestingly, many genes controlled by the same eQTLs as those for histones do not have polyadenylated structures [20]. In particular, the genes associated with more than 10 eQTLs were all nonpolyadenylated. They were snoRNAs, scaRNAs, and RPPH1. Considering that histones are also nonpolyadenylated, this may help us to understand the transcriptional regulation of histone genes by these eQTLs.

The expressions of histone genes play an important role in controlling chromatin accessibility [21]. Improper expression of histone genes has been associated with tumorigenesis [2224]. Expression of NPAT, a transcriptional activator for histone genes, is also associated with human tumorigenesis [25]. The influence of histone genes on tumor developments might be supported by the eQTLs identified in the current study, because some of the eQTLs were located within anticipated tumor suppressor genes such as low-density lipoprotein receptor-related protein (LRP1B) and utrophin (UTRN) [26, 27].

In conclusion, we identified 31 eQTLs for histone genes. The eQTLs were also associated with nonhistone genes that exhibited both a cell cycle-dependent expression and a nonpolyadenylated RNA structure. Further investigations are required to understand the mechanisms regulating the transcription of the histone and nonhistone genes identified in this study and to appreciate their influence on cancer and other diseases. Moreover, identification of eQTLs using disease-specific cell types would provide resolute mechanisms by diseases.

Conflicts of Interest

The authors declare that they have no competing interests.


This work was funded by the National Research Foundation of Korea, the Ministry of Education, Science, and Technology (Grant no. NRF-2012M3A9D1054705).


  1. W. F. Marzluff and R. J. Duronio, “Histone mRNA expression: multiple levels of cell cycle regulation and important developmental consequences,” Current Opinion in Cell Biology, vol. 14, no. 6, pp. 692–699, 2002. View at Publisher · View at Google Scholar · View at Scopus
  2. G. Li, X. Ruan, R. K. Auerbach et al., “Extensive promoter-centered chromatin interactions provide a topological basis for transcription regulation,” Cell, vol. 148, no. 1-2, pp. 84–98, 2012. View at Publisher · View at Google Scholar · View at Scopus
  3. L. S. Shopland, M. Byron, J. L. Stein, J. B. Lian, G. S. Stein, and J. B. Lawrence, “Replication-dependent histone gene expression is related to Cajal body (CB) association but does not require sustained CB contact,” Molecular Biology of the Cell, vol. 12, no. 3, pp. 565–576, 2001. View at Google Scholar
  4. G. S. Stein, A. J. van Wijnen, J. L. Stein et al., “An architectural perspective of cell-cycle control at the G1/S phase cell-cycle transition,” Journal of Cellular Physiology, vol. 209, no. 3, pp. 706–710, 2006. View at Publisher · View at Google Scholar · View at Scopus
  5. Genomes Project Consortium, G. R. Abecasis, A. Auton et al., “An integrated map of genetic variation from 1,092 human genomes,” Nature, vol. 491, no. 7422, pp. 56–65, 2012. View at Publisher · View at Google Scholar · View at Scopus
  6. T. Lappalainen, M. Sammeth, M. R. Friedlander et al., “Transcriptome and genome sequencing uncovers functional variation in humans,” Nature, vol. 501, no. 7468, pp. 506–511, 2013. View at Publisher · View at Google Scholar · View at Scopus
  7. P. A. t Hoen, M. R. Friedlander, J. Almlof et al., “Reproducibility of high-throughput mRNA and small RNA sequencing across laboratories,” Nature Biotechnology, vol. 31, no. 11, pp. 1015–1022, 2013. View at Publisher · View at Google Scholar · View at Scopus
  8. S. Purcell, B. Neale, K. Todd-Brown et al., “Plink: a tool set for whole-genome association and population-based linkage analyses,” American Journal of Human Genetics, vol. 81, no. 3, pp. 559–575, 2007. View at Publisher · View at Google Scholar · View at Scopus
  9. J. C. Barrett, B. Fry, J. Maller, and M. J. Daly, “Haploview: analysis and visualization of LD and haplotype maps,” Bioinformatics, vol. 21, no. 2, pp. 263–265, 2005. View at Publisher · View at Google Scholar · View at Scopus
  10. S. B. Gabriel, S. F. Schaffner, H. Nguyen et al., “The structure of haplotype blocks in the human genome,” Science, vol. 296, no. 5576, pp. 2225–2229, 2002. View at Publisher · View at Google Scholar · View at Scopus
  11. W. McLaren, B. Pritchard, D. Rios, Y. Chen, P. Flicek, and F. Cunningham, “Deriving the consequences of genomic variants with the Ensembl API and SNP Effect Predictor,” Bioinformatics, vol. 26, no. 16, pp. 2069-2070, 2010. View at Publisher · View at Google Scholar · View at Scopus
  12. A. P. Boyle, E. L. Hong, M. Hariharan et al., “Annotation of functional variation in personal genomes using RegulomeDB,” Genome Research, vol. 22, no. 9, pp. 1790–1797, 2012. View at Publisher · View at Google Scholar · View at Scopus
  13. E. A. El-Karim, E. G. Hagos, A. M. Ghaleb, B. Yu, and V. W. Yang, “Kruppel-like factor 4 regulates genetic stability in mouse embryonic fibroblasts,” Molecular Cancer, vol. 12, p. 89, 2013. View at Publisher · View at Google Scholar · View at Scopus
  14. D. Meeks-Wagner and L. H. Hartwell, “Normal stoichiometry of histone dimer sets is necessary for high fidelity of mitotic chromosome transmission,” Cell, vol. 44, no. 1, pp. 43–52, 1986. View at Publisher · View at Google Scholar · View at Scopus
  15. S. Wu, W. Yue, M. Jia et al., “Association of the neuropilin-2 (NRP2) gene polymorphisms with autism in Chinese Han population,” American Journal of Medical Genetics Part B: Neuropsychiatric Genetics, vol. 144B, no. 4, pp. 492–495, 2007. View at Publisher · View at Google Scholar · View at Scopus
  16. C. Chen, M. Li, H. Chai, H. Yang, W. E. Fisher, and Q. Yao, “Roles of neuropilins in neuronal development, angiogenesis, and cancers,” World Journal of Surgery, vol. 29, no. 3, pp. 271–275, 2005. View at Publisher · View at Google Scholar · View at Scopus
  17. R. W. Old and H. R. Woodland, “Histone genes: not so simple after all,” Cell, vol. 38, no. 3, pp. 624–626, 1984. View at Publisher · View at Google Scholar
  18. R. Reiner, N. Krasnov-Yoeli, Y. Dehtiar, and N. Jarrous, “Function and assembly of a chromatin-associated RNase P that is required for efficient transcription by RNA polymerase I,” PLoS One, vol. 3, no. 12, article e4072, 2008. View at Publisher · View at Google Scholar · View at Scopus
  19. X. Darzacq, B. E. Jady, C. Verheggen, A. M. Kiss, E. Bertrand, and T. Kiss, “Cajal body-specific small nuclear RNAs: a novel class of 2-O-methylation and pseudouridylation guide RNAs,” The EMBO Journal, vol. 21, no. 11, pp. 2746–2756, 2002. View at Publisher · View at Google Scholar · View at Scopus
  20. L. Yang, M. O. Duff, B. R. Graveley, G. G. Carmichael, and L. L. Chen, “Genomewide characterization of non-polyadenylated RNAs,” Genome Biology, vol. 12, no. 2, article R16, 2011. View at Publisher · View at Google Scholar · View at Scopus
  21. S. Misri, S. Pandita, R. Kumar, and T. K. Pandita, “Telomeres, histone code, and DNA damage response,” Cytogenetic and Genome Research, vol. 122, no. 3-4, pp. 297–307, 2008. View at Publisher · View at Google Scholar · View at Scopus
  22. K. Yoshioka, Y. Atsumi, H. Nakagama, and H. Teraoka, “Development of cancer-initiating cells and immortalized cells with genomic instability,” World Journal of Stem Cells, vol. 7, no. 2, pp. 483–489, 2015. View at Publisher · View at Google Scholar
  23. M. Piscopo, G. Campisi, G. Colella et al., “H3 and H3.3 histone mRNA amounts and ratio in oral squamous cell carcinoma and leukoplakia,” Oral Diseases, vol. 12, no. 2, pp. 130–136, 2006. View at Publisher · View at Google Scholar · View at Scopus
  24. T. Iwaya, T. Fukagawa, Y. Suzuki et al., “Contrasting expression patterns of histone mRNA and microRNA 760 in patients with gastric cancer,” Clinical Cancer Research, vol. 19, no. 23, pp. 6438–6449, 2013. View at Publisher · View at Google Scholar · View at Scopus
  25. G. Gao, A. P. Bracken, K. Burkard et al., “NPAT expression is regulated by E2F and is essential for cell cycle progression,” Molecular and Cellular Biology, vol. 23, no. 8, pp. 2821–2833, 2003. View at Publisher · View at Google Scholar · View at Scopus
  26. C. X. Liu, Y. Li, L. M. Obermoeller-McCormick, A. L. Schwartz, and G. Bu, “The putative tumor suppressor LRP1B, a novel member of the low density lipoprotein (LDL) receptor family, exhibits both overlapping and distinct properties with the LDL receptor-related protein,” The Journal of Biological Chemistry, vol. 276, no. 31, pp. 28889–28896, 2001. View at Publisher · View at Google Scholar · View at Scopus
  27. Y. Li, J. Huang, Y. L. Zhao et al., “UTRN on chromosome 6q24 is mutated in multiple tumors,” Oncogene, vol. 26, no. 42, pp. 6220–6228, 2007. View at Publisher · View at Google Scholar · View at Scopus