- About this Journal ·
- Abstracting and Indexing ·
- Aims and Scope ·
- Annual Issues ·
- Article Processing Charges ·
- Articles in Press ·
- Author Guidelines ·
- Bibliographic Information ·
- Citations to this Journal ·
- Contact Information ·
- Editorial Board ·
- Editorial Workflow ·
- Free eTOC Alerts ·
- Publication Ethics ·
- Reviewers Acknowledgment ·
- Submit a Manuscript ·
- Subscription Information ·
- Table of Contents
Volume 2014 (2014), Article ID 203435, 11 pages
Genes Associated with SLE Are Targets of Recent Positive Selection
1Department of Medicine, Medical University of South Carolina, Charleston, SC 29425, USA
2Department of Public Health Sciences, Medical University of South Carolina, Charleston, SC 29425, USA
3Department of Public Health Sciences, Wake Forest School of Medicine and Center for Public Health Genomics, Winston-Salem, NC 27157, USA
Received 23 September 2013; Accepted 12 November 2013; Published 23 January 2014
Academic Editor: Juan-Manuel Anaya
Copyright © 2014 Paula S. Ramos et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
The reasons for the ethnic disparities in the prevalence of systemic lupus erythematosus (SLE) and the relative high frequency of SLE risk alleles in the population are not fully understood. Population genetic factors such as natural selection alter allele frequencies over generations and may help explain the persistence of such common risk variants in the population and the differential risk of SLE. In order to better understand the genetic basis of SLE that might be due to natural selection, a total of 74 genomic regions with compelling evidence for association with SLE were tested for evidence of recent positive selection in the HapMap and HGDP populations, using population differentiation, allele frequency, and haplotype-based tests. Consistent signs of positive selection across different studies and statistical methods were observed at several SLE-associated loci, including PTPN22, TNFSF4, TET3-DGUOK, TNIP1, UHRF1BP1, BLK, and ITGAM genes. This study is the first to evaluate and report that several SLE-associated regions show signs of positive natural selection. These results provide corroborating evidence in support of recent positive selection as one mechanism underlying the elevated population frequency of SLE risk loci and supports future research that integrates signals of natural selection to help identify functional SLE risk alleles.
Systemic lupus erythematosus (SLE) is an autoimmune disease whose prevalence, incidence, and disease severity are known to vary among ethnic groups. Increased prevalence has been reported among African-Americans, Asians, Hispanics, and Native Americans (reviewed elsewhere [1, 2]). The reasons for the ethnic disparities remain elusive. According to the “hygiene hypothesis” first proposed by Strachan two decades ago , the increased disease prevalence of autoimmune and allergic diseases in industrialized countries may be due to modern society’s limited pathogen exposure. The Hygiene Hypothesis posits that humans have adapted to infectious exposures that were the norm in the past and that exposure was protective against autoimmune disease. Over many generations environmental pressure may have favored alleles that allow humans to respond to immune system challenges differently but resulted in an increased risk of autoimmune diseases. This could be a mechanism explaining the number of SLE risk alleles that are common in the population.
Human genome variation at the population level is shaped by four evolutionary processes: mutation, migration, random genetic drift, and natural selection. Natural selection is the process by which a trait, in the context of the organism’s environment, becomes either more or less common in a population as a function of the effect of the inherited trait on the differential reproductive success. This ability to survive and reproduce and contribute to the gene pool of the next generation is known as fitness. Natural selection drives adaptation, the evolutionary process whereby over generations the members of a population become better suited to survive and reproduce in that environment. While negative selection decreases the prevalence of traits that diminish individuals’ fitness, positive selection increases the prevalence of adaptive traits. Left untreated, SLE would have a reproductive fitness cost, defined as the ability to raise offspring that successfully reproduce. Thus, some evolutionary process must sustain the relative high frequency of SLE risk alleles seen in current populations around the world. We hypothesize that since the human genome is shaped by adaptation to environmental pressures at the population level, one plausible reason for the higher frequency of disease-risk alleles may be the direct effect of population-specific positive natural selection.
There is compelling evidence that natural selection is acting on a significant fraction of all genes (~3%) [4–7] and as much as 10% of the human genome . Multiple studies have identified genes involved in immune-related functions to be under selection [8–10], including the HLA [11–14] (associated with all autoimmune diseases), BTLA  (associated with rheumatoid arthritis), ITPR3  (SLE, type 1 diabetes, Grave’s disease), PTPN22  (rheumatoid arthritis, Crohn’s disease, type 1 diabetes, vitiligo), ITGAX  (SLE), and BLK  (SLE, rheumatoid arthritis, Kawasaki disease). Finally, we have recently provided evidence that variants within the APOL1 gene known to be under selective pressure in some African populations predispose to end-stage kidney disease in SLE . Given the increasing evidence of selection at loci associated with human autoimmune diseases, identification of alleles under selection may provide further insight into SLE susceptibility and help understand the natural history of SLE predisposition.
A list of genetic regions with compelling evidence of association with SLE was compiled from the literature. This list includes results that met genome-wide significance in any genome-wide association study (GWAS) or transethnic study of SLE and common or rare variants that are considered established SLE-predisposing loci from candidate gene and other studies. The list of regions was based on the literature as of August 2013 and comprises 89 genes in 74 genomic regions.
This list was built upon all the SLE-associated regions described in recent reviews [16–19], which include common and rare variants from candidate gene studies with compelling evidence of association with SLE. We included all reported risk variants for SLE using data from the National Human Genome Research Institute’s Catalog of Published GWAS (http://www.genome.gov/gwastudies) accessed on August 30th, 2013 . Finally, we searched PubMed (http://www.ncbi.nlm.nih.gov/pubmed) for all large-scale transethnic or multiracial studies in SLE and catalogued all variants with a reported meta-analysis . The references for these more recent studies are included in Table 1. Given the paucity of studies conducted in some minority populations, and in order to avoid differential bias due to the number of reported associations in different ethnic groups, we chose to include all variation regardless of the population(s) where they were reported and ignore the information about the population(s) where they have been reported to date.
Assuming no other influencing factors, the advantageous alleles at a locus under positive selective pressure will tend to stochastically increase in prevalence over generations. This can lead to allele frequency differences between populations, which can be detected using statistics that compare the genetic variability within and between populations . It can also lead to the haplotype carrying the advantageous allele to remain longer than genetic distance predicts around alleles of equal frequency, which can be measured using haplotype-based statistics . The evidence of selection in each SLE-associated region was analyzed using both population differentiation, allele frequency spectrum, and haplotype-based statistics in the HapMap II and HGDP populations as implemented in the Haplotter (http://haplotter.uchicago.edu/)  and the Human Genome Diversity Project (HGDP) Selection Browsers (http://hgdp.uchicago.edu/cgi-bin/gbrowse/HGDP/) , respectively.
Haplotter displays the results of a scan for positive selection in the human genome using the International HapMap Project data (http://haplotter.uchicago.edu/) . These data consist of ~800,000 polymorphic SNPs in three distinct population samples of unrelated individuals: 89 Japanese and Han Chinese individuals from Tokyo and Beijing, respectively, denoted as East Asian (ASN), 60 individuals of northern and western European origin (CEU), and 60 Yoruba (YRI) from Ibadan, Nigeria. It shows results on the autosomes only. Results from several selection statistics are displayed, including (1) the fixation index , (2) the Tajima’s , and (3) the integrated haplotype score (iHS). In situations where selection is restricted to certain populations or geographical locations, the allele frequencies at the locus that is undergoing selection may vary significantly between different populations. The fixation index provides a metric of the magnitude of global allele frequency differentiation between populations at a locus [69, 71]. is directly related to the variance in allele frequency among populations and, conversely, to the degree of resemblance among individuals within populations. If is small, it means that the allele frequencies within each population are similar; if it is large, it means that the allele frequencies are different . The Tajima’s is based on the frequencies of the polymorphisms segregating in a locus . As described , positive selection results in an excess of high frequency derived alleles compared to neutral expectations when the selected allele has swept to high frequencies. Positive selection also results in an excess of low frequency polymorphisms, especially when the selected allele is close to fixation or right after fixation. This skewing of SNP frequencies in different directions can be detected by Tajima’s , which is based on the frequencies of SNPs segregating in the region of interest . Signals of selective sweeps will result in high negative . The integrated haplotype score (iHS) uses the lengths of the haplotypes surrounding each core SNP to identify SNPs for which alleles have rapidly risen in frequency [7, 74]. It is based on linkage disequilibrium (LD) surrounding a positively selected allele compared with background, providing evidence of recent positive selection at a locus . An iHS score > 2.0 reflects the fact that haplotypes on the ancestral background are longer compared with those on the derived allelic background.
For these analyses, genome-wide SNP data from Phase II of the HapMap Project were used to investigate if the regions associated with SLE showed evidence of selection in the CEU, YRI, and ASN populations using these three metrics (iHS, Tajima’s , and ). Regions of 1 Mb around each of the 74 regions in Table 1 were queried, and, when higher than 2, the maximum value on the -axis () in this 1 Mb interval was recorded. As described by Voight et al. , the value represents the negative log of the rank of the observed statistic for a given SNP divided by the total number of SNPs. The statistic that is ranked is obtained independently for each of the three statistics separately for each population. For , the estimated value of was used for ranking. For iHS, for each SNP, 25 SNPs on either side of the SNP are scanned for . The proportion of SNPs in this 51 SNP window with is computed. For , the statistic to be ranked is obtained in a similar manner as that for iHS except for each population comparison, the thresholds for defining a significant is based on the top 5% cutoff for each population comparison. The different thresholds used for were CEU-YRI: 0.2976, CEU-ASN: 0.2055, and YRI-ASN: 0.3374. Haplotter also displays the value of the SNPs in the top 1% within each population comparison, which were also recorded, if any such SNPs were present in the 1 Mb interval. In addition to these, Haplotter shows an empirical value estimated for each gene and for each population, as detailed by Voight et al. . When this value showed significant evidence for selection, the value was recorded.
The HGDP Selection Browser displays results from a series of genome-wide scans for natural selection using single nucleotide polymorphism (SNP) genotype data from the Human Genome Diversity-CEPH Panel (HGDP), a dataset containing 938 individuals from 53 populations typed on the Illumina 650Y platform (http://hgdp.uchicago.edu/cgi-bin/gbrowse/HGDP/) . Summary statistics regarding haplotype structure and population differentiation on this data can be queried in the browser. These include the iHS, the , and the cross-population extended haplotype homozygosity test (XP-EHH) . While the iHS detects partial selective sweeps of moderate frequency (~50%–80%), the XP-EHH detects selected alleles that have risen to near fixation in one population (above 80% frequency) [7, 74]. As described by Pickrell et al. , the was calculated on the level of population groupings identified by Rosenberg et al. ; that is, if a SNP has high , most of the variance in allele frequencies is captured by the seven labels identified in that paper. In the browser, plotted is the of the empirical value for each SNP—the higher this plotted value, the more extreme (high) the value is compared the rest of the genotyped SNPs. The iHS was calculated as in Voight et al.  and smoothed across windows. Plotted is the of the value for a window centered at the SNP; high values again indicate potential signals of positive selection. The test statistic was the fraction of SNPs with . The XP-EHH was calculated as in Sabeti et al.’s work . The test statistic was the maximum XP-EHH. Again, the plotted measure is a measure of how extreme a SNP is with regard to the rest of the genome, and high values indicate outliers potentially due to the action of natural selection. The iHS and XP-EHH have been calculated in each individual population, as well as in the following groupings: Bantu-speaking populations, Europeans, Middle Easterners, Central Asians, East Asians, Americans, and Oceanians.
Regions of 1 Mb around each of the 74 regions in Table 1 were queried, and the maximum value on the -axis () in this 1 Mb interval was recorded.
To test whether SLE susceptibility loci show evidence of positive selection, a list of 74 genetic regions with compelling evidence of association with SLE was compiled (Table 1). In order to test whether SLE-associated loci show evidence for recent positive selection, 1 Mb regions around each of the 74 regions were queried. Regions where the maximum (for Haplotter) or (for HGDP) for the , , iHS, or XP-EHH were considered as showing evidence for recent positive selection (Tables 2 and 3). In addition, regions that in the HapMap populations had SNPs with values in the top 1% within each population comparison, or whose empirical value estimated for each gene and for each population showed significant evidence for selection ( value < 0.001) were also considered to show evidence for selection. Of the 74 regions associated with SLE, 19 showed evidence of selection in a HapMap population (Table 2), and 16 exhibited a signal of selection in a HGDP population (Table 3). Many of these loci also had corroborating evidence using different metrics.
In the HapMap data multiple regions displayed evidence of population differentiation, as indicated by the , which was the highest in the PTPN22, TET3-DGUOK, ITPR3, ITGAM, and CD226 regions. Several SNPs with very high (in the top 1% within each population comparison) were identified in these and other regions, especially XKR6-BLK ( in YRI versus ASN), TET3-DGUOK ( in YRI versus ASN, and in YRI versus CEU), CD226 ( in CEU versus YRI), LRRC18-WDFY4 ( in YRI versus ASN), IFIH1 ( in CEU versus YRI), PTPN22 ( in YRI versus ASN), and ITGAM ( in YRI versus ASN). The highest allele frequency differences, as indicated by the statistic, were detected in the PTPN22, IFIH1, ITPR3, and XKR6-BLK regions. The ITPR3 region also had a high iHS. This and BLK are the regions that displayed the most consistently strong evidence for selection according to all three metrics. The ITPR3 gene lies at 6p21, adjacent to the centromeric end of the extended MHC region, after the class II flanking region. XKR6 and BLK lie on the same chromosomal inversion at 8p23.1. PTPN22, ITPR3, and CD226 exhibited the strongest evidence for selection according to the frequency-based statistics. Finally, several regions included genes whose empirical value showed significant evidence for selection. These genes included XKR6 ( in ASN) and UHRF1BP1 ( in CEU). Other genes were significant in several regions, such as the TET3-DGUOK region (DUSP11 and STAMBP with and , resp., in CEU). The PTPN22, ITGAX (near ITGAM), ITPR3, and BLK regions were recently reported to be under selection (in YRI, YRI, YRI, and ASN, resp.) in a candidate gene study by Grossman et al. , who used full-genome sequence variation from the 1000 Genomes Project and the composite of multiple signals (CMS) test.
Since the regions in Table 2 showed evidence of selection in the HapMap samples, the evidence centered at the specific SNP associated with SLE were tested (Supplementary Table 1 in the Supplementary Material available online at http://dx.doi.org/10.1155/2014/203435). Specifically, Haplotter displays the iHS and for common SNPs. Of the queried SLE-associated SNPs, the highest evidence of population differentiation was shown by rs9937837 in ITGAM ( in YRI versus ASN). Evidence for association according to the iHS test was observed in CFHR1-CFHR4 (rs16840639, in YRI), NMNAT2 (rs2022013, in ASN), APOBEC4 (rs10911390, in ASN), CFH (rs6677604, in YRI), UHRF1BP1 (rs11755393, in CEU), and CD226 (rs727088, in CEU). The evidence for selection at the UHRF1BP1 variant was recently reported in a study of candidate inflammatory-disease SNPs using the same statistic and HapMap II data .
In the HGDP data, the highest XP-EHH was detected in the BLK, CLEC16A, and IRF8 regions and the maximum iHS in the CLEC16A and PTTG1 regions. The CLEC16A, BLK, PTPN22, and UHRF1BP1 regions showed strong evidence for selection under the haplotype-based statistics. TNFSF4, IL10, and BLK were the regions showing the highest degree of population differentiation. The TNFSF4 and BLK regions showed the strongest most consistent evidence of selection according to all three metrics. Using the same HapMap II data, Raj and colleagues  previously reported SNPs with a significant signal of selection in CLEC16A (rs12708716, in CEU) and UHRF1BP1 (rs11755393, in CEU). As mentioned, the BLK and ITGAX-ITGAM regions were recently reported to be under selection (in ASN and YRI, resp.) in a candidate genes study using the 1000 Genomes Project samples . For the genes in Table 2, an inspection of the worldwide distribution of allele frequencies for the SNPs associated with SLE (Supplementary Table ) revealed interesting patterns for SNPs in BLK, ITGAM, and CLEC16A (Figure 1).
Comparing the results of the tests for selection in the HapMap and the HGDP samples shows that there are seven genetic regions captured by at least one test in both datasets (Table 4). The common regions captured by the majority of tests were that of the PTPN22, UHRF1BP1, and BLK genes. While the region of the TNIP1 gene was captured in both the HapMap and HGDP populations by the frequency spectrum and population differentiation statistics ( and ), the region of the UHRF1BP1 gene was captured by the haplotype-based statistics. The evidence for selection in these seven genetic regions (Table 4) is strengthened by the fact that they show consistent evidence across different studies and analytic methods.
The diversity exhibited in the human genome is a result of stochastic population genetics processes such as mutation, migration, drift, and selection. SLE disproportionately affects women of child bearing age and without treatment would tend to put affected individuals at a reproductive disadvantage; here, reproductive disadvantage not only includes conception but the ability to raise offspring that successfully reproduce. Thus, strong alternative forces or changing selective pressure must exist that permits the relative high frequency of these risk alleles seen in current populations around the world. Infectious diseases and pathogenic exposures have been postulated to be important factors resulting in strong selective pressure and might provide such alternative pressures. This study investigated whether SLE susceptibility loci show signs of recent positive selection by comparing these regions to the background distribution of genetic variation.
Two important studies have computed several genome-wide tests for selection in two main reference populations, the HapMap and the HGDP populations [7, 70], and implemented the results in genetic browsers. These browsers were queried to assess whether SLE-associated genetic regions have shown evidence for selection in the HapMap and HGDP populations.
This study reports several SLE-associated loci that show evidence for selection in the HapMap populations, and several SLE-associated loci that show evidence for selection in the HGDP populations. Seven genetic regions showed evidence for selection on both the HapMap and HGDP populations. These include the regions of the PTPN22, TNFSF4, TET3-DGUOK, TNIP1, UHRF1BP1, BLK, and ITGAM genes. In addition to the regions that are concordant, the different results obtained with the different metrics and datasets are expected, mostly due to the different coverage of the SNP arrays used, local adaptation in different ethnic groups, and the different test statistics which are likely recovering selective events from different time periods and for different stages of the selective sweep .
Several of these genes have been previously reported to show patterns of genetic variation that are consistent with evidence for recent positive selection. For example, in their search for inflammatory-disease SNPs that localize to regions of the genome where patterns of genetic variation are consistent with that expected under a model of recent positive selection, Raj and colleagues  also reported SNPs in CLEC16A and UHRF1BP1 that exhibit a significant signal of selection using the iHS test. Furthermore, they show that the SLE susceptibility allele in UHRF1BP1 is associated with decreased UHRF1BP1 RNA expression in different cell subsets, suggesting that the SLE risk allele is under recent selection and has a regulatory effect . Furthermore, UHRF1BP1 has been shown to be significantly differentially expressed in dendritic cells after Mycobacterium tuberculosis (MTB) infection . Using full-genome sequence variation from the 1000 Genomes Project and the composite of multiple signals (CMS) test, Grossman et al.  reported the PTPN22, ITGAX (near ITGAM), ITPR3, and BLK regions to show evidence for recent positive selection.
Several of the immune genes that have been identified in regions under selection are under the selective pressure of known pathogens, such as the Duffy blood group atypical chemokine receptor (DARC) gene to Plasmodium vivax malaria , ras homolog family member A (RHOA), and OTU domain ubiquitin aldehyde binding 1 (OTUB1) genes to Yersinia pestis (plague) , or the tyrosylprotein sulfotransferase 1 (TPST1) gene to HIV . Several genetic regions associated with susceptibility to different autoimmune diseases show evidence of selection that has been attributed to host-pathogen coevolution, including the multiple major histocompatibility complex (MHC) [82–84] and the celiac risk locus SH2B3 as a protective factor against bacterial infection . Karlsson et al.  have recently reported that cholera has exerted strong selective pressure on proinflammatory pathways, and Jostins et al.  reported considerable overlap between susceptibility loci for inflammatory bowel disease and mycobacterial infection. Variants in the IFIH1 gene, whose protein is a cytoplasmic helicase that recognizes RNA of picornaviruses and mediates induction of interferon response to viral RNA, have been shown to affect IFIH1 function and host antiviral response . In the context of SLE predisposing loci, Clatworthy et al.  have shown that FCGR2B is important in controlling the immune response to Plasmodium falciparum, the parasite responsible for the most severe form of malaria, and suggests that the higher frequency of human FCGR2B polymorphisms predisposing to SLE in Asians and Africans may be maintained because these variants reduce susceptibility to malaria. The complement component (3b/4b) receptor 1 (CR1) gene has been shown to be a P. falciparum resistance gene  used by the parasite for host invasion. Machado et al.  have suggested that helminth infection has driven positive selection of FCGRs variation. Finally, Grossman et al.  implicated Salmonella typhimurium and other exposures that directionally drive selection of the toll-like receptor 5 (TLR5) gene . Given that infectious organisms are strong agents of natural selection, it is plausible that alleles selected for protection against infection predispose to autoimmune diseases.
It is important to acknowledge the challenges and limitations inherent to the study of traits with complex genetic architectures and/or a less clear influence on survival and reproduction, such as SLE. As Castiblanco and colleagues  recently articulated, the differences in allele and genotype frequencies of diverse human populations depend upon their evolutionary and epidemiological history, including environmental exposures, which might explain why some risk alleles to autoimmunity may be protective factors to infectious diseases and vice versa in a given population (e.g., PTPN22 [94, 95] and TNF ). Immune and infectious agents have been recognized as among the strongest selective pressures for natural populations, as shown by the identification of candidate adaptive alleles that functionally contribute to biological variation in contemporary populations. However, clarifying the relationship between the functional alleles and reproductive fitness in the environment in which they rose to a high frequency in the ancestors of the study population can rarely be attained. In complex diseases such as SLE, despite the established associations to specific regions or polymorphisms, the true causal variants still remain largely unknown. The emerging availability of genome-wide functional data allows the integration of an unprecedented amount of biological information to help identify potential functional variants and characterize their biological impact. Recent examples demonstrate how the integration of signatures of positive selection with phenotypic association studies and/or with regulatory data can improve the identification of functional loci [10, 97–99]. Also, the complex genetic architecture of SLE, resulting from the effects of many alleles of small effects, suggests that adaptation is likely to have occurred by simultaneous selection on variants at many loci. In this scenario, the response to selection is due to small frequency shifts of many alleles. However, most methods to detect selection rely on rapid fixation of strongly selected alleles. The development of novel analytical approaches to detect more subtle signatures of selection will improve the identification of selection signatures in complex diseases like SLE. Clearly, much remains to be done until the functional adaptive SLE risk loci are identified, the phenotypic consequences of these risk alleles elucidated, and the relationship between the functional alleles and reproductive fitness clarified. Recent progresses will provide the necessary tools to accelerate the discovery of these functional adaptive variants that increase the risk of SLE, which will improve knowledge about the etiology and deepen our understanding of the natural history of SLE. Further research regarding exploration of the interplay between infection, type of exposure, additional environmental factors, and autoimmunity will result in the discovery of multiple factors underpinning perhaps newly identified physiopathology mechanisms of SLE and autoimmune diseases .
In summary, this study has systematically queried the HapMap and HGDP populations for evidence for selection at SLE susceptibility regions and provides a comprehensive catalog of regions with both evidence for recent positive selection and association with SLE. These results provide support for recent positive selection influencing genetic variation associated with SLE, suggesting that population-specific selective pressures may be one of the factors behind the high frequency of SLE risk alleles in the population and differential disease risk. Finally, these results support future analyses aimed at identifying the specific selective pressures and characterizing the functional mechanisms of adaptation and disease predisposition.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
The authors would like to thank Mia T. Chandler for assistance in compiling the list with SLE-associated regions. This study was supported by the US National Institutes of Health (NIH) Grant P60 AR062755, by the South Carolina Clinical & Translational Research (SCTR) Institute, with an academic home at the Medical University of South Carolina, through NIH Grants nos. UL1 RR029882 and UL1 TR000062, and by the WFU Center for Public Health Genomics.
- C. A. Peschken, S. J. Katz, E. Silverman et al., “The 1000 Canadian faces of lupus: determinants of disease outcome in a large multiethnic cohort,” Journal of Rheumatology, vol. 36, no. 6, pp. 1200–1208, 2009.
- M. Fernández, G. S. Alarcón, J. Calvo-Alén et al., “A multiethnic, multicenter cohort of patients with Systemic Lupus Erythematosus (SLE) as a model for the study of ethnic disparities in SLE,” Arthritis Care and Research, vol. 57, no. 4, pp. 576–584, 2007.
- D. P. Strachan, “Hay fever, hygiene, and household size,” British Medical Journal, vol. 299, no. 6710, pp. 1259–1260, 1989.
- M. A. Eberle, M. J. Rieder, L. Kruglyak, and D. A. Nickerson, “Allele frequency matching between SNPs reveals an excess of linkage disequilibrium in genic regions of the human genome,” PLoS Genetics, vol. 2, no. 9, article e142, 2006.
- P. C. Sabeti, D. E. Reich, J. M. Higgins et al., “Detecting recent positive selection in the human genome from haplotype structure,” Nature, vol. 419, no. 6909, pp. 832–837, 2002.
- J. M. Smith and J. Haigh, “The hitch hiking effect of a favourable gene,” Genetical Research, vol. 23, no. 1, pp. 23–35, 1974.
- B. F. Voight, S. Kudaravalli, X. Wen, and J. K. Pritchard, “A map of recent positive selection in the human genome,” PLoS Biology, vol. 4, no. 3, article e72, 2006.
- S. H. Williamson, M. J. Hubisz, A. G. Clark, B. A. Payseur, C. D. Bustamante, and R. Nielsen, “Localizing recent adaptive evolution in the human genome,” PLoS Genetics, vol. 3, no. 6, article e90, 2007.
- M. Fumagalli, R. Cagliani, U. Pozzoli et al., “Widespread balancing selection and pathogen-driven selection at blood group antigen genes,” Genome Research, vol. 19, no. 2, pp. 199–212, 2009.
- S. R. Grossman, K. G. Andersen, I. Shlyakhter, et al., “Identifying recent adaptations in large-scale genomic data,” Cell, vol. 152, pp. 703–713, 2013.
- F. L. Black and P. W. Hedrick, “Strong balancing selection at HLA loci: evidence from segregation in South Amerindian families,” Proceedings of the National Academy of Sciences of the United States of America, vol. 94, no. 23, pp. 12452–12456, 1997.
- R. Cagliani, S. Riva, U. Pozzoli et al., “Balancing selection is common in the extended MHC region but most alleles with opposite risk profile for autoimmune diseases are neutrally evolving,” BMC Evolutionary Biology, vol. 11, no. 1, article 171, 2011.
- X. Liu, Y. Fu, Z. Liu et al., “An ancient balanced polymorphism in a regulatory region of human major histocompatibility complex is retained in Chinese minorities but lost worldwide,” American Journal of Human Genetics, vol. 78, no. 3, pp. 393–400, 2006.
- Z. Tan, A. M. Shon, and C. Ober, “Evidence of balancing selection at the HLA-G promoter region,” Human Molecular Genetics, vol. 14, no. 23, pp. 3619–3628, 2005.
- B. I. Freedman, C. D. Langefeld, K. K. Andringa et al., “End-stage kidney disease in African Americans with lupus nephritis associates with APOL1,” Arthritis and Rheumatism, 2013.
- O. J. Rullo and B. P. Tsao, “Recent insights into the genetic basis of systemic lupus erythematosus,” Annals of the Rheumatic Diseases, vol. 72, Suppl 2, pp. ii56–ii61, 2013.
- S. E. Vaughn, L. C. Kottyan, M. E. Munroe, and J. B. Harley, “Genetic susceptibility to lupus: the biological basis of genetic risk found in B cell signaling pathways,” Journal of Leukocyte Biology, vol. 92, pp. 577–591, 2012.
- S. G. Guerra, T. J. Vyse, and D. S. Cunninghame Graham, “The genetics of lupus: a functional perspective,” Arthritis Research & Therapy, vol. 14, no. 3, article 211, 2012.
- P. S. Ramos, E. E. Brown, R. P. Kimberly, and C. D. Langefeld, “Genetic factors predisposing to systemic lupus erythematosus and lupus nephritis,” Seminars in Nephrology, vol. 30, no. 2, pp. 164–176, 2010.
- L. A. Hindorff, H. Junkins, J. P. Mehta, and T. A. Manolio, “A catalog of published genome-wide association studies,” 2010, http://www.genome.gov/gwastudies/.
- H. Nishino, K. Shibuya, Y. Nishida, and M. Mushimoto, “Lupus erythematosus-like syndrome with selective complete deficiency of C1q,” Annals of Internal Medicine, vol. 95, no. 3, pp. 322–324, 1981.
- V. Gateva, J. K. Sandling, G. Hom et al., “A large-scale replication study identifies TNIP1, PRDM1, JAZF1, UHRF1BP1 and IL10 as risk loci for systemic lupus erythematosus,” Nature Genetics, vol. 41, no. 11, pp. 1228–1233, 2009.
- R. R. Graham, G. Hom, W. Ortmann, and T. W. Behrens, “Review of recent genome-wide association scans in lupus,” Journal of Internal Medicine, vol. 265, no. 6, pp. 680–688, 2009.
- J. B. Harley, M. E. Alarcón-Riquelme, L. A. Criswell et al., “Genome-wide association scan in women with systemic lupus erythematosus identifies susceptibility variants in ITGAM, PXK, KIAA1542 and other loci,” Nature Genetics, vol. 40, no. 2, pp. 204–210, 2008.
- C. Kyogoku, C. D. Langefeld, W. A. Ortmann et al., “Genetic association of the R620W polymorphism of protein tyrosine phosphatase PTPN22 with human SLE,” American Journal of Human Genetics, vol. 75, no. 3, pp. 504–507, 2004.
- F. B. Karassa, T. A. Trikalinos, and J. P. A. Ioannidis, “Role of the Fcγ receptor IIa polymorphism in susceptibility to systemic lupus erythematosus and lupus nephritis: a meta-analysis,” Arthritis and Rheumatism, vol. 46, no. 6, pp. 1563–1571, 2002.
- F. B. Karassa, T. A. Trikalinos, J. P. A. Ioannidis et al., “The FcγRIIIA-F158 allele is a risk factor for the development of lupus nephritis: a meta-analysis,” Kidney International, vol. 63, no. 4, pp. 1475–1482, 2003.
- W. Yang, H. Tang, Y. Zhang, et al., “Meta-analysis followed by replication identifies loci in or near CDKN1B, TET3, CD80, DRAM1, and ARID5B as associated with systemic lupus erythematosus in Asians,” American Journal of Human Genetics, vol. 92, no. 1, pp. 41–51, 2013.
- Y. K. Chang, W. Yang, M. Zhao et al., “Association of BANK1 and TNFSF4 with systemic lupus erythematosus in Hong Kong Chinese,” Genes and Immunity, vol. 10, no. 5, pp. 414–420, 2009.
- D. S. C. Graham, R. R. Graham, H. Manku et al., “Polymorphism at the TNF superfamily gene TNFSF4 confers susceptibility to systemic lupus erythematosus,” Nature Genetics, vol. 40, no. 1, pp. 83–89, 2008.
- A. M. Delgado-Vega, A. K. Abelson, E. Sánchez et al., “Replication of the TNFSF4 (OX40L) promoter region association with systemic lupus erythematosus,” Genes and Immunity, vol. 10, no. 3, pp. 248–253, 2009.
- J. W. Han, H. F. Zheng, Y. Cui et al., “Genome-wide association study in a Chinese Han population identifies nine new susceptibility loci for systemic lupus erythematosus,” Nature Genetics, vol. 41, no. 11, pp. 1234–1237, 2009.
- W. Yang, N. Shen, D. Q. Ye et al., “Genome-wide association study in asian populations identifies variants in ETS1 and WDFY4 associated with systemic lupus erythematosus,” PLoS Genetics, vol. 6, no. 2, Article ID e1000841, 2010.
- J. Zhao, H. Wu, M. Khosravi, et al., “Association of genetic variants in complement factor H and factor H-related genes with systemic lupus erythematosus susceptibility,” PLoS Genetics, vol. 7, no. 5, Article ID e1002079, 2011.
- J. C. Edberg, J. Wu, C. D. Langefeld et al., “Genetic variation in the CRP promoter: association with systemic lupus erythematosus,” Human Molecular Genetics, vol. 17, no. 8, pp. 1147–1155, 2008.
- J. E. Molineros, A. K. Maiti, C. Sun, et al., “Admixture mapping in lupus identifies multiple functional variants within IFIH1 associated with apoptosis, inflammation, and autoantibody production,” PLoS Genetics, vol. 9, Article ID e1003222, 2013.
- A. K. Abelson, A. M. Delgado-Vega, S. V. Kozyrev et al., “STAT4 associates with systemic lupus erythematosus through two independent effects that correlate with gene expression and act additively with IRF5 to increase risk,” Annals of the Rheumatic Diseases, vol. 68, no. 11, pp. 1746–1753, 2009.
- R. R. Graham, C. Cotsapas, L. Davies et al., “Genetic variants near TNFAIP3 on 6q23 are associated with systemic lupus erythematosus,” Nature Genetics, vol. 40, no. 9, pp. 1059–1061, 2008.
- G. Hom, R. R. Graham, B. Modrek et al., “Association of systemic lupus erythematosus with C8orf13-BLK and ITGAM-ITGAX,” New England Journal of Medicine, vol. 358, no. 9, pp. 900–909, 2008.
- Y. H. Lee, S. C. Bae, S. J. Choi, J. D. Ji, and G. G. Song, “Genome-wide pathway analysis of genome-wide association studies on systemic lupus erythematosus and rheumatoid arthritis,” Molecular Biology Reports, vol. 39, pp. 10627–10635, 2012.
- E. F. Remmers, R. M. Plenge, A. T. Lee et al., “STAT4 and the risk of rheumatoid arthritis and systemic lupus erythematosus,” New England Journal of Medicine, vol. 357, no. 10, pp. 977–986, 2007.
- L. Prokunina, C. Castillejo-López, F. Öberg et al., “A regulatory polymorphism in PDCD1 is associated with susceptibility to systemic lupus erythematosus in humans,” Nature Genetics, vol. 32, no. 4, pp. 666–669, 2002.
- M. A. Lee-Kirsch, M. Gong, D. Chowdhury et al., “Mutations in the gene encoding the 3′-5′ DNA exonuclease TREX1 are associated with systemic lupus erythematosus,” Nature Genetics, vol. 39, no. 9, pp. 1065–1067, 2007.
- S. M. Al-Mayouf, A. Sunker, R. Abdwani et al., “Loss-of-function variant in DNASE1L3 causes a familial form of systemic lupus erythematosus,” Nature Genetics, vol. 43, no. 12, pp. 1186–1188, 2011.
- C. J. Lessard, I. Adrianto, J. A. Ice et al., “Identification of IRF8, TMEM39A, and IKZF3-ZPBP2 as susceptibility loci for systemic lupus erythematosus in a large-scale multiracial replication study,” American Journal of Human Genetics, vol. 90, no. 4, pp. 648–660, 2012.
- Y. Okada, K. Shimane, Y. Kochi et al., “A genome-wide association study identified AFF1 as a susceptibility locus for systemic lupus eyrthematosus in Japanese,” PLoS Genetics, vol. 8, no. 1, Article ID e1002455, 2012.
- S. V. Kozyrev, A. K. Abelson, J. Wojcik et al., “Functional variants in the B-cell gene BANK1 are associated with systemic lupus erythematosus,” Nature Genetics, vol. 40, no. 2, pp. 211–216, 2008.
- T. Hughes, X. Kim-Howard, J. A. Kelly et al., “Fine-mapping and transethnic genotyping establish IL2/IL21 genetic association with lupus and localize this genetic effect to IL21,” Arthritis and Rheumatism, vol. 63, no. 6, pp. 1689–1697, 2011.
- W. Tan, K. Sunahori, J. Zhao et al., “Association of PPP2CA polymorphisms with systemic lupus erythematosus susceptibility in multiple ethnic groups,” Arthritis and Rheumatism, vol. 63, no. 9, pp. 2755–2763, 2011.
- L. Boteva, D. L. Morris, J. Cortés-Hernández, J. Martin, T. J. Vyse, and M. M. A. Fernando, “Genetically determined partial complement C4 deficiency states are not independent risk factors for SLE in UK and Spanish populations,” American Journal of Human Genetics, vol. 90, no. 3, pp. 445–456, 2012.
- M. M. A. Fernando, C. R. Stevens, P. C. Sabeti et al., “Identification of two independent risk factors for lupus within the MHC in United Kingdom families,” PLoS Genetics, vol. 3, no. 11, article e192, 2007.
- F. C. Grumet, A. Coukell, J. G. Bodmer, W. F. Bodmer, and H. O. McDevitt, “Histocompatibility (HL-A) antigens associated with systemic lupus erythematosus. A possible genetic predisposition to disease,” New England Journal of Medicine, vol. 285, no. 4, pp. 193–196, 1971.
- H. Waters, P. Konrad, and R. L. Walford, “The distribution of HL-A histocompatibility factors and genes in patients with systemic lupus erythematosus,” Tissue Antigens, vol. 1, no. 2, pp. 68–73, 1971.
- T. Oishi, A. Iida, S. Otsubo et al., “A functional SNP in the NKX2.5-binding site of ITPR3 promoter is associated with susceptibility to systemic lupus erythematosus in Japanese population,” Journal of Human Genetics, vol. 53, no. 2, pp. 151–162, 2008.
- S. L. Musone, K. E. Taylor, T. T. Lu et al., “Multiple polymorphisms in the TNFAIP3 region are independently associated with systemic lupus erythematosus,” Nature Genetics, vol. 40, no. 9, pp. 1062–1064, 2008.
- R. R. Graham, S. V. Kozyrev, E. C. Baechler et al., “A common haplotype of interferon regulatory factor 5 (IRF5) regulates splicing and expression and is associated with increased risk of systemic lupus erythematosus,” Nature Genetics, vol. 38, no. 5, pp. 550–555, 2006.
- S. Sigurdsson, G. Nordmark, H. H. H. Göring et al., “Polymorphisms in the tyrosine kinase 2 and interferon regulatory factor 5 genes are associated with systemic lupus erythematosus,” American Journal of Human Genetics, vol. 76, no. 3, pp. 528–537, 2005.
- C. J. Lessard, I. Adrianto, J. A. Kelly et al., “Identification of a systemic lupus erythematosus susceptibility locus at 11p13 between PDHX and CD44 in a multiethnic study,” American Journal of Human Genetics, vol. 88, no. 1, pp. 83–91, 2011.
- V. Agnello, M. M. De Bracco, and H. G. Kunkel, “Hereditary C2 deficiency with some manifestations of systemic lupus erythematosus,” Journal of Immunology, vol. 108, no. 3, pp. 837–840, 1972.
- N. Manjarrez-Orduno, E. Marasco, S. A. Chung, et al., “CSK regulatory polymorphism is associated with systemic lupus erythematosus and influences B-cell signaling and activation,” Nature Genetics, vol. 44, pp. 1227–1230, 2012.
- K. Yasutomo, T. Horiuchi, S. Kagami et al., “Mutation of DNASE1 in people with systemic lupus erythematosus,” Nature Genetics, vol. 28, no. 4, pp. 313–314, 2001.
- Y. J. Sheng, J. P. Gao, J. Li et al., “Follow-up study identifies two novel susceptibility loci PRKCB and 8p11.21 for systemic lupus erythematosus,” Rheumatology, vol. 50, no. 4, pp. 682–688, 2011.
- S. K. Nath, S. Han, X. Kim-Howard et al., “A nonsynonymous functional variant in integrin-αM (encoded by ITGAM) is associated with systemic lupus erythematosus,” Nature Genetics, vol. 40, no. 2, pp. 152–154, 2008.
- S. E. Löfgren, A. M. Delgado-Vega, C. J. Gallant et al., “A 3′-untranslated region variant is associated with impaired expression of CD226 in T and natural killer T cells and is associated with susceptibility to systemic lupus erythematosus,” Arthritis and Rheumatism, vol. 62, no. 11, pp. 3404–3414, 2010.
- K. Kim, E. E. Brown, C. B. Choi, et al., “Variation in the ICAM1-ICAM4-ICAM5 locus is associated with systemic lupus erythematosus susceptibility in multiple ancestries,” Annals of the Rheumatic Diseases, vol. 71, pp. 1809–1814, 2012.
- T. A. Briggs, G. I. Rice, S. Daly et al., “Tartrate-resistant acid phosphatase deficiency causes a bone dysplasia with autoimmunity and a type i interferon expression signature,” Nature Genetics, vol. 43, no. 2, pp. 127–131, 2011.
- C. O. Jacob, J. Zhu, D. L. Armstrong et al., “Identification of IRAK1 as a risk gene with critical role in the pathogenesis of systemic lupus erythematosus,” Proceedings of the National Academy of Sciences of the United States of America, vol. 106, no. 15, pp. 6256–6261, 2009.
- R. Webb, J. D. Wren, M. Jeffries et al., “Variants within MECP2, a key transcription regulator, are associated with increased susceptibility to lupus and differential gene expression in patients with systemic lupus erythematosus,” Arthritis and Rheumatism, vol. 60, no. 4, pp. 1076–1084, 2009.
- R. C. Lewontin and J. Krakauer, “Distribution of gene frequency as a test of the theory of the selective neutrality of polymorphisms,” Genetics, vol. 74, no. 1, pp. 175–195, 1973.
- J. K. Pickrell, G. Coop, J. Novembre et al., “Signals of recent positive selection in a worldwide sample of human populations,” Genome Research, vol. 19, no. 5, pp. 826–837, 2009.
- C. C. Cockerham and B. S. Weir, “Estimation of inbreeding parameters in stratified populations,” Annals of Human Genetics, vol. 50, no. 3, pp. 271–281, 1986.
- K. E. Holsinger and B. S. Weir, “Genetics in geographically structured populations: defining, estimating and interpreting FST,” Nature Reviews Genetics, vol. 10, no. 9, pp. 639–650, 2009.
- F. Tajima, “Statistical method for testing the neutral mutation hypothesis by DNA polymorphism,” Genetics, vol. 123, no. 3, pp. 585–595, 1989.
- P. C. Sabeti, P. Varilly, B. Fry, et al., “Genome-wide detection and characterization of positive selection in human populations,” Nature, vol. 449, pp. 913–918, 2007.
- N. A. Rosenberg, J. K. Pritchard, J. L. Weber et al., “Genetic structure of human populations,” Science, vol. 298, no. 5602, pp. 2381–2385, 2002.
- T. Raj, M. Kuchroo, J. M. Replogle, et al., “Common risk alleles for inflammatory diseases are targets of recent positive selection,” American Journal of Human Genetics, vol. 92, pp. 517–529, 2013.
- J. M. Akey, “Constructing genomic maps of positive selection in humans: where do we go from here?” Genome Research, vol. 19, no. 5, pp. 711–722, 2009.
- L. B. Barreiro, L. Tailleux, A. A. Pai, B. Gicquel, J. C. Marioni, and Y. Gilad, “Deciphering the genetic architecture of variation in the immune response to Mycobacterium tuberculosis infection,” Proceedings of the National Academy of Sciences of the United States of America, vol. 109, no. 4, pp. 1204–1209, 2012.
- P. C. Sabeti, S. F. Schaffner, B. Fry et al., “Positive natural selection in the human lineage,” Science, vol. 312, no. 5780, pp. 1614–1620, 2006.
- M. J. Edelmann, H. B. Kramer, M. Altun, and B. M. Kessler, “Post-translational modification of the deubiquitinating enzyme otubain 1 modulates active RhoA levels and susceptibility to Yersinia invasion,” FEBS Journal, vol. 277, no. 11, pp. 2515–2530, 2010.
- M. Farzan, T. Mirzabekov, P. Kolchinsky et al., “Tyrosine sulfation of the amino terminus of CCR5 facilitates HIV-1 entry,” Cell, vol. 96, no. 5, pp. 667–676, 1999.
- A. L. Hughes and M. Nei, “Pattern of nucleotide substitution at major histocompatibility complex class I loci reveals overdominant selection,” Nature, vol. 335, no. 6186, pp. 167–170, 1988.
- F. Prugnolle, A. Manica, M. Charpentier, J. F. Guégan, V. Guernier, and F. Balloux, “Pathogen-driven selection and worldwide HLA class I diversity,” Current Biology, vol. 15, no. 11, pp. 1022–1027, 2005.
- N. Qutob, F. Balloux, T. Raj et al., “Signatures of historical demography and pathogen richness on MHC class i genes,” Immunogenetics, vol. 64, no. 3, pp. 165–175, 2012.
- A. Zhernakova, C. C. Elbers, B. Ferwerda et al., “Evolutionary and functional analysis of celiac risk loci reveals SH2B3 as a protective factor against bacterial infection,” American Journal of Human Genetics, vol. 86, no. 6, pp. 970–977, 2010.
- E. K. Karlsson, J. B. Harris, S. Tabrizi et al., “Natural selection in a bangladeshi population from the cholera-endemic ganges river delta,” Science Translational Medicine, vol. 5, no. 192, Article ID 192ra186, 2013.
- L. Jostins, S. Ripke, R. K. Weersma, et al., “Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease,” Nature, vol. 491, pp. 119–124, 2012.
- S. Nejentsev, N. Walker, D. Riches, M. Egholm, and J. A. Todd, “Rare variants of IFIH1, a gene implicated in antiviral responses, protect against type 1 diabetes,” Science, vol. 324, no. 5925, pp. 387–389, 2009.
- M. R. Clatworthy, L. Willcocks, B. Urban et al., “Systemic lupus erythematosus-associated defects in the inhibitory receptor FcγRIIb reduce susceptibility to malaria,” Proceedings of the National Academy of Sciences of the United States of America, vol. 104, no. 17, pp. 7169–7174, 2007.
- I. A. Cockburn, M. J. Mackinnon, A. O'Donnell et al., “A human complement receptor 1 polymorphism that reduces Plasmodium falciparum rosetting confers protection against severe malaria,” Proceedings of the National Academy of Sciences of the United States of America, vol. 101, no. 1, pp. 272–277, 2004.
- L. R. Machado, R. J. Hardwick, J. Bowdrey, et al., “Evolutionary history of copy-number-variable locus for the low-affinity Fcgamma receptor: mutation rate, autoimmune disease, and the legacy of helminth infection,” American Journal of Human Genetics, vol. 90, pp. 973–985, 2012.
- T. R. Hawn, H. Wu, J. M. Grossman, B. H. Hahn, B. P. Tsao, and A. Aderem, “A stop codon polymorphism of Toll-like receptor 5 is associated with resistance to systemic lupus erythematosus,” Proceedings of the National Academy of Sciences of the United States of America, vol. 102, no. 30, pp. 10593–10597, 2005.
- J. Castiblanco, M. Arcos-Burgos, and J. M. Anaya, “What is next after the genes for autoimmunity?” BMC Medicine, vol. 11, article 197, 2013.
- L. M. Gomez, J. M. Anaya, C. I. Gonzalez et al., “PTPN22 C1858T polymorphism in Colombian patients with autoimmune diseases,” Genes and Immunity, vol. 6, no. 7, pp. 628–631, 2005.
- L. M. Gomez, J. M. Anaya, and J. Martin, “Genetic Influence of PTPN22 R620W Polymorphism in Tuberculosis,” Human Immunology, vol. 66, no. 12, pp. 1242–1247, 2005.
- P. A. Correa, L. M. Gomez, J. Cadena, and J. M. Anaya, “Autoimmunity and tuberculosis. Opposite association with TNF polymorphism,” Journal of Rheumatology, vol. 32, no. 2, pp. 219–224, 2005.
- B. Vernot, A. B. Stergachis, M. T. Maurano, et al., “Personal and population genomics of human regulatory variation,” Genome Research, vol. 22, pp. 1689–1697, 2012.
- S. Kudaravalli, J. B. Veyrieras, B. E. Stranger, E. T. Dermitzakis, and J. K. Pritchard, “Gene expression levels are a target of recent natural selection in the human genome,” Molecular Biology and Evolution, vol. 26, no. 3, pp. 649–658, 2009.
- H. B. Fraser, “Gene expression drives local adaptation in humans,” Genome Research, vol. 23, pp. 1089–1096, 2013.