Disease Markers

Disease Markers / 2020 / Article

Research Article | Open Access

Volume 2020 |Article ID 1910215 | https://doi.org/10.1155/2020/1910215

Muhammad Muaaz Aslam, Peter John, Kang-Hsien Fan, Attya Bhatti, Wajahat Aziz, Bashir Ahmed, Eleanor Feingold, F. Yesim Demirci, M. Ilyas Kamboh, "Investigating the GWAS-Implicated Loci for Rheumatoid Arthritis in the Pakistani Population", Disease Markers, vol. 2020, Article ID 1910215, 9 pages, 2020. https://doi.org/10.1155/2020/1910215

Investigating the GWAS-Implicated Loci for Rheumatoid Arthritis in the Pakistani Population

Academic Editor: Wen-Jun Tu
Received26 Feb 2020
Revised17 Jul 2020
Accepted23 Jul 2020
Published03 Aug 2020

Abstract

Rheumatoid arthritis (RA) is a complex and multifactorial autoimmune disorder with the involvement of multiple genetic and environmental factors. Genome-wide association studies (GWAS) have identified more than 50 RA genetic loci in European populations. Given the anticipated overlap of RA-relevant genes and pathways across different ethnic groups, we sought to replicate 58 GWAS-implicated SNPs reported in Europeans in Pakistani subjects. 1,959 unrelated subjects comprising 1,222 RA cases and 737 controls were collected from three rheumatology facilities in Pakistan. Genotyping was performed using iPLEX or TaqMan® methods. A total of 50 SNPs were included in the final association analysis after excluding those that failed assay design/run or postrun QC analysis. Fourteen SNPs (LINC00824/rs1516971, PADI4/rs2240336, CEP57/rs4409785, CTLA4/rs3087243, STAT4/rs13426947, HLA-B/MICA/rs2596565, C5orf30/rs26232, CCL21/rs951005, GATA3/rs2275806, VPS37C/rs595158, HLA-DRB1/rs660895, EOMES/rs3806624, SPRED2/rs934734, and RUNX1/rs9979383) were replicated in our Pakistani sample at false discovery rate (FDR) of <0.20 with nominal values ranging from 4.73E-06 to 3.48E-02. Our results indicate that several RA susceptibility loci are shared between Pakistani and European populations, supporting the role of common genes/pathways.

1. Introduction

Rheumatoid arthritis (RA) is a complex autoimmune disorder due to the involvement of many genetic, environmental, and stochastic elements in its etiology [1, 2]. It is characterized by continuous inflammation associated with altered expression of different proinflammatory (TNFα, IL-1, IL-17) and anti-inflammatory (IL-4, IL-10, IL-13, IL-35) cytokines and activation of B and T cells, which leads to the destruction of synovial joints and ultimately physical disability [3, 4]. Worldwide, RA affects 0.5 to 1% of the population [5]. RA prevalence does not differ significantly between rural and urban areas [6]. It can affect both sexes at any age, but data suggest that women are three times more prone than men [7]. There is a significant genetic susceptibility to RA with 50-60% heritability estimates [8]. Twin studies also revealed high concordance rates in monozygotic twins (12.3-15.4%) relative to dizygotic twins (3.5%), strongly indicating the presence of a genetic component in RA [9].

During the last two decades, many genetic studies, including genome-wide association studies (GWAS), have been conducted to understand the genetic basis of RA and this has led to the identification of more than 50 risk loci [10]. Besides HLA-DRB1, many non-HLA loci have been implicated that are involved in multiple functions/pathways including immune activation (NF-κB signaling, JAK-STAT pathway, regulation, and activation of CD4+T-cells, complement activation), fibroblast differentiation and dedifferentiation, and bone modeling and repair [11, 12].

Thus far, the major focus of genetic studies on RA has largely been on Europeans or European-derived populations, with only limited data available on other populations [13], especially in South Asians [14]. In an effort to expand genetic studies on RA in South Asians and to further delineate its genetic basis, we sought to replicate 58 genome-wide significant single nucleotide polymorphisms (SNPs) from 58 loci previously implicated in Europeans or European-derived populations [1518] in Pakistani population where there is the paucity of such data.

2. Material and Methods

2.1. Study Subjects

A total of 1,959 subjects (1,222 cases, 737 controls) were included in our study. Blood samples and relevant clinical information were collected from three major public or private rheumatology clinics in Pakistan: Pakistan Institute of medical sciences (PIMS), Military hospital, and Rehmat Noor Clinic. All cases included in this study (, 78.6% women) were clinically diagnosed by rheumatologists and met the American College of Rheumatology (ACR) 1987 classification criteria for RA [19]. All controls included in this study (, 39.5% women) were recruited from the general population and were free of any autoimmune disease at the time of recruitment. A screening questionnaire was filled out and a written informed consent was obtained from each subject at the time of the recruitment. All blood samples were collected in EDTA tubes to avoid coagulation and processed shortly after the collection. The study was approved by the Institutional review board (IRB) of the University of Pittsburgh, USA (IRB no. PRO12110472).

2.2. Genomic DNA Extraction

Genomic DNA was extracted from whole blood using either the standard phenol-chloroform extraction method or GeneJET Whole Blood Genomic DNA Purification (Thermo Scientific USA) and quantified using the NanoDrop™ 2000 spectrophotometer (Thermo Scientific USA).

2.3. Genotyping

A total of 58 RA-associated genome-wide significant SNPs () was selected from the previously published GWAS in subjects of European ancestry (Table 1). SNP genotyping was performed using either the TaqMan® (Applied Biosystems, Thermo Fisher Scientific) or iPLEX® Gold (Agena Bioscience) methods and following the manufacturer’s design/order instructions and protocols. After the thermal cycling of the TaqMan® assays and DNAs on 384-well plates, the endpoint fluorescence reading was performed on a QuantStudio™ 12K Flex system (Applied Biosystems, Thermo Fisher Scientific). The iPLEX® Gold genotyping was performed in the Core laboratories of the University of Pittsburgh. 18% replicates were used to test genotyping consistency.


SNPLocusChromosomal location (GRCh38)Variant typeGeneReference

rs22281451q21154454494Missense variantIL6R[15]
rs21053251q25.1173380586Intron variantLOC100506023[16]
rs28434011p362596694Intron variantMMEL1[15]
rs284113521p34.3378129073 prime UTR variantMTF1[16]
rs8832201p3438151199Intron variantLOC105378654[15]
rs24766011p13113834946Missense variantPTPN22[15]
rs67325652q13110850255Intron variantACOXL[16]
rs116769222q11100190478IntergenicAFF3[17]
rs67152842q33.1201289674Intron variantALS2CR12[16]
rs30872432q33203874196Downstream variantCTLA4[17]
rs346959442p1660897715Intron variantREL[15]
rs9347342p1465368452Intron variantSPRED2[17]
rs134269472q32191068528Intron variantSTAT4[15]
rs38066243p24.127723132Upstream variantEOMES[16]
rs98268283q22.3136683218Intron variantSTAG1[16]
rs44523133p24.317005540Intron variantPLCL2[16]
rs133155913p1458571114Intron variantFAM107A[17]
rs8740404p1526106575Upstream variantRBPJ[17]
rs26640354p1148218822Intron variantTEC[16]
rs716241195q1156144903Intron variantANKRD55[15]
rs262325q21103261019Intron variantC5orf30[17]
rs30930236q27167120802Intron variantCCR6[16]
rs22340676p21.3136387877Upstream variantETV7[16]
rs69202206q23137685367Upstream variantTNFAIP3[15]
rs25965656p21.3331385552Upstream variantHLA-B/MICA[18]
rs69100716p2132315077Intron variantTSBP1[17]
rs6608956p2132609603Intron variantHLA-DRB1[14]
rs42727q21.2926075153 prime UTR variantCDK6[16]
rs104886317q32128954129Downstream variantIRF5[17]
rs672504507p15.128135367Intron variantJAZF1[16]
rs6783478q22.3101451374Upstream variantGRHL2[16]
rs15169718q24.21128529854Intron variantLINC00824[16]
rs9987318q21.1380183160Intron variantTPD52[16]
rs9510059p1334743684IntergenicCCL21[17]
rs1276437810q2162040245Intron variantARID5B[15]
rs227580610p148053377Intron variantGATA3[15]
rs70677810p156056986Intron variantIL2RA[17]
rs59515811q1261142109Intron variantVPS37C[15]
rs440978511q2195578258Intergenic/unknownCEP57[16]
rs77312512q13.256001170Intron variantSUOX[16]
rs195089714q24.168293424Intron variantRAD51B[16]
rs804308515q1438535939Intron variantRASGRP1[15]
rs802689815q2369699078Intergenic/unknownTLE3[15]
rs1333017616q2485985481Intergenic/unknownIRF8[15]
rs478040116p13.1311745470Upstream variantTXNDC11[16]
rs1293640917q1239887396Intergenic/unknownIKZF3[15]
rs3453644319p1310352442Missense variantTYK2[15]
rs481048520q1346119308Intron variantCD40[17]
rs997938321q2235343463Intron variantRUNX1[15]
rs189359221q22.342434957Intron variantUBASH3A[16]
rs90968522q13.139351666Intron variantSYNGR1[16]
rs22403361p3617347907Intron variantPADI4[16]
rs101757982p23.130226728Upstream variantLBH[16]
rs96856711q12.261828092Intron variantFADS2[16]
rs1077462412q24.12111395984Upstream variantLINC02356[16]
rs960361613q14.1139793932Downstream variantCOG6[16]
rs7319405821q22.1133391982Intergenic/unknownIFNGR2[16]
rs283451221q2234539301Intron variantRCAN1[15]

2.4. Statistical Analysis

Concordance to Hardy-Weinberg Equilibrium (HWE) was tested using the Chi-square goodness of fit test. Departure from HWE was considered at arbitrarily . Logistic regression using an additive model and minor allele as the effect allele was employed for case-control association analysis using sex and age as covariates. The Benjamin Hochberg false discovery rate (FDR) was applied to correct for multiple testing [20]. was considered as suggestive evidence of association and FDR ( value) of <0.20 as statistically significant as used in previous reports [21, 22]. All analyses were implemented in R, version 3.4.4.

2.5. Functional Annotations

To evaluate the potential biological significance of reported genome-wide significant SNPs, we used the Genotype-Tissue Expression (GTEx) database (https://gtexportal.org/home/) to search for expression quantitative trait loci (eQTL) in RA-relevant tissues and whole blood. We also used the RegulomeDB online database (http://regulome.stanford.edu/) to determine possible regulatory functions of the SNPs located in noncoding regions.

3. Results

A total of 1,222 unrelated RA cases and 737 controls were recruited for this research study. The prevalence of RA was higher in females (78%) than males (22%), supporting the earlier data that females are more prone to RA [14].

Eight of the 58 genotyped SNPs failed the QC (quality control) during assay design (either iPLEX® Gold/TaqMan® or both). The genotype distribution of all QC-passed 50 SNPs adhered to the HWE. The association analyses results in our Pakistani sample are presented in Table 2. Fourteen SNPs showed nominal significance at and FDR of ≤0.20.


Gene/SNPMajor alleleMinor alleleMAFReported GWAS valueNominal value in PakistanisOR (95% CI)FDR ()

LINC00824/rs1516971TC0.0993.20E-114.73E-060.57 (0.45, 0.73)2.37E-04
PADI4/rs2240336CT0.4525.9E−95.00E-050.74 (0.64, 0.86)1.25E-03
CEP57/rs4409785TC0.2623.60E-091.03E-031.33 (1.12, 1.58)1.54E-02
CTLA4/rs3087243AG0.4221.2E−81.23E-031.28 (1.1, 1.49)1.54E-02
STAT4/rs13426947GA0.2387.2E−102.59E-031.31 (1.1, 1.56)2.59E-02
HLA-B/MICA/rs2596565GA0.1199.26E-094.53E-031.4 (1.11, 1.77)3.77E-02
C5orf30/rs26232CT0.1814.10E-083.73E-020.83 (0.69, 0.99)1.52E-01
CCL21/rs951005AG0.2483.90E-102.75E-020.82 (0.69, 0.98)1.52E-01
GATA3/rs2275806AG0.3974.6E−84.02E-021.17 (1.01, 1.36)1.52E-01
VPS37C/rs595158AC0.3203.4E−83.10E-021.19 (1.02, 1.38)1.52E-01
HLA-DRB1/rs660895AG0.111<1E-3004.02E-021.27 (1.01, 1.59)1.52E-01
EOMES/rs3806624GA0.2622.80E-083.54E-020.84 (0.71, 0.99)1.52E-01
SPRED2/rs934734AG0.4065.30E-104.24E-021.17 (1.01, 1.35)1.52E-01
RUNX1/rs9979383TC0.3005.0E−103.48E-020.84 (0.72, 0.99)1.52E-01
IL2RA/rs706778TC0.4581.40E-116.21E-020.87 (0.75, 1.01)2.07E-01
LOC105378654/rs883220CA0.2122.1E−81.08E-010.87 (0.73, 1.03)3.17E-01
IRF5/rs10488631TC0.1964.20E-111.07E-011.16 (0.97, 1.39)3.17E-01
RCAN1/rs2834512GA0.1602.1E−81.39E-010.86 (0.71, 1.05)3.86E-01
ARID5B/rs12764378GA0.2414.5E−101.64E-011.13 (0.95, 1.34)4.33E-01
TYK2/rs34536443GC0.0102.3E−141.73E-010.62 (0.31, 1.23)4.33E-01
IL6R/rs2228145AC0.3231.3E−81.89E-010.9 (0.77, 1.05)4.50E-01
PTPN22/rs2476601GA0.0167.5E−772.21E-011.47 (0.79, 2.73)4.93E-01
SYNGR1/rs909685TA0.4806.40E-122.27E-011.09 (0.95, 1.26)4.93E-01
MTF1/rs28411352CT0.2225.90E-092.42E-011.11 (0.93, 1.32)5.05E-01
CCR6/rs3093023GA0.4251.50E-112.70E-011.09 (0.94, 1.26)5.19E-01
ALS2CR12/rs6715284CG0.1852.50E-092.61E-011.11 (0.92, 1.35)5.19E-01
UBASH3A/rs1893592AC0.2419.80E-093.11E-010.92 (0.77, 1.08)5.75E-01
IRF8/rs13330176TA0.2434.0E−84.04E-010.93 (0.78, 1.11)6.13E-01
PLCL2/rs4452313AT0.4925.2E-113.76E-011.07 (0.92, 1.24)6.13E-01
ACOXL/rs6732565GA0.4449.40E-094.15E-010.94 (0.81, 1.09)6.13E-01
GRHL2/rs678347AG0.4257.30E-104.17E-011.06 (0.92, 1.23)6.13E-01
TLE3/rs8026898GA0.3039.16.E-143.50E-010.93 (0.79, 1.09)6.13E-01
RASGRP1/rs8043085GT0.2801.4E−103.64E-011.08 (0.92, 1.27)6.13E-01
REL/rs34695944TC0.1121.4E−104.05E-011.1 (0.88, 1.39)6.13E-01
COG6/rs9603616CT0.2602.80E-114.47E-010.94 (0.8, 1.11)6.38E-01
TPD52/rs998731TC0.4346.60E-095.04E-010.95 (0.82, 1.1)7.00E-01
TNFAIP3/rs6920220GA0.1322.3E−135.33E-011.07 (0.86, 1.34)7.20E-01
CD40/rs4810485GT0.2502.8E−96.03E-010.96 (0.81, 1.13)7.94E-01
RAD51B/rs1950897TC0.2085.00E-087.02E-011.04 (0.86, 1.24)8.98E-01
FAM107A/rs13315591TC0.0434.60E-087.25E-011.06 (0.75, 1.51)8.98E-01
MMEL1/rs2843401CT0.4616.6E−97.80E-011.02 (0.88, 1.18)8.98E-01
RBPJ/rs874040GC0.1591.00E-168.20E-010.98 (0.81, 1.18)8.98E-01
IKZF3/rs12936409CT0.3922.8E−98.19E-010.98 (0.85, 1.14)8.98E-01
LBH/rs10175798AG0.4744.20E-088.46E-011.01 (0.88, 1.17)8.98E-01
LINC02356/rs10774624AG0.1386.90E-098.38E-010.98 (0.79, 1.2)8.98E-01
FADS2/rs968567GA0.0841.80E-088.62E-010.98 (0.75, 1.27)8.98E-01
AFF3/rs11676922TA0.4851.00E-147.91E-010.98 (0.85, 1.14)8.98E-01
CDK6/rs4272AG0.0831.2E-87.64E-010.96 (0.74, 1.24)8.98E-01
TEC/rs2664035GA0.2563.30E-089.54E-011 (0.84, 1.17)9.70E-01
TXNDC11/rs4780401TG0.4308.70E-099.70E-011 (0.86, 1.16)9.70E-01

We found the same direction of association where the minor allele showed the same risk or protective effect as compared to the previous findings on the tested allele (major or minor). In our data, LINC00824/rs1516971 showed the most significant association (). The second most significant SNP was PADI4/rs2240336 () followed by CEP57/rs4409785 (), CTLA4/rs3087243 (), STAT4/rs13426947 (), and HLA-B-MICA/rs2596565 (). Eight other SNPs showed marginal significance: C5orf30/rs26232 (), CCL21/rs951005 (), GATA3/rs2275806 (), VPS37C/rs595158 (), HLA-DRB1/rs660895 (), EOMES/rs3806624 (), SPRED2/rs934734 (), and RUNX1/rs9979383 (). Figure 1 shows the distribution of tested SNPs across the genome where the SNPs with value < 0.05 are labeled.

Next, we examined the functional significance of all 50 SNPs using the GTEx and RegulomeDB databases. Table 3 shows the category summaries of RegulomeDB scores, and Table 4 shows 14 SNPs that had a RegulomeDB score of ≤3, indicating strong evidence of potential regulatory role. SNPs falling in this category have more likelihood to affect the binding of transcriptional factors. Out of these fourteen SNPs, only five (HLA-B/MICA/rs2596565, HLA-DRB1/rs660895, C5orf30/rs26232, CTLA4/rs3087243, and RUNX1/rs9979383) had in our association results. SYNGR1/rs909685 had the top RegulomeDB score of 1b followed by FADS2/rs968567, CD40/rs4810485, CCR6/rs3093023, HLA-B/MICA/rs2596565, and HLA-DRB1/rs660895 with a score of 1f. The latter two SNPs (HLA-B/MICA/rs2596565 and HLA-DRB1/rs660895) were significant in our sample ( and ) and based on RegulomeDB, both SNPs are eQTL for HLA-DQA1. While HLA-DRB1/rs660895 affects the binding of two proteins (BCLAF1 and POLR2A), no such evidence was found for HLA-B/MICA/rs2596565. SYNGR1/rs909685 is an intronic variant, which falls in the p53decamer binding motif and affects the binding of five different proteins (MAX, MYC, PAX5, TRIM28, and EBF1). FADS2/rs968567 is also an intronic variant and it affects the binding of twenty-eight proteins, including MAX, POLR2A, NFKB1, and NFIC. The binding of these proteins is also affected by the CD40/rs4810485 variant. An intronic CCR6/rs3093023 variant falls in the Oct-1 binding motif and is eQTL for CCR6.


ScoreDescription

Likely to affect binding and linked to the expression of a gene target
1aeQTL + TF binding + matched TF motif + matched DNase footprint + DNase peak
1beQTL + TF binding + any motif + DNase footprint + DNase peak
1ceQTL + TF binding + matched TF motif + DNase peak
1deQTL + TF binding + any motif + DNase peak
1eeQTL + TF binding + matched TF motif
1feQTL + TF binding/DNase peak
Likely to affect binding
2aTF binding + matched TF motif + matched DNase footprint + DNase peak
2bTF binding + any motif + DNase footprint + DNase peak
2cTF binding + matched TF motif + DNase peak
Less likely to affect binding
3aTF binding + any motif + DNase peak
3bTF binding + matched TF motif
Minimal binding evidence
4TF binding + DNase peak
5TF binding or DNase peak
6Motif hit
7No data available


SNPChr.ScoreeQTLBound proteinMotifs

SYNGR1/rs909685chr221bSYNGR1MAX, MYC, PAX5, TRIM28, EBF1p53decamer
FADS2/rs968567chr111fNXF1CTCF, E2F4, EP300, ETS1, FOSL2, FOXP2, GABPA, GABPB1, GATA1, HNF4A, HNF4G, MAX, MYBL2, NFIC, NFKB1, NFYB, PML, POLR2A, RAD21, REST, SIN3A, SP1, SREBF1, SREBF2, TAF1, TCF12, YY1, ZBTB7A
CD40/rs4810485chr201fCD40POLR2A, NFKB1, NFIC, MEF2C, MEF2A, IRF1, IKZF1, FOXM1, BCL3, BATF
CCR6/rs3093023chr61fCCR6Oct_1
HLAB/MICA/rs2596565chr61fBTN3A2, HLA-A, HLA-C, HLA-DPB1, HLA-DQA1, HLA-DQB1, HLA-DRB1, HLA-G, HLA-H, VARSL
HLA-DRB1/rs660895chr61fHLA-DQA2, HLA-DQA1BCLAF1, POLR2A
GRHL2/rs678347chr82aYY1, CEBPB, EP300, FOXA2, JUND, STAT3, TCF12, USF2, SIN3AHNF3, Foxa2, FOXF2, Freac-2, Freac-4, FOXA1, HNF3, FOXC1
PTPN22/rs2476601chr12bFOS, STAT3Gabpa, Etv1, Elk1, Erg, Ehf, PU.1
LBH/rs10175798chr22bATF2, FOXM1, RUNX3, NFIC, SPI1, MEF2A, BATFSTAT1:STAT1
RBPJ/rs874040chr42bGATA2, PML, TAL1MEF-2
C5orf30/rs26232chr52bBHLHE40, CHD1, EP300, FOXM1, MAZ, NFATC1, NFIC, RUNX3, TBL1XR1, TBP, USF2, ATF2, MEF2A, MEF2C, NFKB1ICSBP
RUNX1/rs9979383chr213aMYC, NFKB1E47, FIGLA, ID4, MESP1, SNAI2
CTLA4/rs3087243chr23aMAX, FOS, STAT3Bbx
IRF5/rs10488631chr73aGATA2Nanog

GTEx data eQTL showed the lowest p-eQTL (2.59E-34) for FADS2/rs968567 that affects FADS2 gene expression in transformed fibroblast cells; the same variant also affects TMEM258 (), ZBTB3 (), MYRF (), DAGLA (), and RAB3IL1 (). In transformed fibroblast cells, LINC00824/rs1516971 was a weak eQTL () for RP11-89M16.1. No data was available for CTLA4/rs3087243 in GTEx.

STAT4/rs13426947 was eQTL for STAT4, AC005540.3, and RP11-647K16.1 in EBV-transformed lymphocyte cells. HLA-B/MICA/rs2596565 was eQTL for multiple immune-related genes including C4A, C4B, HLA-S, HLA-B, HLA-C, and MICA in all three tested tissues. C5orf30/rs26232 showed the strongest eQTL () with PPIP5K2 only in whole blood. Additional details of GTEx data are given in supplementary Table S1.

4. Discussion

There have been a number of genome-wide association studies on RA over the past decade that have resulted in the identification of many RA susceptibility loci, thus improving our understanding of the complex genetic underpinning of RA [1517]. Most of the recently published GWAS have been conducted on Europeans and North Americans, which have identified many new RA risk loci and replicated and confirmed the previously reported putative risk loci. Many of these reported risk loci are shared among different ethnic groups, while some are specific to certain populations [23]. Replication studies across different ethnic groups are necessary to better understand the globe-wide population specificity of these established RA risk loci and guide future directions. Since South Asians, including the Pakistani population, have significant European ancestry [24, 25], we chose European originated GWAS-implicated SNPs for replication in Pakistanis. For this purpose, we enrolled 1,959 unrelated RA cases and controls from two public and private Rheumatology facilities, where people from different demographic regions of Pakistan visit for treatment, and we successfully genotyped 50 GWAS—implicated SNPs in those subjects.

In our data set, rs1516971 was the most significant SNP (), which is an intronic variant in the LINC00824 gene at chromosome 8q24.21. Previously, this variant has also been reported in a trans-ethnic association study of RA () [16]. Another SNP (rs6651252) from the same region, which is in complete LD () with rs1516971, has also been reported to be associated with RA and Crohn’s disease [15, 26, 27]. GTEx expression data suggests that rs1516971 affects the expression of RP11-89M16.1 at chromosome 8 in transformed fibroblast cells ().

The second most significant SNP in our data was rs2240336 (), which is present in the 9th intron of PADI4 at chromosome 1p36. PADI4 is one of the most important RA susceptibility loci in multiple ethnic groups, including Europeans, Asians, and Latin Americans [28, 29]. Another SNP in this region, rs2301888 that is in strong LD () with rs2240336, has also been reported to be associated with RA in Europeans and Koreans [30]. However, a study on Iranian population found no significant association of two other SNPs (rs11203367 and rs874881) in the PADI4 region with RA, which are not in strong LD () with rs2240336 [31]. GTEx expression data showed that rs2240336 is eQTL for RP4-798A10.2 () and PADI3 () in EBV transformed lymphocytes. The rs2240336 SNP also controls the expression of FBXO42 () in transformed fibroblasts. The third most important SNP in our data is CEP57/rs4409785 (), which is an intergenic variant on chromosome 11q21. This SNP has also been reported to be associated with other immune-related diseases such as multiple sclerosis, vitiligo, and Graves’ disease [3234]. In GTEx expression data, rs4409785 was eQTL for JRKL () in EBV-transformed lymphocytes, and for AP001877.1 (), and MAML2 () in transformed fibroblast cells.

Among the reported genome-wide significant SNPs that did not show significant association in our sample, the top functional ones based on the GTEx and RegulomeDB database included SYNGR1/rs909685, FADS2/rs968567, CD40/rs4810485, CCR6/rs3093023, and IKZF3/rs12936409. SYNGR1/rs909685 has a RegulomeDB score of 1b and is a strong eQTL for SYNGR1 in EBV-transformed lymphocytes () and whole blood (). FADS2/rs968567 has a RegulomeDB score of 1f and is eQTL for FADS2 in EBV-transformed lymphocytes (), whole blood and () transformed fibroblasts (). CD40/rs4810485 also has a high RegulomeDB score of 1f and is strong eQTL for CD40 in whole blood () and transformed fibroblasts () but weak eQTL for CD40 in EBV-transformed lymphocytes (). CCR6/rs3093023 with RegulomeDB score of 1f was the strongest eQTL for RNASET2 in transformed fibroblasts () and whole blood () with no data reported for EBV-transformed lymphocytes in GTEx. IKZF3/rs12936409 was a strong eQTL for ORMDL3 in all three tested tissues. In GTEx expression data, rs12936409 was found to control the expression of GSDMA, GSDMB, and ORMDL3 in all tested cells and whole blood.

5. Conclusions

We were able to successfully replicate 14 of 50 SNPs selected from previously published GWAS results in European populations in our unique Pakistani population. These findings suggest that there is a sharing of RA risk loci among different population groups. A weakness of our study is the availability of relatively small-sized sample, which may have prevented 36 SNPs from achieving the significance threshold, although they showed a trend for the same directional effects as the reported ones. Further studies using larger samples may help to identify and replicate more RA risk loci in the Pakistani population.

Data Availability

The data used to support the findings of this study are included within the article.

Conflicts of Interest

All authors declared no competing interests.

Acknowledgments

We are very thankful to all Rheumatologists and their staff for providing us blood samples of RA patients and the Higher Education Commission (HEC) and the University of Pittsburgh for providing funding. We thank all the study subjects (RA patients and control subjects) for their participation in this study.

Supplementary Materials

S1: additional details of GTEx data. (Supplementary Materials)

References

  1. P. Szodoray, Z. Szabó, A. Kapitány et al., “Anti-citrullinated protein/peptide autoantibodies in association with genetic and environmental factors as indicators of disease outcome in rheumatoid arthritis,” Autoimmunity Reviews, vol. 9, no. 3, pp. 140–143, 2010. View at: Publisher Site | Google Scholar
  2. M. M. Aslam, P. John, A. Bhatti, S. Jahangir, and M. I. Kamboh, “Vitamin D as a principal factor in mediating rheumatoid arthritis-derived immune response,” BioMed Research International, vol. 2019, Article ID 3494937, 12 pages, 2019. View at: Publisher Site | Google Scholar
  3. F. A. S. Kurreeman, L. Padyukov, R. B. Marques et al., “A candidate gene approach identifies the TRAF1/C5 region as a risk factor for rheumatoid arthritis,” PLoS Medicine, vol. 4, no. 9, article e278, 2007. View at: Publisher Site | Google Scholar
  4. G. S. Firestein, “Evolving concepts of rheumatoid arthritis,” Nature, vol. 423, no. 6937, pp. 356–361, 2003. View at: Publisher Site | Google Scholar
  5. M. Bax, J. van Heemst, T. W. J. Huizinga, and R. E. M. Toes, “Genetics of rheumatoid arthritis: what have we learned?” Immunogenetics, vol. 63, no. 8, pp. 459–466, 2011. View at: Publisher Site | Google Scholar
  6. I. Rudan, S. Sidhu, A. Papana et al., “Prevalence of rheumatoid arthritis in low- and middle-income countries: a systematic review and analysis,” Journal of Global Health, vol. 5, no. 1, article 010409, 2015. View at: Publisher Site | Google Scholar
  7. S. S. Dhawan and A. A. Quyyumi, “Rheumatoid arthritis and cardiovascular disease,” Current Atherosclerosis Reports, vol. 10, no. 2, pp. 128–133, 2008. View at: Publisher Site | Google Scholar
  8. E. J. M. Toonen, P. Barrera, T. R. D. J. Radstake et al., “Gene expression profiling in rheumatoid arthritis: current concepts and future directions,” Annals of the Rheumatic Diseases, vol. 67, no. 12, pp. 1663–1669, 2008. View at: Publisher Site | Google Scholar
  9. A. J. MacGregor, H. Snieder, A. S. Rigby et al., “Characterizing the quantitative genetic contribution to rheumatoid arthritis using data from twins,” Arthritis and Rheumatism, vol. 43, no. 1, pp. 30–37, 2000. View at: Publisher Site | Google Scholar
  10. Y. Ito and S. Sakaguchi, “Rheumatoid arthritis, genetics,” in Encyclopedia of Medical Immunology, I. R. Mackay, N. R. Rose, B. Diamond, and A. Davidson, Eds., pp. 996–1001, Springer, New York, NY, USA, 2014. View at: Publisher Site | Google Scholar
  11. The Wellcome Trust Case Control Consortium, “Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls,” Nature, vol. 447, no. 7145, pp. 661–678, 2007. View at: Publisher Site | Google Scholar
  12. D. Diogo, Y. Okada, and R. M. Plenge, “Genome-wide association studies to advance our understanding of critical cell types and pathways in rheumatoid arthritis: recent findings and challenges,” Current Opinion in Rheumatology, vol. 26, no. 1, pp. 85–92, 2014. View at: Publisher Site | Google Scholar
  13. J. Freudenberg, H. S. Lee, B. G. Han et al., “Genome-wide association study of rheumatoid arthritis in Koreans: population-specific loci as well as overlap with European susceptibility loci,” Arthritis &Rheumatism, vol. 63, no. 4, pp. 884–893, 2011. View at: Publisher Site | Google Scholar
  14. S. F. Jalil, A. Bhatti, F. Y. Demirci et al., “Replication of European rheumatoid arthritis loci in a Pakistani population,” The Journal of Rheumatology, vol. 40, no. 4, pp. 401–407, 2013. View at: Publisher Site | Google Scholar
  15. S. Eyre, J. Bowes, D. Diogo et al., “High-density genetic mapping identifies new susceptibility loci for rheumatoid arthritis,” Nature Genetics, vol. 44, no. 12, pp. 1336–1340, 2012. View at: Publisher Site | Google Scholar
  16. Y. Okada, D. Wu, G. Trynka et al., “Genetics of rheumatoid arthritis contributes to biology and drug discovery,” Nature, vol. 506, no. 7488, pp. 376–381, 2014. View at: Publisher Site | Google Scholar
  17. E. A. Stahl, S. Raychaudhuri, E. F. Remmers et al., “Genome-wide association study meta-analysis identifies seven new rheumatoid arthritis risk loci,” Nature Genetics, vol. 42, no. 6, pp. 508–514, 2010. View at: Publisher Site | Google Scholar
  18. L. Bossini-Castillo, C. de Kovel, H. Kallberg et al., “A genome-wide association study of rheumatoid arthritis without antibodies against citrullinated peptides,” Annals of the Rheumatic Diseases, vol. 74, no. 3, p. e15, 2015. View at: Publisher Site | Google Scholar
  19. F. C. Arnett, S. M. Edworthy, D. A. Bloch et al., “The American Rheumatism Association 1987 revised criteria for the classification of rheumatoid arthritis,” Arthritis & Rheumatism, vol. 31, no. 3, pp. 315–324, 1988. View at: Publisher Site | Google Scholar
  20. Y. Benjamini and Y. Hochberg, “Controlling the false discovery rate - a practical and powerful approach to multiple testing,” Journal of the Royal Statistical Society: Series B (Methodological), vol. 57, no. 1, pp. 289–300, 1995. View at: Publisher Site | Google Scholar
  21. Z. H. Radwan, X. Wang, F. Waqar et al., “Comprehensive evaluation of the association of APOE genetic variation with plasma lipoprotein traits in U.S. whites and African blacks,” PLoS One, vol. 9, no. 12, article e114618, 2014. View at: Publisher Site | Google Scholar
  22. V. Niemsiri, X. Wang, D. Pirim et al., “Genetic contribution of SCARB1 variants to lipid traits in African blacks: a candidate gene association study,” BMC Medical Genetics, vol. 16, no. 1, p. 106, 2015. View at: Publisher Site | Google Scholar
  23. A. Suzuki and K. Yamamoto, “From genetics to functional insights into rheumatoid arthritis,” Clinical and Experimental Rheumatology, vol. 33, no. 4, pp. S40–SS3, 2015. View at: Google Scholar
  24. L. Quintana-Murci, R. Chaix, R. S. Wells et al., “Where west meets east: the complex mtDNA landscape of the southwest and central Asian corridor,” American Journal of Human Genetics, vol. 74, no. 5, pp. 827–845, 2004. View at: Publisher Site | Google Scholar
  25. N. A. Rosenberg, S. Mahajan, S. Ramachandran, C. Zhao, J. K. Pritchard, and M. W. Feldman, “Clines, clusters, and the effect of study design on the inference of human population structure,” PLoS Genetics, vol. 1, no. 6, article e70, 2005. View at: Publisher Site | Google Scholar
  26. A. Franke, D. P. B. McGovern, J. C. Barrett et al., “Genome-wide meta-analysis increases to 71 the number of confirmed Crohn's disease susceptibility loci,” Nature Genetics, vol. 42, no. 12, pp. 1118–1125, 2010. View at: Publisher Site | Google Scholar
  27. S. Matthews, M. Eshelman, K. Schieffer, W. Koltun, and G. S. Yochum, “Tu1804 - Crohn's Disease-Associated Snp, Rs6651252, Demarcates a Wntresponsive DNA element that correlates with increased Myc expression in Cd patients,” Gastroenterology, vol. 154, no. 6, pp. S–1024, 2018. View at: Publisher Site | Google Scholar
  28. X. K. Yang, J. Liu, J. Liu et al., “Associations between PADI4 gene polymorphisms and rheumatoid arthritis: an updated meta-analysis,” Archives of Medical Research, vol. 46, no. 4, pp. 317–325, 2015. View at: Publisher Site | Google Scholar
  29. C. L. Too, S. Murad, J. S. Dhaliwal et al., “Polymorphisms in peptidylarginine deiminase (PADI) associate with rheumatoid arthritis in diverse Asian populations: evidence from MyEIRA study and meta-analysis,” Arthritis Research & Therapy, vol. 14, no. 6, article R250, 2012. View at: Publisher Site | Google Scholar
  30. K. Kim, S. Y. Bang, H. S. Lee et al., “High-density genotyping of immune loci in Koreans and Europeans identifies eight new rheumatoid arthritis risk loci,” Annals of the Rheumatic Diseases, vol. 74, no. 3, article e13, 2015. View at: Publisher Site | Google Scholar
  31. E. Shamsian, M. Azarian, M. Akhlaghi et al., “PADI4 polymorphisms in Iranian patients with rheumatoid arthritis,” Acta Reumatológica Portuguesa, vol. 41, no. 4, pp. 338–343, 2016. View at: Google Scholar
  32. The International Multiple Sclerosis Genetics Consortium & The Wellcome Trust Case Control Consortium 2, “Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis,” Nature, vol. 476, no. 7359, pp. 214–219, 2011. View at: Publisher Site | Google Scholar
  33. Y. Jin, S. A. Birlea, P. R. Fain et al., “Genome-wide association analyses identify 13 new susceptibility loci for generalized vitiligo,” Nature Genetics, vol. 44, no. 6, pp. 676–680, 2012. View at: Publisher Site | Google Scholar
  34. J. J. Khong, K. P. Burdon, Y. Lu et al., “Pooled genome wide association detects association upstream of FCRL3 with Graves' disease,” BMC Genomics, vol. 17, no. 1, p. 939, 2016. View at: Publisher Site | Google Scholar

Copyright © 2020 Muhammad Muaaz Aslam et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


More related articles

 PDF Download Citation Citation
 Download other formatsMore
 Order printed copiesOrder
Views580
Downloads231
Citations

Related articles

Article of the Year Award: Outstanding research contributions of 2020, as selected by our Chief Editors. Read the winning articles.