Abstract

Apicomplexan parasites, including Toxoplasma gondii (T. gondii), express different types of calcium-dependent protein kinases (CDPKs), which perform a variety of functions, including attacking and exiting the host cells. In the current bioinformatics study, we have used several web servers to predict the basic features and specifications of the CDPK7 protein. The findings showed that CDPK7 protein has 2133 amino acid residues with an average molecular weight (MW) of 219085.79 D. The aliphatic index with 68.78 and grand average of hydropathicity (GRAVY) with -0.331 score were estimated. The outcomes of current research showed that the CDPK7 protein included 502 alpha-helix, 1311 random coils, and 320 extended strands with GOR4 method. Considering the Ramachandran plot, the favored region contains more than 92% of the amino acid residues. In addition, evaluation of antigenicity and allergenicity showed that CDPK7 protein has immunogenic and nonallergenic nature. The present research provides key data for more animal-model study on the CDPK7 protein to design an efficient vaccine against toxoplasmosis in the future.

1. Introduction

Toxoplasma gondii is a prevalent intracellular protozoan, which can infect a broad spectrum of mammals (i.e., human) and birds [1, 2]. Oocysts are the potential infective form in the life cycle of the parasite. Feline species as the only definitive hosts can contaminate the environment by shedding unsporulated oocysts through feces [3]. T. gondii is transferred by water/vegetables contaminated via mature oocysts and consumption of raw or semicooked meat from infected animals, vertical transmission from infected pregnant mothers to neonates, and blood transfusion [47]. Approximately one-third of human society has been exposed to T. gondii, worldwide [5, 8, 9]. Often T. gondii infection among immunocompetent people is asymptomatic or demonstrates mild symptoms, whereas in immunocompromised patients, it can cause a various range of clinical symptoms [6, 9, 10]. Toxoplasmosis in immunocompromised subjects can cause repeated attacks in the brain and manifests as encephalitis [11]. Moreover, toxoplasmosis in pregnant women can cause blindness, microcephaly, and mental retardation in the infant [6, 12]. Different factors, such as host’s immune system status, genetic background, age, gender, contact with infected cats, environmental conditions, and diet and cultural habits, as well as the protozoan genotype, can affect the morbidity and mortality rate of Toxoplasma infection [13, 14].

Today, treatment of toxoplasmosis with conventional drugs can just limit the proliferation of tachyzoites at the beginning of infection, while these drugs cannot eradicate cystic forms of parasites in host tissue [15, 16]. In addition, taking these medications in pregnant women can have serious side effects, such as the possibility of teratogenic effects on the fetus [17]. Hence, the discovery and design of an effective vaccine to control and prevent toxoplasmosis is very important, especially in humans and domestic animals. In this regard, various in silico-based studies suggest various antigens as suitable candidates for vaccine design [1832]. Calcium-dependent protein kinases (CDPKs) are a class of serine/threonine kinases that express in apicomplexans, ciliates, and plants [33]. In T. gondii as a member of the apicomplexan parasites, several CDPKs have been identified involving in critical functions in the different stages of the life cycle of parasite, including gliding motility (surface translocation), entry into (invasion), and exit from (egress) of host cells [34]. The CDPK7 is a crucial enzyme for division, growth, and maintenance of structural integrity of the Toxoplasma centrosome. As a result, TgCDPK7 knockdown is suggested as an important goal in achieving the right vaccine [35].

Computer-aided evaluation of different T. gondii proteins involved in various stages of life cycle can open new doors towards recognizing potent vaccine candidates through identification of highly immunogenic, nonallergenic, and nontoxic B- and T-cell epitopes [36]. Thereby, the present in silico study was performed to evaluate the crucial biochemical features and immunogenic epitopes of the CDPK7 protein by means of different bioinformatics servers.

2. Methods

2.1. CDPK7 Sequence

For this purpose, ToxoDB online website was used to obtain the whole amino acid sequence of T. gondii CDPK7 protein.

2.2. Physicochemical Characterization

We used the Expasy ProtParam online server to predict the physicochemical parameters of CDPK7 [37].

2.3. Prediction of Posttranslational Modification (PTM) Sites

The NetPhos 3.1 online tool was applied to predict phosphorylation location, and the CSS-Palm online server was applied to predict acylation location of the CDPK7 [38, 39].

2.4. Transmembrane Domains and Subcellular Location

The transmembrane regions and subcellular localization of T. gondii CDPK7 protein were assessed utilizing the TMHMM 2.0 and PSORT II web servers, respectively [38].

2.5. Secondary and Tertiary Structures

In this study, we employed the Garnier-Osguthorpe-Robson 4 (GOR4) online tool to forecast the secondary structure of CDPK7 protein [40]. Consequently, the three-dimensional (3D) model structures was used by SWISS-MODEL [38, 41].

2.6. The 3D Modeled Structure Refinement and Validation

GalaxyRefine was selected to develop and refine the quality of the template-based protein prediction [42]. To the Ramachandran plot validated the 3D structure of the protein, the SWISS-MODEL software was applied [43]. ProSA-web was used for evaluation of the whole quality of the model [44].

2.7. Linear and Conformational B-Cell Epitopes

We used a web-based Bcepred server to predict continuous B-cell epitopes exploiting physicochemical characteristics [45]. An online server of ABCpred was applied to predict B-cell epitopes [46]. Using the immune epitope database (IEDB), hydrophilicity [47], Bepipred linear epitope prediction [48], antigenicity [49], surface accessibility [50], beta-turn [51], and flexibility [52] were predicted. Afterwards, discontinuous B-cell epitopes were appraised by ElliPro [53] from the 3D structure of protein epitopes.

2.8. MHC-I and MHC-II Epitopes

To this aim, we used the IEDB website to evaluate the half-maximal inhibitory concentration () values of peptides that bind to the main histocompatibility complex (MHC) class I and class II molecules for CDPK7 [54, 55]. All predicted epitopes were then evaluated in terms of antigenicity using the VaxiJen v2.0 server.

2.9. Cytotoxic T-Lymphocyte (CTL) Epitopes

We applied CTLpred online website according to 75.8% accuracy [56]. Next, all predicted epitopes were evaluated regarding antigenicity using the VaxiJen v2.0 server.

2.10. Antigenic and Allergenic Profiles

The antigenicity of the full CDPK7 sequence was estimated by VaxiJen v2.0 [57]. The allergenic profile of CDPK7 was predicted by the AllergenFP v1.0 and AllerTOP v2.0 servers [58, 59].

3. Results

3.1. General Information of CDPK7

The amino acid structure of CDPK7 was obtained from the ToxoDB server with accession no. TGME49_228750. Based on the ProtParam database, the CDPK7 protein entails of 2133 amino acid residues with molecular weight of 219085.79 D, whereas theoretical pI was 5.79. The overall number of negatively (Asp+Glu) charged residues was 209, and positively (Arg+Lys) charged residues was 178. There are a total number of 30441 atoms. The half-life of the CDPK7 was predictable at 30 hours, >20 hours, and >10 hours for mammalian (in vitro), yeast (in vivo), and Escherichia coli (in vivo), respectively. In addition, the instability index of the CDPK7 protein presented an unstable nature with a value of 53.28. In addition, the aliphatic index was calculated 68.78, and GRAVY of the protein was estimated -0.331.

3.2. PTM Sites of CDPK7 Protein

In the present research, the results exhibited that 269 phosphorylation sites (Thr: 64, Tyr: 13, and Ser: 192) (Figures 1(a) and 1(b)) and 30 acylation sites (Table 1) were recognized in the CDPK7, representing that the CDPK7 sequence is composed of 299 possible PTM sites.

3.3. Transmembrane Domains and Subcellular Location

Based on the TMHMM output, no transmembrane domain was found for CDPK7 (Figures 2(a) and 2(b)). Moreover, by PSORT II, the CDPK7 subcellular site was predicted as follows: 78.3% nuclear, 8.7% cytoplasmic, 8.7% plasma membrane, and 4.3% cytoskeletal.

3.4. Secondary and Tertiary Structures

The secondary structure of CDPK7 was predicted via the GOR4 online server, suggesting 502 alpha-helix, 320 extended strands, and 1311 random coils (Figures 3(a) and 3(b)). Moreover, the SWISS-MODEL analysis is shown in Figures 4(a)4(d).

3.5. Refinement and Validation of Tertiary Structure

Protein validation by means of the SWISS-MODEL server displayed that 92.86% of residues were situated in favored regions and 1.65% in the outlier regions. According to the Ramachandran plot, there were 97.80% residues in the favored region with 0.27% residues in the outlier regions of the refined model (Figure 5).

3.6. Predicted Linear and Discontinuous B-Cell Epitopes of the CDPK7 Protein

The predicted linear B-cell epitopes by the Bcepred are listed in Table 2. The outputs of the ABCpred server are tabulated in Table 3 (only the epitopes over scores of 0.75 are embedded in Table 3). The higher peptide score proposes the greater chance of being an epitope. The present server estimated 124 epitopes over 0.75 scores on the sequence, in which the linear epitope SSPPGTPASVVSPAAGAGPI (score: 0.95) had the greatest score. Four epitopes with over 0.95 scores were as follows: “SSPPGTPASVVSPAAGAGPI,” “EVPQAAQPSKGPTKSAMLLQ,” “GGVSPPPQVPPVVVRAASPR,” and “GETVSKRLLFANSAKEQREW.” The average score of antigenicity, beta-turn, flexibility, hydrophilicity, Bepipred linear epitope prediction, and surface accessibility for the CDPK7 protein using the IEDB online server was 1.026, 1.042, 1.017, 2.396, 0.350, and 1.00, respectively (Figure 6). Five discontinuous B-cell epitopes were predicted using the ElliPro server (Table 4).

3.7. MHC-Binding Epitopes

The results are listed in Tables 5 and 6. Epitopes were assessed regarding antigenicity, and those highly antigenic epitopes were finally selected.

3.8. CTL Epitope Prediction

The high-ranked CTL epitopes predicted by the CTLpred tool for CDPK7 protein are summarized in Table 7. Epitopes were assessed regarding antigenicity, and those highly antigenic epitopes were finally selected.

3.9. Antigenic and Allergenic Profiles

The antigenic profile of CDPK7 was conducted by the VaxiJen web server with score of 0.7074 (threshold: 0.5). Based on AllergenFP and AllerTOP v2.0 analyses, the CDPK7 protein was appraised as possible nonallergen.

4. Discussion

Toxoplasmosis is a significant menace to human society as well as livestock industry [2, 8, 60]. Thus, the design and improvement of an efficient vaccine against T. gondii infection is still a great challenge for researchers against toxoplasmosis in domestic animals and humans [61]. Recently, bioinformatics tools are more focused for rational vaccine design, with some advantage, including the following: (i) time- and cost-effectiveness; (ii) accurately targeting, long-lasting immunity with favorable polarity in cellular components; and (iii) elimination of undesired responses through specific, epitope-based construct design. Nevertheless, the obtained in silico results are only theoretical data which must be confirmed using wet lab experiments inevitably [62].

It has been known that CDPK7 contributes to several functions in T. gondii such as gliding movement, host-cell invasion, and egress as well as other vital growth processes [34]. Here, we conducted a comprehensive analysis of TgCDPK7, a member of the CDPK family in T. gondii. The amino acid sequence of CDPK7 comprises 2133 residues with an average MW of 219085.79 D, which characterizes a suitable antigenic nature (the peptides with MW more than 10 kDa are considered as good immunogens) [63]. According to the Expasy ProtParam server, GRAVY and the aliphatic index of the CDPK7 were achieved at -0.331 and 68.78, respectively. In summary, the great value of aliphatic index means that the peptide has more stability in a broad range of various temperatures. Moreover, the low/negative value of the GRAVY factor signifies the better interaction of peptide with the molecules of water. It is efficient to identify that PTMs have a fundamental role in cell stability [64]. The acquired outcomes show that CDPK7 comprises 299 potential PTM sites (269 phosphorylation and 30 acylation positions), representing that these positions may organize protein activity.

To predict the secondary structure of CDPK7, the GOR4 tool was recruited. The results of secondary structure of CDPK7 verified and included 502 (out of 2133) alpha-helix, 320 extended strands, and 1311 random coils. It is known that the key role of the proteins is related to their three-dimensional structure. As such, to comprehend the influences between both structures and functions, assessment of 3D structure is the key aim of expecting a protein’s nature [65].

Humoral and cellular immunity are strongly stimulated in T. gondii infection [66, 67], in such a way that the establishment of IgG antibodies avoids the protozoan from attachment to the receptors of host cell [67]. Interferon-γ (IFN-γ), CD4+, and CD8+ T cells as the main members of T cells play a dynamic role in constraining acute and chronic infection. These major cytokines prevent the reactivation of bradyzoites in the host tissue cyst [66]. Epitope prediction has critical value to evaluate the specificity of antigen. Furthermore, epitope evaluation may reveal the pathogenesis and immune process of the pathogen in design vaccine researches [65, 68]. The strength of using in silico is the detection of the component epitopes that are critical for the interaction of antibodies and antigens. Several linear B-cell epitopes were predicted by the ABCpred server, among which those epitopes above 0.9 score were of great significance to be included in multiepitope vaccine constructs. Moreover, we applied the IEDB online server to evaluate the values of peptides that link to the MHC class I/II molecules for CDPK7. According to the obtained results from IEDB, the T-cell epitopes on CDPK7 have the capability to bind intensely to MHC class I and class II molecules. It is important to note that the lower values show the higher-level of affinity, which show an appropriate T-cell epitope.

Other the main stage, CTLpred is a special approach used to predict CTL epitopes, which is important in vaccine-related studies. This tool relies on elegant machine learning methods, such as ANN and SVM. We recognized the CTL epitopes using the CTLpred online database to select the top CDPK7 epitopes. The CTLpred server utilizes consensus and combined estimates, in line with these two methods [56]. Evaluation of antigenicity and allergenicity showed that CDPK7 protein has immunogenic and nonallergenic nature.

5. Conclusion

Well antigenicity, hydrophilicity, surface accessibility, and flexibility indexes were detected for CDPK7. Hence, we recommend that a suitable vaccine should be designed and verified both in silico and in vivo by the potential B- and T-cell epitopes predicted in this study.

Abbreviations

3D:Three-dimensional
ACC:Auto cross covariance
ANN:Artificial neural network
CD:Cluster of differentiation
CDPK:Calcium-dependent protein kinase
CTL:Cytotoxic T-lymphocyte
GOR:Garnier-Osguthorpe-Robson
GRAVY:Grand average of hydropathicity
IC50:Half-maximal inhibitory concentration
IEDB:Immune epitope database
IFN-γ:Interferon-γ
MHC:Major histocompatibility complex
MW:Molecular weight
PDB:Protein data bank
pI:Isoelectric point
PTM:Post-translational modification
SVM:Support vector machine
T. gondii:Toxoplasma gondii.

Data Availability

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

Ethical Approval

This study received the approval from the Behbahan Faculty of Medical Sciences Ethical Committee (IR.BHN.REC.1399.034).

Disclosure

The funders of this study had no role in the study design, analysis and interpretation of data, writing of the final paper, and the decision to submit the manuscript for publication. The corresponding author had access to the data in the study and had final responsibility for the decision to submit for publication.

Conflicts of Interest

The authors declare that there is no conflict of interest.

Acknowledgments

This study was financially supported by the Behbahan Faculty of Medical Sciences, Behbahan, Iran (Grant No. 99013).