- About this Journal ·
- Abstracting and Indexing ·
- Advance Access ·
- Aims and Scope ·
- Article Processing Charges ·
- Articles in Press ·
- Author Guidelines ·
- Bibliographic Information ·
- Citations to this Journal ·
- Contact Information ·
- Editorial Board ·
- Editorial Workflow ·
- Free eTOC Alerts ·
- Publication Ethics ·
- Reviewers Acknowledgment ·
- Submit a Manuscript ·
- Subscription Information ·
- Table of Contents
Volume 2011 (2011), Article ID 781643, 15 pages
Comparative Structures and Evolution of Vertebrate Carboxyl Ester Lipase (CEL) Genes and Proteins with a Major Role in Reverse Cholesterol Transport
1Department of Genetics, Texas Biomedical Research Institute, San Antonio, TX 78245-0549, USA
2Southwest National Primate Research Center, Texas Biomedical Research Institute, San Antonio, TX 78245-0549, USA
3School of Biomolecular and Physical Sciences, Griffith University, Nathan, QLD 4111, Australia
Received 25 June 2011; Accepted 30 August 2011
Academic Editor: Akihiro Inazu
Copyright © 2011 Roger S. Holmes and Laura A. Cox. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Bile-salt activated carboxylic ester lipase (CEL) is a major triglyceride, cholesterol ester and vitamin ester hydrolytic enzyme contained within pancreatic and lactating mammary gland secretions. Bioinformatic methods were used to predict the amino acid sequences, secondary and tertiary structures and gene locations for CEL genes, and encoded proteins using data from several vertebrate genome projects. A proline-rich and O-glycosylated 11-amino acid C-terminal repeat sequence (VNTR) previously reported for human and other higher primate CEL proteins was also observed for other eutherian mammalian CEL sequences examined. In contrast, opossum CEL contained a single C-terminal copy of this sequence whereas CEL proteins from platypus, chicken, lizard, frog and several fish species lacked the VNTR sequence. Vertebrate CEL genes contained 11 coding exons. Evidence is presented for tandem duplicated CEL genes for the zebrafish genome. Vertebrate CEL protein subunits shared 53–97% sequence identities; demonstrated sequence alignments and identities for key CEL amino acid residues; and conservation of predicted secondary and tertiary structures with those previously reported for human CEL. Phylogenetic analyses demonstrated the relationships and potential evolutionary origins of the vertebrate CEL family of genes which were related to a nematode carboxylesterase (CES) gene and five mammalian CES gene families.
Bile-salt activated carboxylic ester lipase (CEL; also designated as cholesterol esterase and lysophospholipase) is a major triglyceride, cholesterol ester and vitamin ester hydrolytic enzyme contained within pancreatic and lactating mammary gland secretions [5–10]. CEL is also secreted by the liver and is localized in plasma where it contributes to chylomicron assembly and secretion, the selective uptake of cholesteryl esters in HDL by the liver, LDL lipid metabolism, and reverse cholesterol transport [11–14]. Plasma CEL may also contribute to endothelial cell proliferation, the induction of vascular smooth muscle proliferation, and thrombus formation through interaction with platelet CXCR4 . More recently, CEL expression has been reported in human pituitary glands where it may function in regulating hormone secretion in association with the CEL hydrolytic activity of ceramides .
Structures for several human and animal CEL genes and cDNA sequences have been determined, including human (Homo sapiens) [7, 17–19], gorilla (Gorilla gorilla) , mouse (Mus musculus) [21–23], rat (Rattus norvegicus) [24–26], and cow (Bos taurus) CEL genes [1, 27]. The human CEL gene comprises 11 exons and is localized on chromosome 9 . Several Alu repetitive sequence elements and putative transcription factor binding sites have been identified in the 5′-untranslated (UTR) region, including pancreatic-specific binding sites, which contribute to a high level of expression in the exocrine pancreas [17, 29, 30]. Exon 11 of the human CEL gene encodes a variable number of tandem repeat sequences region (VNTR) (17 repeats are most common) which is highly polymorphic in human populations and contributes to plasma cholesterol and lipid composition . Moreover, rare CEL gene defects in this region are responsible for a monogenically derived diabetes condition called maturity-onset diabetes of the young type 8 (MODY8), also known as diabetes and pancreatic exocrine dysfunction (DPED), which causes a defect in insulin secretion [31, 32].
Human CEL is expressed predominantly in the lactating mammary gland and beta cells of the exocrine pancreas, where the enzyme contributes significantly to triglyceride, cholesterol ester and vitamin ester metabolism [5–10]. CEL also promotes large chylomicron production in the intestine, and its presence in plasma supports interactions with cholesterol and oxidized lipoproteins  which may influence atherosclerosis progression . CEL expression has also been reported in the human pituitary gland, and a possible role for CEL in the regulation of hormone secretion and ceramide metabolism has been described . Studies of Cel−/Cel− knock out mice have shown that other enzymes besides CEL are predominantly responsible for the hydrolysis of dietary cholesteryl esters, retinyl esters, and triglycerides . Metabolic studies of Cel-null mice however have reported that a lack of CEL activity causes an incomplete digestion of milk fat and lipid accumulation by enterocytes in the ileum of neonatal mice which suggests a major role for this enzyme in triglyceride hydrolysis in breast-fed animals [9, 34]. Moreover, reverse cholesterol transport is elevated in carboxyl ester lipase-knockout mice which supports a significant role for this enzyme in the biliary disposal of cholesterol from the body .
A CEL-like gene (designated as CELL) has also been identified on human and gorilla chromosome 9, about 10 kilobases downstream of CEL, which is transcribed in many tissues of the body but lacks exons 2–7 and is unlikely to be translated into protein [17, 20, 35]. The CELL pseudogene gene duplication apparently occurred prior to the separation of Hominidae (man, chimpanzee, gorilla, and orangutan) from Old World monkeys (macaque) with CELL being restricted to genomes of man and the great apes .
Three-dimensional structural analyses of human CEL have shown that the enzyme belongs to the alpha/beta hydrolase fold family with several key structural and catalytic features, including an active site catalytic triad located within the enzyme structure and partially covered by a surface loop, the carboxyl terminus region of the protein which regulates enzymatic activity by forming hydrogen bonds with the surface loop to partially shield the active site, and a loop domain which binds bile salt and frees the active site to access water-insoluble substrates [1, 10, 36–38]. In both conformations, CEL forms dimeric subunit structures with active sites facing each other. The common variant of the human CEL gene contains VNTR repeats, but there is a high degree of polymorphism in the repeated region [31, 32]. While the biological function of the polymorphic repeat region is unknown, it has been suggested that it may be important for protein stability and/or secretion of the enzyme, particularly given that this region contains many O-glycosyl bonds linking carbohydrate residues to the CEL C-terminus, including fucose, galactose, glucosamine, galactosamine, and neuraminic acid residues .
This paper reports the predicted gene structures and amino acid sequences for several vertebrate CEL genes and proteins, the predicted secondary and tertiary structures for vertebrate CEL protein subunits, and the structural phylogenetic and evolutionary relationships for these genes and enzymes with mammalian CES (carboxylesterase) gene families [40, 41].
2.1. Vertebrate CEL Gene and Protein Identification
BLAST (Basic Local Alignment Search Tool) studies were undertaken using web tools from the National Center for Biotechnology Information (NCBI) (http://blast.ncbi.nlm.nih.gov/Blast.cgi) . Protein BLAST analyses used vertebrate CEL amino acid sequences previously described (Table 1). Nonredundant protein sequence databases for several mammalian genomes were examined using the blastp algorithm, including human (Homo sapiens) , chimp (Pan troglodytes) , orangutan (Pongo abelii) (http://genome.wustl.edu/); the cow (Bos Taurus) , horse (Equus caballus) , mouse (Mus musculus) , opossum (Monodelphis domestica) , platypus , chicken (Gallus gallus) , clawed toad (Xenopus tropicalis) , and zebrafish (Danio rerio) . This procedure produced multiple BLAST “hits” for each of the protein databases which were individually examined and retained in FASTA format, and a record kept of the sequences for predicted mRNAs and encoded CEL-like proteins. These records were derived from annotated genomic sequences using the gene prediction method: GNOMON and predicted sequences with high similarity scores for human CEL . Predicted CEL-like protein sequences were obtained in each case and subjected to analyses of predicted protein and gene structures.
BLAT (BLAST-Like Alignment Tool) analyses were subsequently undertaken for each of the predicted vertebrate CEL amino acid sequences using the University of California Santa Cruz (UCSC) Genome Browser (http://genome.ucsc.edu/cgi-bin/hgBlat)  with the default settings to obtain the predicted locations for each of the vertebrate CEL genes, including predicted exon boundary locations and gene sizes. Structures for human, mouse, and rat CEL isoforms (splice variants) were obtained using the AceView website (http://www.ncbi.nlm.nih.gov/IEB/Research/Acembly/index.html?human) to examine predicted gene and protein structures . Alignments of vertebrate CEL sequences with human carboxylesterase (CES) protein sequences were assembled using the ClustalW2 multiple sequence alignment program  (http://www.ebi.ac.uk/Tools/clustalw2/index.html).
2.2. Predicted Structures and Properties of Vertebrate CEL Protein Subunits
Predicted secondary and tertiary structures for vertebrate CEL-like subunits were obtained using the PSIPRED v2.5 web site tools (http://bioinf.cs.ucl.ac.uk/psipred/)  and the SWISS MODEL web tools (http://swissmodel.expasy.org/), respectively [55, 56]. The reported tertiary structure for bovine CEL (PDB: 1aqlB)  served as the reference for the predicted human and zebrafish CEL tertiary structures, with a modeling range of residues 21 to 552. Theoretical isoelectric points and molecular weights for vertebrate CEL subunits were obtained using Expasy web tools (http://au.expasy.org/tools/pi_tool.html). SignalP 3.0 web tools were used to predict the presence and location of signal peptide cleavage sites (http://www.cbs.dtu.dk/services/SignalP/) for each of the predicted vertebrate CEL sequences . The NetNGlyc 1.0 Server was used to predict potential N-glycosylation sites for vertebrate CEL subunits (http://www.cbs.dtu.dk/services/NetNGlyc/).
2.3. Comparative Human (CEL) and Mouse (Cel) Tissue Expression
The UCSC Genome Browser (http://genome.ucsc.edu/)  was used to examine GNF1 Expression Atlas 2 data using various expression chips for human CEL and mouse Cel genes, respectively, (http://biogps.gnf.org/) . Gene array expression “heat maps” were examined for comparative gene expression levels among human and mouse tissues showing high (red), intermediate (black), and low (green) expression levels.
2.4. Phylogenetic Studies and Sequence Divergence
Alignments of vertebrate CEL and human, mouse, and nematode CES-like (carboxylesterase) protein sequences were assembled using BioEdit v.5.0.1 and the default settings . Alignment ambiguous regions were excluded prior to phylogenetic analysis yielding alignments of 480 residues for comparisons of sequences (Table 1). Evolutionary distances were calculated using the Kimura option  in TREECON . Phylogenetic trees were constructed from evolutionary distances using the neighbor-joining method  and rooted with the nematode CES sequence. Tree topology was reexamined by the boot-strap method (100 bootstraps were applied) of resampling, and only values that were highly significant (≥90) are shown .
3. Results and Discussion
3.1. Alignments of Human and Other Vertebrate CEL Subunits
The deduced amino acid sequences for opossum (Monodelphis domestica) and chicken (Gallus gallus) CEL subunits and for zebrafish (Danio rerio) CEL1 and CEL2 subunits are shown in Figure 1 together with the previously reported sequences for human (Homo sapiens) [7, 39], mouse (Mus musculus) , and bovine (Bos taurus) [27, 63] CEL subunits (Table 1). Alignments of the human and other vertebrate CEL subunits examined in this figure showed between 56–80% sequence identities, suggesting that these are products of the same family of genes and proteins (Table 2). The amino acid sequence for human CEL contained 756 residues whereas other vertebrate CEL subunits contained fewer amino acids: 599 residues (cow), 598 residues (mouse), 579 residues (opossum), 556 residues (chicken), and 550 residues for zebrafish CEL1 and CEL2 (Figure 1; Table 1). These differences are predominantly explained by changes in the number of VNTR 11 residue repeats at the respective CEL C-termini, with human CEL containing 17 repeats, whereas bovine, mouse, and opossum CEL C-termini contained only 3, 3, and 1 repeats, respectively, while chicken and zebrafish CEL subunits exhibited no VNTR-like C-terminus sequences. Table 1 summarizes this feature among all of the vertebrate CEL sequences examined and shows that substantial numbers of C-terminus VNTR repeats were predominantly restricted to higher primates, especially gorilla (Gorilla gorilla) (39 repeats) , human (17 repeats) , and rhesus (Macaca mulatta) (15 repeats) CEL, whereas other mammalian CEL subunits usually contained 3 VNTR repeats, with the exception of the predicted dog (Canis familiaris) CELC-terminus, which contained 13 VNTR-repeat sequences. A comparison of the 11-residue repeat sequences for the mammalian CEL subunits examined showed the following consensus sequence: Pro-Val-Pro-Pro-Thr-Gly-Asp-Ser-Glu-Ala-Ala (Figure 2), for which the first 4 residues have been proposed to play a role in facilitating O-glycosylation at the 5th residue (Thr) position .
Several other key amino acid residues for mammalian CEL have been recognized (sequence numbers refer to human CEL) (Figure 1). These include the catalytic triad for the active site (Ser194; Asp320; His435) forming a charge relay network for substrate hydrolysis [10, 64]; the hydrophobic N-terminus signal peptide (residues 1–20) [7, 65]; disulfide bond forming residues (Cys84/Cys100 and Cys266/Cys277) [7, 66]; arginine residues (Arg83/Arg446) which contribute to bile-salt binding and activation [1, 36]; a heparin binding site (residues 21–121); as well as the 11-residue VNTR repeat (×17) at the CEL C-terminus (residues 562–756). Identical residues were observed for each of the vertebrate CEL subunits for the active site triad, disulfide bond forming residues and key arginine residues contributing to bile salt activation, however, the N-terminus 20-residue signal peptide underwent changes in sequence but retained predicted signal peptide properties (Figure 1; Table 1). The N-glycosylation site reported for human CEL at Asn207-Ile208-Thr209  was retained for each of the 22 vertebrate CEL sequences examined, with the exception of platypus (Ornithorhynchus anatinus) CEL which contained two predicted N-glycosylation sites at Asn381-Val382-Thr383 and Asn548-Leu549-Thr550 (Table 3). Predicted N-glycosylation sites were also observed at other positions, including Asn381-Ile382-Thr383 for opossum (Monodelphis domestica) CEL; Asn270-Thr271-Thr272 and Asn381-Leu382-Thr383 for chicken (Gallus gallus) CEL; Asn270-Thr271-Thr272 for lizard (Anolis carolensis); Asn550-Val551-Thr552 for fugu (Takifugu rupides) CEL (Table 3). Given the reported role of the N-glycosylated carbohydrate group in contributing to the stability and maintaining catalytic efficiency of a related enzyme (carboxylesterase or CES1) , this property may be shared by the vertebrate CEL subunits as well, especially for those containing multiple predicted sites for N-glycosylation, such as chicken CEL, which contains three such sites.
3.2. Predicted Secondary and Tertiary Structures of Vertebrate CEL Subunits
Analyses of predicted secondary structures for vertebrate CEL sequences were compared with the previously reported secondary structure for bovine and human CEL [1, 68] (Figure 1). Similar α-helix β-sheet structures were observed for all of the vertebrate CEL subunits examined. Consistent structures were particularly apparent near key residues or functional domains including the β-sheet and α-helix structures near the active site Ser194 (β8/αD) and Asp320 (β10/α8) residues, and the N-glycosylation site at Asn207-Ile208-Thr209 (near β8) . The single helix at the C-termini (αN) for the vertebrate CEL subunits was readily apparent, as were the five β-sheet structures at the N-termini of the CEL subunits (β1–β5). It is apparent from these studies that all of these CEL subunits have highly similar secondary structures.
Figure 3 describes predicted tertiary structures for mouse CEL and zebrafish CEL1 protein sequences which showed significant similarities for these polypeptides with bovine [1, 36] and human CEL . Identification of specific structures within the predicted mouse CEL and zebrafish CEL1 sequences was based on the reported structure for a truncated human CEL which identifies a sequence of twisted β-sheets interspersed with several α-helical structures [10, 68] which are typical of the alpha-beta hydrolase superfamily . The active site CEL triad was centrally located which is similar to that observed in other lipases and esterases [40, 70, 71]. The major difference between CEL and other serine esterases is an apparent insertion at positions 139–146 (for human CEL) (Figure 1 of Supplementary Material available online at doi 10.1155/2011/781643) which appears to act as a surface loop that partially covers the opening to the catalytic triad and allows access to the active site by water soluble substrates by the truncated CEL . This active site loop is also readily apparent in the predicted structures for mouse CEL and zebrafish CEL1. These comparative studies of vertebrate CEL proteins suggest that the properties, structures, and key sequences are substantially retained for all of the vertebrate sequences examined.
3.3. Predicted Gene Locations and Exonic Structures for Vertebrate CEL Genes
Table 1 summarizes the predicted locations for vertebrate CEL genes based upon BLAT interrogations of several vertebrate genomes using the reported sequences for human, gorilla, mouse, rat, and bovine CEL [6, 7, 20, 22–24, 27] and the predicted sequences for other vertebrate CEL proteins and the UCSC Genome Browser . Human and mouse CEL genes were located on human chromosome 9 and mouse chromosome 2, which are distinct to the carboxylesterase (CES for human or Ces for mouse) gene family cluster locations in each case: on human chromosome 16 and mouse chromosome 8, respectively (Table 1; see ). The zebrafish (Danio rerio) genome showed evidence of tandem duplicated CEL genes, with predicted CEL1 and CEL2 genes being located about 7.3 kilobases apart on zebrafish chromosome 21 (Table 1). This is in contrast with many other gene duplication events during zebrafish evolution that have occurred predominantly by polyploidisation or duplication of large chromosomal segments rather than by tandem gene duplication .
Figure 1 summarizes the predicted exonic start sites for cow, opossum, chicken, and zebrafish CEL genes with each having 11 exons, in identical or similar positions to those reported for the human CEL and mouse Cel genes [17, 22, 23]. In contrast, human CES1 [73, 74], CES2, CES3 [75, 76], CES4 , and CES5 [77, 78] genes contained 14, 12, 13, 14, and 13 exons, respectively, which are predominantly in distinct positions to those described for vertebrate CEL genes, with the exception of the last exon in each case (Figure 1 of Supplementary Material). Consequently, even though CEL and CES genes and proteins are members of the same serine hydrolase superfamily [10, 40], it is apparent that CEL is not a close relative of the CES gene family, for which at least five genes are clustered on a single chromosomes on human and mouse chromosomes and are more similar in gene structure to each other than they are to the CEL gene (Figure 1 of Supplementary Material; see ).
Figure 4 illustrates the predicted structures of mRNAs for human and mouse CEL transcripts for the major transcript isoform in each case . The transcripts were 10.5 and 7.6 kilobases in length, respectively, with 10 introns and 11 exons present for these CEL mRNA transcripts. The human CEL genome sequence contained a microRNA site (miR485-5p) located in the 3′-untranslated region and a CpG island (CpG51). The occurrence of the CpG island within the CEL gene may reflect a role in regulating gene expression  which may contribute to a higher than average gene expression level reported for human CEL (×1.5 times higher). Figure 2 of Supplementary Material shows a nucleotide sequence alignment diagram for the CpG51 region of the human CEL gene in comparison with several other mammalian and other vertebrate CEL genes. The Multiz alignment patterns observed demonstrated extensive sequence conservation for the CpG island which contains dinucleotide and trinucleotide repeat sequences in most genomes examined.
The prediction of a microRNA (miRNA; miR485-5p) binding site in the 3′untranslated region of human CEL is also potentially of major significance for the regulation of this gene. MicroRNAs are small noncoding RNAs that regulate mRNA and protein expression and have been implicated in regulating gene expression during embryonic development . Moreover, a recent study of a related miRNA gene (miR-375) has been recently shown to be selectively expressed in pancreatic islets and has been implicated both in the development of islets and the function of mature pancreatic beta cells . A similar role may be played by miR485-5p with respect to the regulation of CEL expression during pancreatic beta cell development. Table 1 of Supplementary Material presents comparative nucleotide sequences for miR485-5p-like CEL gene regions for several vertebrate genomes which shows high levels of sequence identity, particularly among mammalian CEL miRNA target sites and suggests that this site has been predominantly conserved during vertebrate evolution, particularly by eutherian mammalian CEL genes.
Figure 5 shows a UCSC Genome Browser Comparative Genomics track that shows evolutionary conservation and alignments of the nucleotide sequences for the human CEL gene, including the 5′-flanking, 5′-untranslated, intronic, exonic, and 3′untranslated regions of this gene, with the corresponding sequences for 10 vertebrate genomes, including 5 eutherian mammals (e.g., mouse, rat), a marsupial (opossum), a monotreme (platypus), and lower vertebrate genomes. Extensive conservation was observed among these genomic sequences, particularly for the rhesus CEL gene and for other eutherian mammalian genomes. In contrast with the eutherian mammalian genomes examined, other vertebrate genomes retained conserved sequences only within the 11 exonic CEL regions. It would appear that exonic CEL nucleotide sequences have been conserved throughout vertebrate evolution whereas only in eutherian mammalian genomes have other regions of the CEL gene been predominantly conserved.
3.4. Comparative Human and Mouse CEL Tissue Expression
Figure 6 presents “heat maps” showing comparative gene expression for various human and mouse tissues obtained from GNF Expression Atlas Data using the GNF1H (human) and GNF1M (mouse) chips (http://genome.ucsc.edu/; http://biogps.gnf.org/) . These data supported a high level of tissue expression for human CEL, particularly for the pancreatic islets, the pituitary gland, and fetal liver, which is consistent with previous reports for these genes (see ). High levels of CEL gene expression have also been reported for the human mammary gland where CEL plays a major role in lipid digestion in breast milk by neonates . The localization of CEL within the human pituitary gland is of major interest as this enzyme also hydrolyzes ceramides , which suggests a possible role in the regulation of hormone secretion in both normal and adenomatous pituitary cells . A high level of expression of CEL in mouse tissues was also observed (×3.4 times average expression) (Figure 4), particularly for the pancreas, mammary gland, and spleen (Figure 5), where similar metabolic roles for this enzyme in cholesterol ester, retinyl ester, and triglyceride hydrolysis and metabolism have been described . Recent metabolic studies using Cel−/Cel− (knock-out) mice (or CELKO mice) have demonstrated that CEL is not an essential enzyme for these metabolic functions [82, 83], although CELKO neonatal mice exhibit an incomplete digestion of milk fat [9, 84], and in adult CELKO mice causes an elevation in reverse cholesterol transport (RCT) in adult animals . The latter finding is potentially of major clinical significance for this enzyme, given that any increase in RCT and the associated increased biliary disposal of cholesterol may contribute to preventing atherosclerosis [85, 86].
3.5. Phylogeny and Divergence of Vertebrate CEL and Mammalian/Nematode CES Sequences
A phylogenetic tree (Figure 7) was calculated by the progressive alignment of human and other vertebrate CEL amino acid sequences with human and mouse CES1, CES2, CES3, CES4, and CES5 sequences. The phylogram was “rooted” with a nematode CES sequence and showed clustering of the CEL sequences which were distinct from the human and mouse CES families. In addition, the zebrafish CEL1 and CEL2 sequences showed clustering within the fish CEL sequences examined, which is consistent with these genes being products of a recent duplication event during teleost fish evolution. Overall, these data suggest that the vertebrate CEL gene arose from a gene duplication event of an ancestral CES-like gene, resulting in at least two separate lines of gene evolution for CES-like and CEL-like genes. This is supported by the comparative biochemical and genomic evidence for vertebrate CEL and CES-like genes and encoded proteins, which share several key features of protein and gene structure, including having similar alpha-beta hydrolase secondary and tertiary structures [10, 40, 41, 71, 78] (Figure 1 of Supplementary Material).
In conclusion, the results of the present study indicate that vertebrate CEL genes and encoded CEL enzymes represent a distinct alpha-beta hydrolase gene and enzyme family which share key conserved sequences and structures that have been reported for the human CES gene families. CEL is a major triglyceride, cholesterol ester and vitamin ester hydrolytic enzyme contained within exocrine pancreatic and lactating mammary gland secretions and is also localized in plasma where it contributes to chylomicron assembly and secretion, in the selective uptake of cholesteryl esters in HDL in the liver and in reverse cholesterol transport, including biliary disposal of cholesterol. Bioinformatic methods were used to predict the amino acid sequences, secondary and tertiary structures and gene locations for CEL genes, and encoded proteins using data from several vertebrate genome projects. A proline-rich and O-glycosylated 11-amino acid C-terminal repeat sequence (VNTR) previously reported for human and other higher primate CEL proteins was also observed for other eutherian mammalian CEL sequences examined. Opossum CEL, however, contained a single C-terminal copy of this sequence while CEL proteins from lower vertebrates lacked the VNTR sequence. Evidence is presented for tandem duplicated CEL genes for the zebrafish genome. Vertebrate CEL protein subunits shared 53–97% sequence identities and exhibited sequence alignments and identities for key CEL amino acid residues as well as extensive conservation of predicted secondary and tertiary structures with those previously reported for human CEL. Phylogenetic analyses demonstrated the relationships and potential evolutionary origins of the vertebrate CEL family of genes which were related to a nematode carboxylesterase (CES) gene and five mammalian CES gene families. These studies indicated that CEL genes have apparently appeared early in vertebrate evolution prior to the teleost fish common ancestor more than 500 million years ago .
This project was supported by NIH Grants P01 HL028972 and P51 RR013986. In addition, this investigation was conducted in facilities constructed with support from Research Facilities Improvement Program Grant nos. 1 C06 RR13556, 1 C06 RR15456, and 1 C06 RR017515. The authors gratefully acknowledge the assistance of Dr Bharet Patel of Griffith University Brisbane Australia with the phylogeny studies.
- X. Wang, C. S. Wang, J. Tang, F. Dyda, and X. C. Zhang, “The crystal structure of bovine bile salt activated lipase: insights into the bile salt activation mechanism,” Structure, vol. 5, no. 9, pp. 1209–1218, 1997.
- D. Thierry-Mieg and J. Thierry-Mieg, “AceView: a comprehensive cDNA-supported gene and transcripts annotation,” Genome Biology, vol. 7, p. S12, 2006.
- A. I. Su, T. Wiltshire, S. Batalov et al., “A gene atlas of the mouse and human protein-encoding transcriptomes,” Proceedings of the National Academy of Sciences of the United States of America, vol. 101, no. 16, pp. 6062–6067, 2004.
- W. J. Kent, C. W. Sugnet, T. S. Furey et al., “The human genome browser at UCSC,” Genome Research, vol. 12, no. 6, pp. 996–1006, 2002.
- J. Nilsson, L. Blackberg, P. Carlsson, S. Enerback, O. Hernell, and G. Bjursell, “cDNA cloning of human-milk bile-salt stimulated lipase and evidence for its identity to pancreatic carboxylic ester hydrolase,” European Journal of Biochemistry, vol. 192, no. 2, pp. 543–550, 1990.
- D. Y. Hui and J. A. Kissel, “Sequence identity between human pancreatic cholesterol esterase and bile salt-stimulated milk lipase,” FEBS Letters, vol. 276, no. 1-2, pp. 131–134, 1990.
- T. Baba, “Structure of human milk bile salt activated lipase,” Biochemistry, vol. 30, no. 2, pp. 500–510, 1991.
- L. Nyberg, A. Farooqi, L. Bläckberg, R. D. Duan, Å. Nilsson, and O. Hernell, “Digestion of ceramide by human milk bile salt-stimulated lipase,” Journal of Pediatric Gastroenterology and Nutrition, vol. 27, no. 5, pp. 560–567, 1998.
- P. N. Howles, G. N. Stemmerman, C. M. Fenoglio-Preiser, and D. Y. Hui, “Carboxyl ester lipase activity in milk prevents fat-derived intestinal injury in neonatal mice,” American Journal of Physiology, vol. 277, no. 3, pp. G653–G661, 1999.
- D. Y. Hui and P. N. Howles, “Carboxyl ester lipase: structure-function relationship and physiological role in lipoprotein metabolism and atherosclerosis,” Journal of Lipid Research, vol. 43, no. 12, pp. 2017–2030, 2002.
- R. Shamir, W. J. Johnson, K. Morlock-Fitzpatrick et al., “Pancreatic carboxyl ester lipase: a circulating enzyme that modifies normal and oxidized lipoproteins in vitro,” Journal of Clinical Investigation, vol. 97, no. 7, pp. 1696–1704, 1996.
- R. J. Kirby, S. Zheng, P. Tso, P. N. Howles, and D. Y. Hui, “Bile salt-stimulated carboxyl ester lipase influences lipoprotein assembly and secretion in intestine: a process mediated via ceramide hydrolysis,” Journal of Biological Chemistry, vol. 277, no. 6, pp. 4104–4109, 2002.
- S. H. Bengtsson-Ellmark, J. Nilsson, M. Orho-Melander, K. Dahlenborg, L. Groop, and G. Bjursell, “Association between a polymorphism in the carboxyl ester lipase gene and serum cholesterol profile,” European Journal of Human Genetics, vol. 12, no. 8, pp. 627–632, 2004.
- L. M. Camarota, L. A. Woollett, and P. N. Howles, “Reverse cholesterol transport is elevated in carboxyl ester lipase-knockout mice,” FASEB Journal, vol. 25, no. 4, pp. 1370–1377, 2011.
- L. Panicot-Dubois, G. M. Thomas, B. C. Furie, B. Furie, D. Lombardo, and C. Dubois, “Bile salt-dependent lipase interacts with platelet CXCR4 and modulates thrombus formation in mice and humans,” Journal of Clinical Investigation, vol. 117, no. 12, pp. 3708–3719, 2007.
- S. La Rosa, D. Vigetti, C. Placidi et al., “Localization of carboxyl ester lipase in human pituitary gland and pituitary adenomas,” Journal of Histochemistry and Cytochemistry, vol. 58, no. 10, pp. 881–889, 2010.
- U. Lidberg, J. Nilsson, K. Stromberg et al., “Genomic organization, sequence analysis, and chromosomal localization of the human carboxyl ester lipase (CEL) gene and a CEL-like (CELL) gene,” Genomics, vol. 13, no. 3, pp. 630–640, 1992.
- B. V. Kumar, J. A. Aleman-Gomez, N. Colwell et al., “Structure of the human pancreatic cholesterol esterase gene,” Biochemistry, vol. 31, no. 26, pp. 6077–6081, 1992.
- K. Madeyski, U. Lidberg, G. Bjursell, and J. Nilsson, “Structure and organization of the human carboxyl ester lipase locus,” Mammalian Genome, vol. 9, no. 4, pp. 334–338, 1998.
- K. Madeyski, U. Lidberg, G. Bjursell, and J. Nilsson, “Characterization of the gorilla carboxyl ester lipase locus, and the appearance of the carboxyl ester lipase pseudogene during primate evolution,” Gene, vol. 239, no. 2, pp. 273–282, 1999.
- C. H. Warden, R. C. Davis, M. Y. Yoon et al., “Chromosomal localization of lipolytic enzymes in the mouse: pancreatic lipase, colipase, hormone-sensitive lipase, hepatic lipase, and carboxyl ester lipase,” Journal of Lipid Research, vol. 34, no. 8, pp. 1451–1455, 1993.
- K. Mackay and R. M. Lawn, “Characterization of the mouse pancreatic/mammary gland cholesterol esterase-encoding cDNA and gene,” Gene, vol. 165, no. 2, pp. 255–259, 1995.
- A. S. Lidmer, M. Kannius, L. Lundberg, G. Bjursell, and J. Nilsson, “Molecular cloning and characterization of the mouse carboxyl ester lipase gene and evidence for expression in the lactating mammary gland,” Genomics, vol. 29, no. 1, pp. 115–122, 1995.
- J. A. Kissel, R. N. Fontaine, C. W. Turck, H. L. Brockman, and D. Y. Hui, “Molecular cloning and expression of cDNA for rat pancreatic cholesterol esterase,” Biochimica et Biophysica Acta, vol. 1006, no. 2, pp. 227–236, 1989.
- R. N. Fontaine, C. P. Carter, and D. Y. Hui, “Structure of the rat pancreatic cholesterol esterase gene,” Biochemistry, vol. 30, no. 28, pp. 7008–7014, 1991.
- X. Chen, E. H. Harrison, and E. A. Fisher, “Molecular cloning of the cDNA for rat hepatic, bile salt-dependent cholesteryl ester/retinyl ester hydrolase demonstrates identity with pancreatic carboxylester lipase,” Proceedings of the Society for Experimental Biology and Medicine, vol. 215, no. 2, pp. 186–191, 1997.
- E. M. Kyger, R. C. Wiegand, and L. G. Lange, “Cloning of the bovine pancreatic cholesterol esterase/lysophospholipase,” Biochemical and Biophysical Research Communications, vol. 164, no. 3, pp. 1302–1309, 1989.
- A. K. Taylor, J. L. Zambaux, I. Klisak et al., “Carboxyl ester lipase: a highly polymorphic locus on human chromosome 9qter,” Genomics, vol. 10, no. 2, pp. 425–431, 1991.
- M. Kannius-Janson, U. Lidberg, K. Hultén, A. Gritli-Linde, G. Bjursell, and J. Nilsson, “Studies of the regulation of the mouse carboxyl ester lipase gene in mammary gland,” Biochemical Journal, vol. 336, no. 3, pp. 577–585, 1998.
- M. Kannius-Janson, U. Lidberg, G. Bjursell, and J. Nilsson, “The tissue-specific regulation of the carboxyl ester lipase gene in exocrine pancreas differs significantly between mouse and human,” Biochemical Journal, vol. 351, no. 2, pp. 367–376, 2000.
- H. Ræder, S. Johansson, P. I. Holm et al., “Mutations in the CEL VNTR cause a syndrome of diabetes and pancreatic exocrine dysfunction,” Nature Genetics, vol. 38, no. 1, pp. 54–62, 2006.
- J. Torsvik, S. Johansson, A. Johansen et al., “Mutations in the VNTR of the carboxyl-ester lipase gene (CEL) are a rare cause of monogenic diabetes,” Human Genetics, vol. 127, no. 1, pp. 55–64, 2010.
- W. Weng, L. Li, A. M. Van Bennekum et al., “Intestinal absorption of dietary cholesteryl ester is decreased but retinyl ester absorption is normal in carboxyl ester lipase knockout mice,” Biochemistry, vol. 38, no. 13, pp. 4143–4149, 1999.
- R. Miller and M. E. Lowe, “Carboxyl ester lipase from either mother's milk or the pancreas is required for efficient dietary triglyceride digestion in suckling mice,” Journal of Nutrition, vol. 138, no. 5, pp. 927–930, 2008.
- J. Nilsson, M. Hellquist, and G. Bjursell, “The human carboxyl ester lipase-like (CELL) gene is ubiquitously expressed and contains a hypervariable region,” Genomics, vol. 17, no. 2, pp. 416–422, 1993.
- J. C. H. Chen, L. J. W. Miercke, J. Krucinski et al., “Structure of bovine pancreatic cholesterol esterase at 1.6 Å: novel structural features involved in lipase activation,” Biochemistry, vol. 37, no. 15, pp. 5107–5117, 1998.
- Y. Liang, R. Medhekar, H. L. Brockman, D. M. Quinn, and D. Y. Hui, “Importance of arginines 63 and 423 in modulating the bile salt-dependent and bile salt-independent hydrolytic activities of rat carboxyl ester lipase,” Journal of Biological Chemistry, vol. 275, no. 31, pp. 24040–24046, 2000.
- E. Aubert-Jousset, V. Sbarra, and D. Lombardo, “Site-directed mutagenesis of the distal basic cluster of pancreatic bile salt-dependent lipase,” Journal of Biological Chemistry, vol. 279, no. 38, pp. 39697–39704, 2004.
- C. S. Wang, A. Dashti, K. W. Jackson, J. C. Yeh, R. D. Cummings, and J. Tang, “Isolation and characterization of human milk bile salt-activated lipase C-tail fragment,” Biochemistry, vol. 34, no. 33, pp. 10639–10644, 1995.
- M. Cygler, J. D. Schrag, J. L. Sussman et al., “Relationship between sequence conservation and three-dimensional structure in a large family of esterases, lipases, and related proteins,” Protein Science, vol. 2, no. 3, pp. 366–382, 1993.
- R. S. Holmes, M. W. Wright, S. J. F. Laulederkind et al., “Recommended nomenclature for five mammalian carboxylesterase gene families: human, mouse, and rat genes and proteins,” Mammalian Genome, vol. 21, no. 9-10, pp. 427–441, 2010.
- F. Altschul, V. Vyas, A. Cornfield, et al., “Basic local alignment search tool,” Journal of Molecular Biology, vol. 215, pp. 403–410, 1997.
- International Human Genome Sequencing Consortium, “Initial sequencing and analysis of the human genome: international human genome sequencing consortium,” Nature, vol. 409, pp. 860–921, 2001.
- Chimpanzee Sequencing and Analysis Consortium, “Initial sequence of the chimpanzee genome and comparison with the human genome,” Nature, vol. 437, pp. 69–87, 2005.
- Bovine Genome Project, 2008, http://hgsc.bcm.tmc.edu/projects/bovine.
- Horse Genome Project, 2008, http://www.uky.edu/Ag/Horsemap/.
- R. H. Waterston, K. Lindblad-Toh, E. Birney et al., “Initial sequencing and comparative analysis of the mouse genome,” Nature, vol. 420, no. 6915, pp. 520–562, 2002.
- T. S. Mikkelsen, M. J. Wakefield, B. Aken et al., “Genome of the marsupial Monodelphis domestica reveals innovation in non-coding sequences,” Nature, vol. 447, no. 7141, pp. 167–177, 2007.
- W. C. Warren, L. W. Hillier, J. A. Marshall Graves et al., “Genome analysis of the platypus reveals unique signatures of evolution,” Nature, vol. 453, no. 7192, pp. 175–183, 2008.
- International Chicken Genome Sequencing Consortium, “Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution,” Nature, vol. 432, pp. 695–716, 2004.
- U. Hellsten, R. M. Harland, M. J. Gilchrist et al., “The genome of the western clawed frog Xenopus tropicalis,” Science, vol. 328, no. 5978, pp. 633–636, 2010.
- J. Sprague, L. Bayraktaroglu, Y. Bradford et al., “The Zebrafish Information Network: the zebrafish model organism database provides expanded support for genotypes and phenotypes,” Nucleic Acids Research, vol. 36, no. 1, pp. D768–D772, 2008.
- M. A. Larkin, G. Blackshields, N. P. Brown et al., “Clustal W and Clustal X version 2.0,” Bioinformatics, vol. 23, no. 21, pp. 2947–2948, 2007.
- L. J. McGuffin, K. Bryson, and D. T. Jones, “The PSIPRED protein structure prediction server,” Bioinformatics, vol. 16, no. 4, pp. 404–405, 2000.
- N. Guex and M. C. Peitsch, “SWISS-MODEL and the Swiss-PdbViewer: an environment for comparative protein modeling,” Electrophoresis, vol. 18, no. 15, pp. 2714–2723, 1997.
- J. Kopp and T. Schwede, “The SWISS-MODEL repository of annotated three-dimensional protein structure homology models,” Nucleic Acids Research, vol. 32, pp. D230–D234, 2004.
- O. Emanuelsson, S. Brunak, G. von Heijne, and H. Nielsen, “Locating proteins in the cell using TargetP, SignalP and related tools,” Nature Protocols, vol. 2, no. 4, pp. 953–971, 2007.
- T. A. Hall, “BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT,” Nucleic Acids Symposium Series, vol. 41, pp. 95–99, 1999.
- M. Kimura, The Neutral Theory of Molecular Evolution, Cambridge University Press, Cambridge, UK, 1983.
- Y. van de Peer and R. De Wachter, “Treecon for windows: a software package for the construction and drawing of evolutionary trees for the microsoft windows environment,” Bioinformatics, vol. 10, no. 5, pp. 569–570, 1994.
- N. Saitou and M. Nei, “The neighbor-joining method: a new method for reconstructing phylogenetic trees,” Molecular Biology and Evolution, vol. 4, no. 4, pp. 406–425, 1987.
- J. Felsenstein, “Confidence limits on phylogenies: an approach using the bootstrap,” Evolution, vol. 39, pp. 783–789, 1985.
- H. Tanaka, I. Mierau, and F. Ito, “Purification and characterization of bovine pancreatic bile salt-activated lipase,” Journal of Biochemistry, vol. 125, no. 5, pp. 883–890, 1999.
- L. P. DiPersio, R. N. Fontaine, and D. Y. Hui, “Site-specific mutagenesis of an essential histidine residue in pancreatic cholesterol esterase,” Journal of Biological Chemistry, vol. 266, no. 7, pp. 4033–4036, 1991.
- G. von Heijne, “Patterns of amino acids near signal-sequence cleavage sites,” European Journal of Biochemistry, vol. 133, no. 1, pp. 17–21, 1983.
- O. Lockridge, S. Adkins, and B. N. La Du, “Location of disulfide bonds within the sequence of human serum cholinesterase,” Journal of Biological Chemistry, vol. 262, no. 27, pp. 12945–12952, 1987.
- D. L. Kroetz, O. W. McBride, and F. J. Gonzalez, “Glycosylation-dependent activity of baculovirus-expressed human liver carboxylesterases: cDNA cloning and characterization of two highly similar enzyme forms,” Biochemistry, vol. 32, no. 43, pp. 11606–11617, 1993.
- S. Terzyan, C. S. Wang, D. Downs, B. Hunter, and X. C. Zhang, “Crystal structure of the catalytic domain of human bile salt activated lipase,” Protein Science, vol. 9, no. 9, pp. 1783–1790, 2000.
- Y. Mechref, P. Chen, and M. V. Novotny, “Structural characterization of the N-linked oligosaccharides in bile salt-stimulated lipase originated from human breast milk,” Glycobiology, vol. 9, no. 3, pp. 227–234, 1999.
- F. K. Winkler, A. D'Arcy, and W. Hunziker, “Structure of human pancreatic lipase,” Nature, vol. 343, no. 6260, pp. 771–774, 1990.
- S. Bencharit, C. L. Morton, Y. Xue, P. M. Potter, and M. R. Redinbo, “Structural basis of heroin and cocaine metabolism by a promiscuous human drug-processing enzyme,” Nature Structural Biology, vol. 10, no. 5, pp. 349–356, 2003.
- J. H. Postlethwait, Y. L. Yan, M. A. Gates et al., “Vertebrate genome evolution and the zebrafish gene map,” Nature Genetics, vol. 18, no. 4, pp. 345–349, 1998.
- J. S. Munger, G. P. Shi, E. A. Mark, D. T. Chin, C. Gerard, and H. A. Chapman, “A serine esterase released by human alveolar macrophages is closely related to liver microsomal carboxylesterases,” Journal of Biological Chemistry, vol. 266, no. 28, pp. 18832–18838, 1991.
- F. Shibata, Y. Takagi, M. Kitajima, T. Kuroda, and T. Omura, “Molecular cloning and characterization of a human carboxylesterase gene,” Genomics, vol. 17, no. 1, pp. 76–82, 1993.
- S. P. Sanghani, S. K. Quinney, T. B. Fredenburg, W. I. Davis, D. J. Murry, and W. F. Bosron, “Hydrolysis of irinotecan and its oxidative metabolites, 7-ethyl-10-[4-N-(5-aminopentanoic acid)-1-piperidino] carbonyloxycamptothecin and 7-ethyl-10-[4-(1-piperidino)-1-amino]-carbonyloxycamptothecin, by human carboxylesterases CES1A1, CES2, and a newly expressed carboxylesterase isoenzyme, CES3,” Drug Metabolism and Disposition, vol. 32, no. 5, pp. 505–511, 2004.
- R. S. Holmes, L. A. Cox, and J. L. VandeBerg, “Mammalian carboxylesterase 3: comparative genomics and proteomics,” Genetica, vol. 138, no. 7, pp. 695–708, 2010.
- T. Ota, Y. Suzuki, T. Nishikawa et al., “Complete sequencing and characterization of 21,243 full-length human cDNAs,” Nature Genetics, vol. 36, no. 1, pp. 40–45, 2004.
- R. S. Holmes, L. A. Cox, and J. L. VandeBerg, “Mammalian carboxylesterase 5: comparative biochemistry and genomics,” Comparative Biochemistry and Physiology Part D, vol. 3, no. 3, pp. 195–204, 2008.
- S. Saxonov, P. Berg, and D. L. Brutlag, “A genome-wide analysis of CpG dinucleotides in the human genome distinguishes two distinct classes of promoters,” Proceedings of the National Academy of Sciences of the United States of America, vol. 103, no. 5, pp. 1412–1417, 2006.
- G. Stefani and F. J. Slack, “Small non-coding RNAs in animal development,” Nature Reviews Molecular Cell Biology, vol. 9, no. 3, pp. 219–230, 2008.
- T. Avnit-Sagi, L. Kantorovich, S. Kredo-Russo, E. Hornstein, and M. D. Walker, “The promoter of the pri-miR-375 gene directs expression selectively to the endocrine pancreas,” PLoS ONE, vol. 4, no. 4, Article ID e5033, 2009.
- A. M. van Bennekum, L. Li, R. Piantedosi et al., “Carboxyl ester lipase overexpression in rat hepatoma cells and CEL deficiency in mice have no impact on hepatic uptake or metabolism of chylomicron-retinyl ester,” Biochemistry, vol. 38, no. 13, pp. 4150–4156, 1999.
- D. Gilham, E. D. Labonté, J. C. Rojas, R. J. Jandacek, P. N. Howles, and D. Y. Hui, “Carboxyl ester lipase deficiency exacerbates dietary lipid absorption abnormalities and resistance to diet-induced obesity in pancreatic triglyceride lipase knockout mice,” Journal of Biological Chemistry, vol. 282, no. 34, pp. 24642–24649, 2007.
- X. Li, S. Lindquist, M. Lowe, L. Noppa, and O. Hernell, “Bile salt-stimulated lipase and pancreatic lipase-related protein 2 are the dominating lipases in neonatal fat digestion in mice and rats,” Pediatric Research, vol. 62, no. 5, pp. 537–541, 2007.
- J. W. Jukema, M. Lenselink, G. J. de Grooth, et al., “Enhancing reverse cholesterol transport/raising HDL cholesterol: new options for prevention and treatment of cardiovascular disease,” Netherlands Heart Journal, vol. 12, pp. 491–496, 2004.
- A. E. van der Velde, “Reverse cholesterol transport: from classical view to new insights,” World Journal of Gastroenterology, vol. 16, no. 47, pp. 5908–5915, 2010.
- P. C. J. Donoghue and M. J. Benton, “Rocks and clocks: calibrating the tree of life using fossils and molecules,” Trends in Ecology and Evolution, vol. 22, no. 8, pp. 424–431, 2007.