Research Article | Open Access
Censored Data Analysis Reveals Effects of Age and Hepatitis C Infection on C-Reactive Protein Levels in Healthy Adult Chimpanzees (Pan troglodytes)
C-reactive protein, a conserved acute-phase protein synthesized in the liver and involved in inflammation, infection, and tissue damage, is an informative biomarker for human cardiovascular disease. Out of 258 captive adult common chimpanzees (Pan troglodytes) assayed for CRP, 27.9% of the data were below the quantitation limit. Data were analyzed by the Kaplan-Meier method and results compared to other methods for handling censored data (including deletion, replacement, and imputation). Kaplan-Meier results demonstrated a modest age effect and a strong effect of HCV infection in reducing CRP but did not allow inference of reference intervals. Results of other methods varied considerably. Substitution schemes differed widely in statistical significance, with estimated group means biased by the size of the substitution constant, while inference of unbiased reference intervals was impossible. Single imputation gave reasonable statistical inferences but unreliable reference intervals. Multiple imputation gave reliable results, for both statistical inference and reference intervals, and was comparable to the Kaplan-Meier standard. Other methods should be avoided. CRP did not predict cardiovascular disease, but CRP levels were reduced by 50% in animals with hepatitis C infection and showed inverse relationships with 2 liver function enzymes. Results suggested that hsCRP can be an informative biomarker of chronic hepatic dysfunction.
C-reactive protein (CRP), a phylogenetically highly conserved protein, has become an important biomarker of acute inflammation and tissue damage in humans [1–3]. CRP is an important biomarker for many aspects of health and disease, including cardiovascular disease, type 2 diabetes, and chronic renal disease and is a predictor of all-cause mortality [2, 4–7]. CRP is synthesized by hepatocytes when induced by cytokines including IL-6 [1, 4]. Circulating plasma levels can rise during the acute-phase response to inflammation, infection, or trauma by 10,000-fold and decrease just as rapidly [1, 2, 8]. CRP has many biological functions related to the recognition and clearance of foreign pathogens and damaged host cells, binding chromatin and small nuclear ribonucleoproteins, which suggested a role in clearance of debris due to apoptosis and necrosis [2, 4, 8]. CRP stimulates the classical complement pathway [1, 2, 4]. Its activation by the same Fc receptors used by IgG, and its earlier response to infection, suggested a role in inducing an adaptive immune response .
CRP is also involved in the development of atherosclerotic lesions and plaque disruption [8, 9]. Epidemiological evidence for the role of inflammation in the etiology of coronary heart disease, myocardial infarction, and peripheral vascular disease has indicated a role for CRP as a biomarker for cardiovascular disease [8, 10–12]. CRP was a better overall predictor of cardiovascular events than LDL cholesterol, conferred additional prognostic information to Framingham risk scores, and reduce all-cause mortality among otherwise asymptomatic humans [10, 13–15]. In a 16-year long prospective study, high levels of hsCRP (>3 mg/L) were associated with a 2-fold increased risk of all-cause mortality relative to low hsCRP levels (<1 mg/L) in healthy humans . Meta-analyses have consistently demonstrated a strong association () between elevated CRP levels and major coronary events, [16, 17] although there remains some debate over the most informative cut-off values. A 7-year followup study found that very high levels of hsCRP (>10 mg/L) were more strongly associated with risk of clinical cardiovascular disease and with all-cause mortality, compared to merely high (3–10 mg/L) levels [5, 11, 18].
Cardiovascular disease (CVD) is the primary cause of morbidity and mortality in captive chimpanzees [19–25]. This suggested the potential utility of CRP as a biomarker of CVD in aging chimpanzees, not unlike other biomarker studies [19, 20]. Similarly, hepatitis C has been associated with reduced CRP levels in humans [26–28]. Historical use of chimpanzees in studies of viral hepatitis  suggested a role for hsCRP as a biomarker of hepatic damage. There was one earlier study of CRP in chimpanzees , but results were limited due to small sample size () and wide age range (<1 yr to 44 yr old) with subadults not distinguished from adults (>10 yr old); sex differences were not evaluated, and associations with CVD or hepatitis infection were not investigated . Finally, out-dated “normal ranges” were uncritically defined, rather than using reliable methods from human laboratory medicine [20, 31]. Therefore, it was of interest to evaluate CRP as a potential biomarker for CVD and hepatic dysfunction and to define reference intervals in a large captive population of chimpanzees.
At the time of this cross-sectional study, the Alamogordo Primate Facility (APF) housed 258 adult research-reserve chimpanzees (Pan troglodytes) primarily descended from the West African P.t. verus subspecies . The APF animal program and facilities were fully accredited by AAALAC, with animals maintained in same-sex group housing, to comply with the NIH breeding moratorium . The study was fully approved by the ACUC and conducted in accordance with the Guide for the Care and Use of Laboratory Animals . Animals were maintained in socially compatible groups in indoor dens (180 ft2, 9.5 ft high) with radiant heated floors and air conditioning, 24 hr access to outdoor dens (242 ft2), and weekly access to outdoor play yards (802 ft2). Diet consisted of commercial primate chow (Purina Lab Diet Monkey Diet Jumbo 5LR2) plus daily fresh fruits and vegetables delivered to simulate naturalistic foraging opportunities. All animals received a complete physical examination and health assessment annually . Prior (<2001) experimental exposures resulted in 61 animals with HCV infection (HCV antibody and PCR positive, >103 HCV genome equivalents/mL), and the remainder (196) not infected (Table 1). Seven subadults (<10 years old) on-site were excluded from analyses [19, 20, 35].
2.2. hsCRP Assay
All 258 adults (116 female, 142 male) were assayed for hsCRP. Ten mL whole blood samples were collected in serum separator tubes, separated by low-speed centrifugation, transferred into sterile vials, and shipped overnight on ice to a clinical reference laboratory (Tricore Industries, Albuquerque, NM). Samples were tested for hsCRP with a rate turbidimetric immunoassay using polyclonal goat and mouse anti-CRP antibody bound to latex particles [36, 37]. The detection limit (DL, sometimes mistakenly called analytical sensitivity) was 0.06 mg/L, below the level recommended to predict cardiovascular events . The reporting or quantitation limit (QL), below which non-linearity and high error relative to signal render single reportable numbers unreliable, was 0.4 mg/L, well within reported limits (mean ) for other hsCRP assays [38–40]. Unreliable estimates below the QL threshold are termed left censored and require use of statistical methods designed for analysis of left-censored data .
To validate the use of the human CRPH assay, chimpanzee CRP DNA sequence (NCBI entry XM_001170732) and its deduced amino acid sequence were aligned with the human homologues (NCBI NM_000567)  and species-specific differences identified with ClustalX2 [41, 42]. The effects of species-specific amino acid substitutions on the structure and function of chimpanzee CRP, relative to the human protein, were inferred using Polyphen-2, an on-line protein structure/function server, based on biochemical and comparative principles [43, 44]. Chimpanzee CRP mRNA showed 9 nucleotide (nt) substitutions and 2 indels relative to the human sequence. Of these, 3 substitutions (bases 367, 374 and 601) occurred in the actual protein coding region, 2 of which resulted in non-synonymous substitutions. The first (nt position 367) involved an A/G transition at the second codon position and resulted in a non-synonymous amino acid change, from aspartic acid in humans to glycine in chimpanzees (D88G). The second (nt 374) involved an A/G transition at the third wobble codon position and was silent. The third (nt 601) involved an A/G transition at the second codon position and resulted in a non-synonymous amino acid change, from glycine in humans to glutamic acid in chimpanzees (G166E). Both coding mutations occurred in the pentaxin domain, D88G in a beta-pleated sheet, and G166E in an alpha helix . Neither occurred at a critical site in the mature protein, such as calcium binding sites or disulfide bonds [3, 45]. Structural predictions using PolyPhen [43, 44] indicated that both mutations were benign (D88G, Polyphen ; G166E, Polyphen ). The mouse CRP protein sequence (Uniprot P14847) exhibited 70 amino acid differences (29.4%) from the human protein sequence, including alanine at mutated position 166, compared to only 2 differences (0.9%) between humans and chimpanzees. In short, there was no evidence that either of the 2 non-synonymous mutations would alter the binding efficiency or reduce the reliability of the CRPH assay in chimpanzees.
2.3. Statistical Methods
All statistical analyses were performed on SYSTAT Version 11.0 (SYSTAT Software, Inc., Richmond, CA). We hypothesized that hsCRP levels would be elevated in unhealthy versus healthy animals, and that HCV-infected animals would have reduced CRP levels, compared to uninfected animals [20, 28, 31]. The Kaplan-Meier non-parametric product-limit method (KM) was used to analyze left-censored data, after “flipping” the data by subtraction from a large constant, resulting in a right censored dataset [39, 46, 47]. Choice of constant is arbitrary but does not affect results . Statistical significance was assessed by Tarone-Ware statistics, which are intermediate in value between log-rank and Wilcoxon statistics . KM was considered the inferential standard of reference for the other statistical methods described below. KM does have some limitations. Estimated means are unreliable due to extensive skew of non-normally distributed survival data . The median is more robust to skew and outliers . But depending on the pattern and extent of censoring, the exact median cannot always be estimated by KM. Therefore, KM does not necessarily allow reliable estimation of effect sizes. Finally, unless censored data are replaced by probable values, the bottom end of the distribution remains missing, and inference of reliable reference intervals is not possible. Therefore, other statistical methods for analyzing censored data (substitution, single MLE imputation, and multiple imputation) were used to replace censored observations with probable values. Those results were compared to the KM standard and also used to estimate reference intervals. ANOVA was used to analyze continuous data in the substitution and imputation datasets, with statistical significance determined by omnibus F statistics and single degree-of-freedom focused comparisons [50, 51]. Age and sex were used as covariates because they influence the distribution of health, disease, and CRP levels in populations [3, 19–21, 52]. The Shapiro-Wilks goodness-of-fit test rejected the assumption of a Normal (Gaussian) distribution (, ). Substitution datasets could not be normalized, due to clustering of identical substituted values at the bottom of the distribution (Figure 1(a)). The case-deletion, MLE, and MI datasets were normalized with a transformation [11, 18, 53–55]. Transformation efficacy was confirmed visually (Figure 1(b)) and with coefficient-based tests (G1/skew and G2/kurtosis) [31, 56]. Outliers were detected and eliminated using the robust interquartile method . Categorical data were analyzed with contingency table methods, with statistical significance determined by likelihood ratio statistics [57, 58].
2.4. Data Imputation
hsCRP levels were assayed for all 258 adult chimpanzees (≥10 yr old). Of these, 185 adults (88 females, 98 males) had reliably quantified hsCRP levels, while 72 animals (28 females, 44 males) had unreliable (sub-threshold) levels (Table 1). This degree of censoring is considered moderate . These data involved type I censoring, characterized by a fixed cut-off value at the quantitation limit (QL) and a variable number of censored observations . (Type II censoring involves a variable cut-off value but a fixed number of censored observations . For example, an LD50 study would terminate after observing 50% mortality, regardless of the survival times of the remaining study subjects . Type II censoring will not be further discussed.) We used Little’s  missing completely at random (MCAR) test statistic and contingency table methods [57, 58] to test for nonrandom patterns of hsCRP data missingness (nondetected or observed) relative to other factors (age, sex, hepatitis C infection, cardiovascular disease, and health status).
Many scientific fields, from medicine to zoology, encounter an otherwise peculiar feature of data, wherein some observations fail to exceed a lower (or exceed an upper) threshold of detection [20, 39, 46, 47]. Such unobserved data are called censored, and the observations are called nondetects. Censoring that occurs relative to a lower limit is called left censored. In environmental epidemiology, many environmental pollutants occur at such minute concentrations that they cannot be reliably measured . Left censoring raises the question whether nondetects represented true zero concentrations, or whether they were very small, but nonzero quantities only reported as zero, due to technical constraints . Censoring that occurs relative to an upper limit is called right censored. Data on patients in prospective epidemiological studies who survive to the last date of followup are right censored, because the event of interest (such as mortality or cancer remission) was not observed during the study’s timeframe . Patients alive at last observation will have a minimum survival duration. Similarly, very high biomarker concentrations will exceed the limit of linearity (LOL) of the concentration/signal response, resulting in a minimum concentration [38, 39]. The presence of lower or upper limits result in informative or non-ignorable censoring . Although exact values remain unknown, censored data are informative because they are known to be less than (or greater than) some lower (or upper) threshold [19, 39]. The class of statistical methods developed to handle this troublesome feature of censored data is known as censored data analysis (CDA) methods.
The first CDA method compared was casewise deletion, which simply deleted all nondetects from the analysis . Casewise deletion remains widely used , even though exclusion of left-censored nondetects means elimination of the smallest values, which introduces upward bias into parameter estimates [39, 65, 66]. We also compared the effects of 3 CDA methods to replace nondetects with single numerical estimates. First were substitution methods, which replace all nondetects with an arbitrary constant (typically zero, some function of QL, or the mean) [62, 67]. Problems with substitution methods include the arbitrary choice of constant and underestimation of variability associated with the use of a constant rather than a randomly distributed variable . Three substitution datasets were created by replacing each nondetect with a constant (zero, QL/2, and QL).
The second CDA method used maximum likelihood estimation (MLE) for single imputation. The mean () and standard deviation () of the lognormal distribution assumed to generate the censored hsCRP dataset were estimated from observed data [68, 69]. The assumption that the true underlying distribution of observed hsCRP values was lognormal [67, 68] was confirmed by the Anderson-Darling test and by coefficient-based tests . Realistic values were simulated for all nondetects by random sampling from a lognormal distribution defined by the estimated parameters . Simulated values below the QL were considered eligible as replacements and randomly substituted for all nondetects [11, 55, 71, 72].
The last CDA method used the NORM statistical software for multiple imputation (MI) . MI datasets were formed by imputing probable values for all hsCRP nondetects, based on information derived from the variance/covariance matrix plus random error [74, 75]. Imputed values were only considered eligible substitutions for nondetects if they were less than the QL, in order to preserve the informative left-censoring mechanism, and were randomly substituted for nondetects. For data missing at random (non-informative censoring), as few as imputed datasets may be needed [75, 76]. Due to the informative nature of left-censored data, we used more imputations (), as recommended [64, 77]. Regression coefficients and estimated group means were averaged across imputations, and within and between variance components estimated following Rubin’s method [76, 78]. Results from the Kaplan-Meier (KM) non-parametric method  were treated as the standard of comparison for the other CDA methods.
2.5. Reference Intervals
The KM approach to censored data does not substitute probable imputed values for nondetects but relies on analysis of the original censored dataset. Use of the casewise deletion dataset, in which all of the lowest values remain unobserved (nondetects), will yield biased reference intervals. Therefore, the substitution and imputation datasets were compared for their reliability in reference interval estimation. There are two common mistakes in estimating so-called “normal ranges” for clinical analytes. The first is to uncritically apply statistical methods based on the normal distribution to nonnormally distributed variables (biomarkers and other clinical analytes), rather than using appropriate methods from laboratory medicine [19–21, 31]. The second mistake is to assume that statistically significant covariates are automatically useful as clinical guidelines [19–21, 31]. Partitioning requires justification that the resulting subgroups have less reduced variability and narrower reference intervals relative to pooled data [31, 79, 80]. We used several guidelines to determine when to partition into subgroups. A statistic that exceeded the critical value of 5.0 was sufficient justification for partitioning. Alternately, a and either ≥10% reduction in subgroup standard deviations relative to pooled data, or a ratio of standard deviations ≥1.5, also justified partitioning [31, 79, 80]. Reference intervals were estimated using the non-parametric Harrell-Davis bootstrap for large () samples [81, 82] or the robust method for smaller samples [67, 83]. All calculations were performed on MedCalc version 220.127.116.11 .
3.1. Nonrandom Missingness of hsCRP
We first tested for the presence of bias or nonrandom patterns of missing hsCRP data relative to the covariates (see Table 1). There was no association of hsCRP (observed, nondetects) with sex (, ) or decade of life (, ). hsCRP levels had a near-significant association with heart disease (, ). There were significant associations between hsCRP and both HCV infection (, ) and health status (, ). These associations were quite strong. HCV-infected animals were 6.4 times more likely to have hsCRP nondetects, compared to uninfected animals (Little’s MCAR , ). Similarly, healthy animals were 2.9 times more likely to have hsCRP nondetects, compared to unhealthy animals (Little’s MCAR , ). These associations signaled a need to account for the effects of HCV infection and health status in subsequent analyses.
3.2. Determinants of Serum hsCRP Level
We analyzed the different hsCRP datasets (casewise deletion, substitution, single MLE substitution, and multiple imputation) using the covariates age, sex, HCV infection, and cardiovascular disease. The casewise deletion dataset () was smaller than all other CDA datasets (). Results of the Kaplan-Meier survival analysis were treated as the standard, against which the other strategies were compared.
Neither sex (, ) nor CVD status (, ) had significant associations with hsCRP levels. As a quantitative covariate, age had a significant linear effect on hsCRP (, ). Each year of life increased hsCRP by 0.027 mg/L. At that rate, a 10-year-old chimpanzee with 1.00 mg/L hsCRP would be expected to rise to 1.31 mg/L by the age of 20 yr. But when categorized by decade of life, the age trend failed to reach significance (, ; 10–19 yr, 0.59 mg/L; 20–29 yr, 0.68 mg/L; 30+ yr, 0.76 mg/L; overall mean equaled 0.66 mg/L; see Table 2). Health had a modest effect (, ), with healthy animals having lower median hsCRP (0.62 mg/L) than sick animals (0.79 mg/L; see Table 2). HCV infection strongly reduced hsCRP levels (, ). Infected animals had an estimated median of 0.30 mg/L of hsCRP, compared to 0.79 mg/L for uninfected animals (Table 2). These KM results were taken as the standard for comparison of other CDA methods.
|Tarone-Ware statistics ( value) for 5 covariates (HCV infection, decade of life, sex, health, and CVD). Data was the right censored “flipped” version of the original left-censored data. All tests were on 1 degree of freedom, except decade which used 2 degrees of freedom. Estimated medians were interpolated from the ranked KM results. |
3.4. Casewise Deletion
The casewise deletion dataset was analyzed next (). Neither sex nor health nor CVD status had any effect on hsCRP levels (Table 3). HCV status was highly significant (, ; Table 3). Modeled as a quantitative covariate, age had a statistically significant effect of hsCRP (, ) but its effect was relatively weak (). A 10-year-old animal with 1.00 mg/L of hsCRP would be expected to rise to 1.22 mg/L by the age of 20 yr. When categorized by decade of life, there was an insignificant tendency for hsCRP to increase linearly across decades (, ; see Table 3). Specifically, 10–19 yr olds had lower mean levels of hsCRP (0.91 mg/L) than 20–29 yr olds (1.02 mg/L), which were lower than the 30+ yr olds (1.14 mg/L).
|-statistics ( value) for 5 covariates (HCV infection, age by decade of life, sex, health, and cardiovascular disease), for each of 6 different datasets used to handle missing data. Datasets include casewise deletion (the original left-censored data with 72 sub-threshold nondetects), 3 substitution datasets [QL, QL/2 or Zero], MLE (a single imputation based on maximum likelihood estimation from a log-normal distribution), and MI (multiple imputation of estimates based on the variance-covariance matrix plus random error). |
1A single degree of freedom linear contrast (−1 0 +1) was used to test the hypothesis that hsCRP increased linearly with decade of life (10–19 yr olds, 20–29 yr olds, and 30+ yr olds).
3.5. Substitution Methods
ANOVA analyses of the 3-substitution datasets (zero, QL/2, and QL) indicated agreement on lack of significance for sex and CVD and significant effects for HCV infection and age (Table 3). But results for health status varied by substitution model. Health was not significant for QL substitution, but it was significant for QL/2 and especially zero substitution (Table 3). The main difference between substitution models was that estimated effect sizes (group means) for age and HCV status reflected the size of the substitution constant (Figures 2(a) and 2(c); Tables 4 and 6).
|Expected level of hsCRP (mg/L) by decade of life. For datasets wherein decade of life was not statistically significant, group means were not relevant, cells were combined, and the single cell entry reflects the single overall mean. Number for the KM standard refers to estimated median level. All others refer to estimated means.|
3.6. MLE Single Imputation
We then analyzed the MLE single imputation dataset. Results indicated that age, health, and HCV infection status all significantly influenced hsCRP levels (Table 3). But there was no effect of sex or heart disease (Table 3).
3.7. Multiple Imputation
We then analyzed the MI datasets. Results indicated that neither sex nor CVD were associated with hsCRP levels (Table 3). However, hsCRP was significantly associated with HCV infection, health status, and decade of life (Table 3).
3.8. Comparison of CDA Methods
Comparison of these analyses revealed several trends in the results of the different CDA methods. First, casewise deletion always yielded the largest estimated group means, while zero substitution always yielded the smallest (Figures 2(a), 2(b), and 2(c); Tables 4, 5, and 6). This was because exclusion of the smallest (left censored) values by casewise deletion inevitably introduced upward bias in estimated group means. Conversely, zero substitution introduced downward bias. The other CDA methods yielded group means intermediate to these extremes. Secondly, substitution methods introduced severe bias. Specifically, estimated group means from the substitution datasets were related to the magnitude of the substitution constant. For decade, health and HCV infection, estimated means were largest for QL, intermediate for QL/2, and lowest for zero substitution (Figures 2(a), 2(b), and 2(c); Tables 4, 5, and 6). Substitution methods also failed to distinguish different group means by decade of life, resulting instead in a constant mean (Table 4; Figure 2(a), flat horizontal lines). By contrast, MLE and MI detected more subtle differences among group means already identified by KM (Figures 2(a), 2(b), and 2(c), sloped lines across factor levels). And only MI detected the significant increase in hsCRP levels across decade of life (Figure 2(a); Table 4). Substitution methods performed erratically for health status. QL failed to detect significant group differences, but QL/2 and zero detected some differences (Figure 2(b); Table 5). Casewise deletion failed to detect a significant effect of health status, while MLE and MI detected large increases in hsCRP by health (Figure 2(b); Table 5). Finally, all CDA methods agreed in detection of lower levels of hsCRP in HCV+ animals (Figure 2(c); Table 6). This suggested that the effect of HCV infection on hsCRP was so strong that even biased methods could easily detect it. Note that the between-group difference was more or less constant, as indicated by parallel lines (Figure 2(c); Table 6), except MI, which detected a steeper drop in hsCRP for HCV+ animals. The real issue concerns the reliability of the inferred group means depending on the CDA method used, with substitution methods being particularly subject to bias.
|Expected mean level of hsCRP (mg/L), by HCV infections status (infected or uninfected). Numbers for the KM standard refer to estimated median levels. All others refer to estimated means. |
|Expected mean level of hsCRP (mg/L), by HCV infections status (not infected, infected). All datasets detected significant group differences, but resulted in different expected group means. Numbers for the KM standard refer to estimated median levels. All others refer to estimated means. |
3.9. Association of hsCRP with Liver Enzymes
We then tested for an association of hsCRP with 2 standard enzymes used to examine hepatocyte damage, alanine aminotransferase (Alt), and alkaline phosphatase (Alp) . hsCRP was trichotomized (nondetect, <1.0 mg/L, and ≥1.0 mg/L). Archival data on Alt and Alp levels in healthy adult chimpanzees were logn transformed to induce normality, then trichotomized by tertiles (<89, 89–125.9, and ≥126 U/L). There was a strong inverse association between Alp and hsCRP levels (, ). Specifically, a chimpanzee with sub-threshold levels of hsCRP was 1.8 times more likely to have high rather than low levels of Alp, compared to chimpanzees with high levels of hsCRP. Alt was likewise trichotomized (<34 U/L, 34–47.9, and ≥48 U/L) and similar results were obtained. There was a highly significant inverse association between Alt and hsCRP (, ). Specifically, chimpanzees with sub-threshold levels of hsCRP were 11.8 times more likely to have high rather than low levels of Alt, compared to chimpanzees with high levels of hsCRP.
The effects of HCV infection on the liver enzymes, Alp and Alt, were further analyzed with ANOVA. For Alp, neither sex nor age were significant as covariates (, ; and , ). But HCV infections was a powerful predictor of Alp levels (, ). Chimpanzees with HCV infection averaged 128.8 U/mL of Alp, for an increase of 22% from average level for non-infected animals (105.6 U/mL). For Alt, age was not significant (, ). There was a significant sex difference (, ) with males having higher levels of Alt (55.0 U/mL) than females (43.9 U/mL). HCV infection was again a strong predictor of Alp levels (, ). Chimpanzees with HCV infection averaged 66.3 U/mL of Alp, representing an increase of 82% from average levels for non-infected animals (36.5 U/mL).
3.10. Reference Intervals
To determine justification for separate reference intervals, all statistically significant covariates in each CDA dataset (Table 3) were evaluated with partitioning tests (Table 7). For casewise deletion, the only significant covariate was HCV infection (Table 3). Results of the partitioning test did not suggest a need for separate reference intervals. Therefore, all observed data regardless of HCV infection status were used to construct a single 90% reference interval for all animals. This interval ranged from 0.40 mg/L to 3.37 mg/L, with a median of 0.80 mg/L (Figure 3(a)).
|(a) Health status|
|(b) Decade of life|
|(c) HCV infection status|
|Listed factors (health, decade, and HCV) were considered clinically significant and separate reference intervals constructed if , , or . Only statistically significant covariates were tested (see Table 2). n/a means the factor was not significant and was not tested. See text for discussion. |
For the substitution datasets, partitioning tests justified separate intervals by HCV status. For uninfected animals, the substitution datasets yielded medians (0.7 mg/L) and 95th percentiles (3.3 mg/L) identical to the casewise deletion and MLE intervals (Figure 3(a)). The only differences were in their lower (5%ile) boundaries, which varied according to the size of the substitution constant (0, 0.2, and 0.4). Substitution of a constant eliminated variation at the bottom of these distributions, so the robust method could not be used to estimate reference intervals for HCV-infected animals in the substitution datasets . So reference intervals for the 3 substitution datasets were not presented.
For the MLE dataset, HCV negative animals had an upper boundary (3.3 mg/L) and a median (0.7 mg/L) nearly identical to casewise deletion. But the bottom 5th percentile (0.2 mg/L) was considerably lower in MLE (0.2 mg/L) than casewise deletion (0.4 mg/L). For HCV-infected animals, use of the necessary robust procedure  with the small sample size resulted in a biologically impossible negative value (−0.1 mg/L) for the lower 5th percentile (Figure 3(a)). Both upper and lower MLE boundaries were more extreme than the MI intervals (Figure 3(a)). This was probably because random substitution used in MLE was unconstrained by the need to select statistically probable values based on the variance/covariance matrix, as implemented in the MI procedure.
For the MI datasets, partitioning tests did not justify collapsing by health or decade of life, but HCV was highly significant (Table 7). The 90% reference interval for HCV uninfected chimpanzees was (0.3, 3.3) mg/L (Figure 3(b)). The 90% reference interval for HCV-infected chimpanzees was (0, 0.7) mg/L (Figure 3(b)). HCV-infected animals had median hsCRP levels (0.35 mg/L) half the size of uninfected animals (0.70 mg/L). The range of variation was 4 times greater among HCV non-infected than infected animals (Figure 3(b)). This may reflect the presence of other, as-yet unidentified conditions that influence hsCRP levels in uninfected animals, as in humans . Median hsCRP level for HCV-negative animals (0.7 mg/L) was higher than the upper 90% boundary (0.3 mg/L) for HCV-positive animals, and equaled their 90th percentile. This small overlap indicated that hsCRP is considerably reduced in HCV-infected animals. In contrast to MLE, the MI dataset did not result in a negative lower boundary for HCV+ animals (Figures 3(a) and 3(b)).
hsCRP was of interest for its potential role as a biomarker of CVD and hepatic damage in chimpanzees. One problematic feature of these data was the presence of a moderate degree (28%) of censoring. The Kaplan-Meier nonparametric product-limit method for right censored survival data has well-understood properties but has seldom been applied to left-censored biomarker data [39, 47]. KM results showed that hsCRP levels were weakly associated with age, modestly associated with ill health, and strongly associated with hepatitis C infection. For age, there was a significant positive linear effect on hsCRP levels, and the trend was nearly significant when categorized by decade. For health status, sick animals had estimated median hsCRP (0.62 mg/L), 27% higher than healthy animals (0.79 mg/L). HCV-infected animals had estimated median hsCRP levels (0.3 mg/L) reduced by 62% relative to noninfected animals (0.79 mg/L).
The KM results provided a standard for comparison of the performance of other CDA methods, all of which replaced nondetects with single specific values by different methods [76, 77]. Overall, the MI results most closely replicated the KM standard and also gave more precise results, such as group means, than KM. MLE single imputation gave reasonable results, even without utilizing information from the variance/covariance matrix, like MI. The casewise deletion dataset was highly upward biased, giving the largest estimated group means, while sometimes failing to detect significant effects. Inferences from the substitution datasets varied widely, making it difficult to identify any single “best” substitution constant. For example, only QL/2 identified the presence of a significant health effect known by KM results to exist (Tables 2 and 5). Furthermore, bias was evident in the group means estimated from all the substitution datasets, with the extent of bias determined by the size of the substitution constant.
Regardless of CDA method, hsCRP was not associated with the presence of cardiovascular disease. This association has been observed repeatedly in prospective human studies [5, 7, 18, 53]. Its absence in chimpanzees was particularly surprising because chimpanzee CVD is characterized by myocardial fibrosis [22, 25] which is probably induced by hypertension [20, 21], and hsCRP is known to promote myocardial fibrosis . The reason(s) for this lack of association in chimpanzees probably cannot be answered without detailed analysis of long-term (20+ years) followup data. Such prospective studies would also be needed to determine if hsCRP is also associated with non-vascular mortality, as observed in humans . Also consistent with KM results, no other CDA method identified a sex difference. The lack of a sex difference in hsCRP levels has been repeatedly observed in human studies [55, 72, 87, 88].
The significant effect of health status on hsCRP in KM was identified in all but 2 CDA datasets (casewise deletion, QL). Health has also been shown to influence chimpanzee blood pressure [20, 21]. Interestingly, serum levels of hsCRP in healthy chimpanzees estimated by KM (0.62 mg/L) and MI (0.54 mg/L) were less than mean levels in healthy humans (0.8 mg/L; see also Table 5) .
For age, only MI identified the significant linear effect of age on hsCRP levels. For KM, this age trend declined to near-significance () when categorized by decade, while for MI the decade effect remained significant (; Tables 2 and 3). The presence of a modest age effect has been reported in numerous human studies [35, 37, 72, 90–93]. This age trend was observed earlier in chimpanzees, although not reported as such. Reanalysis of published chimpanzee data showed near-significant effect of age on hsCRP levels (, ), with subadults (<10 yo) 3.9 times more likely to have low (<1.0 mg/L) hsCRP levels than adults (≥10 yo;) . Similarly, in the present study, 6 of 7 subadults excluded from analysis had sub-threshold levels of hsCRP. These observations suggest that elevated hsCRP may be a salient clinical characteristic of geriatric chimpanzees that deserves further study.
All CDA methods confirmed the KM result that hsCRP levels declined with HCV infection. But effect sizes varied widely among methods. Estimated KM medians showed that HCV infection reduced serum hsCRP to less than half the level (0.30 mg/L) of uninfected animals (0.79 mg/L). These results were reasonably closely approximated by MLE (0.39 mg/L infected; 0.83 mg/L uninfected) and MI (0.42 mg/L infected, 0.89 mg/L uninfected). This difference, known in humans, has not previously been reported in chimpanzees [26–28, 89]. Contingency table analysis also indicated that HCV-infected animals were 6.4 times more likely to have sub-threshold hsCRP levels, relative to uninfected animals. These results indicated that reduced hsCRP levels (including nondetects) could be useful for monitoring and early detection of progressive liver dysfunction related to hepatitis C infection. Similar findings in humans led to a recommendation that separate reference intervals be estimated for HCV-infected individuals . In humans, hsCRP levels initially rose in response to liver inflammation due to hepatitis infection, then declined with persistent infection [28, 89]. This pattern was thought to result from reduced CRP production in the liver secondary to progressive destruction of liver tissue [28, 89]. Analysis of 2 liver enzymes, Alt and Alp, demonstrated that liver dysfunction was associated with reduced serum hsCRP. High enzyme levels were associated with low hsCRP levels. These liver enzymes were also elevated by 22–82% in HCV-infected chimpanzees. The elevation of liver enzymes in HCV-infected animals, while hsCRP was reduced, suggested specific impairment of CRP synthesis, as in human studies [28, 89]. Thus, hsCRP appears to be an informative biomarker for long-term liver damage in HCV-infected chimpanzees. Confirmation of this etiology will require monitoring long-term (20+ yr) changes in hsCRP levels, across the course of hepatitis infection, plus postmortem evidence of hepatocellular damage.
Overall, the 6 CDA methods for handling nondetects produced different outcomes, including statistical significance of covariates, estimated group means, and reference intervals. Compared to the KM standard, the best method was MI, which it most closely resembled. The worst method was casewise deletion, which gave highly biased results. Casewise deletion even failed to detect the powerful influence of HCV infection in reducing hsCRP levels, found in all other datasets. Consequently, the casewise deletion method should be avoided in biomarker analyses. Substitution methods performed poorly because they failed to identify significant covariates, while estimated group means depended strongly on the substitution constant used to replace nondetects. Such bias is a serious limitation, because estimation of effect sizes is the primary reason for use of statistical methods in the first place. Constant substitution methods should always be avoided in biomarker analysis. Finally, MLE performed better than casewise deletion and substitution methods but was less precise than MI for statistical inference. Although the values imputed by MLE derived probabilistically from an underlying lognormal distribution, they did not reflect probable values of hsCRP. That is because, unlike MI, MLE does not utilize information in the variance/covariance matrix. And MLE’s reliance on a single imputed value fails to account for random variation in estimated nondetects . Overall, MI gave results most similar to the KM standard. Although KM is a well-accepted method in survival analysis, it has not been used widely for biomarker analyses . The primary limitations of KM are that estimated group means do not represent real effect sizes, because skew in the original censored data introduces bias, but medians may not be directly estimable depending on the pattern of censoring. Therefore, the best analytical method for handling left-censored biomarker data was MI.
Regarding reference intervals, KM used the original left-censored data, equivalent to casewise deletion, which yielded upward biased reference intervals. Substitution methods biased the lower reference limit, with each lower limit equal to the substitution constant. Furthermore, substitution of a constant prevented use of the robust method for estimation of reference intervals for HCV-infected animals, which was another severe limitation of substitution methods. The MLE reference intervals were wider than the MI reference intervals, probably due to the inability of the MLE method to utilize information from the variance : covariance matrix. These limitations are further reasons to avoid casewise deletion, substitution, and MLE methods for biomarker analysis. For MI, the effect of decade of life was statistically significant but too small (0.08 mg/L/decade) to improve clinical precision and did not justify separate reference intervals. Separate reference intervals by HCV infection status were indicated in all datasets except casewise deletion. MI produced more reasonable intervals, with unbiased and positive lower boundaries (unlike casewise deletion, substitution, and MLE). Furthermore, the reduction in median hsCRP among infected (0.4 mg/L) versus uninfected chimpanzees (0.7 mg/L; Figure 3(b)) was comparable to the reduction observed among HCV-infected humans relative to healthy controls  that these differences were thought to result from cumulative liver damage from chronic HCV infection, [26–28, 50, 89] suggested the need to distinguish chimpanzee reference intervals based on HCV infection status.
In summary, hsCRP data was moderately (27.9%) leftcensored. Missing data, although common, are not commonly reported. Standard reporting practices should include a description of the pattern and degree of missingness and the probable censoring mechanism, because different censoring mechanisms can bias results in different ways. Bias can also result from use of inappropriate analytical methods. Put another way, the presence of nondetects depends on the censoring mechanism, while the extent of bias depends on how missing data are handled in analysis. Casewise deletion ignores missing data and results in upward bias. Substitution methods introduce bias depending on the size of the substitution constant and should also be avoided. MLE-based single substitution was adequate for statistical inference but inadequate for reference intervals, because it gave biologically impossible negative values for lower boundaries. KM analysis using flipped censored data can be considered the analytical standard. But KM estimates of mean effects will be biased due to skew, while median effect sizes may not be estimable at all, depending on the pattern and extent of censoring. MI gave the best results for both statistical inference and reference intervals, although it was somewhat more complicated to implement. Overall, results demonstrated that hsCRP decreased significantly with HCV infection, increased moderately with health status, and increased modestly with age. We urge the use of KM or MI to handle left-censored biomarker data by other researchers, because other methods will yield biased and unreliable results. We recommend use of the hsCRP reference values estimated from the MI data (Figure 3(b)), as clinical guidelines for evaluating liver dysfunction in captive chimpanzees, in conjunction with liver enzymes, Alt and Alp.
|ANOVA:||Analysis of variance|
|CDA:||Censored data analysis|
|hsCRP:||High-sensitivity C-reactive protein|
|MLE:||Maximum likelihood estimation|
Conflict of Interests
The authors declare no conflict of interests.
The authors thank Brian Corning, D.V.M., and an anonymous reviewer for helpful comments on the paper. This study was supported by NIH Contract no. HHSN268201100065C.
- S. Black, I. Kushner, and D. Samols, “C-reactive protein,” Journal of Biological Chemistry, vol. 279, no. 47, pp. 48487–48490, 2004.
- G. M. Hirschfield and M. B. Pepys, “C-reactive protein and cardiovascular disease: new insights from an old molecule,” Monthly Journal of the Association of Physicians, vol. 96, no. 11, pp. 793–807, 2003.
- T. B. Ledue and N. Rifai, “Preanalytic and analytic sources of variations in C-reactive protein measurement: Implications for cardiovascular disease risk assessment,” Clinical Chemistry, vol. 49, no. 8, pp. 1258–1271, 2003.
- T. W. Du Clos, “Function of C-reactive protein,” Annals of Medicine, vol. 32, no. 4, pp. 274–278, 2000.
- M. Hamer, Y. Chida, and E. Stamatakis, “Association of very highly elevated C-reactive protein concentration with cardiovascular events and all-cause mortality,” Clinical Chemistry, vol. 56, no. 1, pp. 132–135, 2010.
- A. D. Pradhan, J. E. Manson, N. Rifai, J. E. Buring, and P. M. Ridker, “C-reactive protein, interleukin 6, and risk of developing type 2 diabetes mellitus,” Journal of the American Medical Association, vol. 286, no. 3, pp. 327–334, 2001.
- J. Zacho, A. Tybjærg-Hansen, and B. G. Nordestgaard, “C-reactive protein and all-cause mortality-the Copenhagen City Heart Study,” European Heart Journal, vol. 31, no. 13, pp. 1624–1632, 2010.
- OMIM, C-reactive protein, pentraxin-related, CRP, OMIM entry #123260, 2006, http://omim.org/entry/123260.
- R. Ross, “Atherosclerosis-an inflammatory disease,” The New England Journal of Medicine, vol. 340, pp. 115–126, 1999.
- A. R. Folsom, L. E. Chambless, C. M. Ballantyne et al., “An assessment of incremental coronary risk prediction using C-reactive protein and other novel risk markers: the atherosclerosis risk in communities study,” Archives of Internal Medicine, vol. 166, no. 13, pp. 1368–1373, 2006.
- T. A. Pearson, G. A. Mensah, R. W. Alexander et al., “Markers of inflammation and cardiovascular disease: application to clinical and public health practice: a statement for healthcare professionals from the centers for disease control and prevention and the American Heart Association,” Circulation, vol. 107, no. 3, pp. 499–511, 2003.
- P. M. Ridker, M. Cushman, M. J. Stampfer, R. P. Tracy, and C. H. Hennekens, “Plasma concentration of C-reactive protein and risk of developing peripheral vascular disease,” Circulation, vol. 97, no. 5, pp. 425–428, 1998.
- S. Mora, K. Musunuru, and R. S. Blumenthal, “The clinical utility of high-sensitivity C-reactive protein in cardiovascular disease and the potential implication of JUPITER on current practice guidelines,” Clinical Chemistry, vol. 55, no. 2, pp. 219–228, 2009.
- P. M. Ridker, N. Rifai, L. Rose, J. E. Buring, and N. R. Cook, “Comparison of C-reactive protein and low-density lipoprotein cholesterol levels in the prediction of first cardiovascular events,” The New England Journal of Medicine, vol. 347, no. 20, pp. 1557–1565, 2002.
- P. M. Ridker, J. E. Buring, N. R. Cook, and N. Rifai, “C-reactive protein, the metabolic syndrome, and risk of incident cardiovascular events: an 8-year follow-up of 14 719 initially healthy American women.,” Circulation, vol. 107, no. 3, pp. 391–397, 2003.
- J. Danesh, R. Collins, P. Appleby, and R. Peto, “Association of fibrinogen, C-reactive protein, albumin, or leukocyte count with coronary heart disease: meta-analyses of prospective studies,” Journal of the American Medical Association, vol. 279, no. 18, pp. 1477–1482, 1998.
- J. Danesh, P. Whincup, W. Walker et al., “Low grade inflammation and coronary heart disease: prospective study and updated meta-analyses,” British Medical Journal, vol. 321, pp. 199–204, 2000.
- W. Koenig, N. Khuseyinova, J. Baumert, and C. Meisinger, “Prospective study of high-sensitivity C-reactive protein as a determinant of mortality: results from the MONICA/KORA Augsburg cohort study, 1984–1998,” Clinical Chemistry, vol. 54, no. 2, pp. 335–342, 2008.
- J. J. Ely, M. A. Bishop, M. L. Lammey, M. M. Sleeper, J. M. Steiner, and D. R. Lee, “Use of biomarkers of collagen types I and III fibrosis metabolism to detect cardiovascular and renal disease in chimpanzees (Pan troglodytes),” Comparative Medicine, vol. 60, no. 2, pp. 154–158, 2010.
- J. J. Ely, T. Zavaskis, M. L. Lammey, M. M. Sleeper, and D. R. Lee, “Association of brain-type natriuretic protein and cardiac troponin I with incipient cardiovascular disease in chimpanzees (Pan troglodytes),” Comparative Medicine, vol. 61, no. 2, pp. 163–169, 2011.
- J. J. Ely, T. Zavaskis, M. L. Lammey, and D. Rick Lee, “Blood pressure reference intervals for healthy adult chimpanzees (Pan troglodytes),” Journal of Medical Primatology, vol. 40, no. 3, pp. 171–180, 2011.
- M. L. Lammey, G. B. Baskin, A. P. Gigliotti, D. R. Lee, J. J. Ely, and M. M. Sleeper, “Interstitial myocardial fibrosis in a captive chimpanzee (Pan troglodytes) population,” Comparative Medicine, vol. 58, no. 4, pp. 389–394, 2008.
- B. M. Seiler, E. J. Dick, R. Guardado-Mendoza et al., “Spontaneous heart disease in the adult chimpanzee (Pan troglodytes),” Journal of Medical Primatology, vol. 38, no. 1, pp. 51–58, 2009.
- M. M. Sleeper, C. J. Doane, P. H. Langner, S. Curtis, K. Avila, and D. R. Lee, “Successful treatment of idiopathic dilated cardiomyopathy in an adult chimpanzee (Pan troglodytes),” Comparative Medicine, vol. 55, no. 1, pp. 80–84, 2005.
- N. Varki, D. Anderson, J. G. Herndon et al., “Heart disease is common in humans and chimpanzees, but is caused by different pathological processes,” Evolutionary Applications, vol. 2, no. 1, pp. 101–112, 2009.
- M. Floris-Moore, A. A. Howard, Y. Lo, E. E. Schoenbaum, J. H. Arnsten, and R. S. Klein, “Hepatitis C infection is associated with lower lipids and high-sensitivity C-reactive protein in HIV-infected men,” AIDS Patient Care and STDs, vol. 21, no. 7, pp. 479–491, 2007.
- M. M. Nascimento, A. Bruchfeld, M. E. Suliman et al., “Effect of hepatitis C serology on C-reactive protein in a cohort of Brazilian hemodialysis patients,” Brazilian Journal of Medical and Biological Research, vol. 38, no. 5, pp. 783–788, 2005.
- J. I. Tsui, M. A. Whooley, A. Monto, K. Seal, P. C. Tien, and M. Shlipak, “Association of hepatitis C virus seropositivity with inflammatory markers and heart failure in persons with coronary heart disease: data from the heart and soul study,” Journal of Cardiac Failure, vol. 15, no. 5, pp. 451–456, 2009.
- P. Meuleman and G. Leroux-Roels, “HCV animal models: a journey of more than 30 years,” Viruses, vol. 1, no. 2, pp. 222–240, 2009.
- A. J. Lamperez and T. J. Rowell, “Normal C-reactive protein values for captive chimpanzees (Pan troglodytes),” Contemporary Topics in Laboratory Animal Science, vol. 44, no. 5, pp. 25–26, 2005.
- E. K. Harris and J. C. Boyd, Bases of Reference Values in Laboratory Medicine, Marcel Dekker, New York, NY, USA, 1995.
- J. J. Ely, B. Dye, W. I. Frels et al., “Subspecies composition and founder contribution of the captive U.S. Chimpanzee (Pan troglodytes) population,” American Journal of Primatology, vol. 67, no. 2, pp. 223–241, 2005.
- Institute for Laboratory Animal Research, Chimpanzees in Research Strategies for their Ethical Care, Management and Use, National Academies Press, Washington, DC, USA, 1997.
- Institute for Laboratory Animal Research, Guide to the Care and Use of LaboraTory Animals, National Academy Press, Washington, DC, USA, 1996.
- N. Aziz, J. L. Fahey, R. Detels, and A. W. Butch, “Analytical performance of a highly sensitive C-reactive protein-based immunoassay and the effects of laboratory variables on levels of protein in blood,” Clinical and Diagnostic Laboratory Immunology, vol. 10, no. 4, pp. 652–657, 2003.
- Beckman Coulter Inc., IMMAGE Immunochemistry Systems CRPH Chemistry Information Sheet, High Sensitivity C-Reactive Protein, Chemistry Information Sheet 988629 AG, 2010, http://www.beckmancoulter.com/wsrportal/search/Bernard/#2/10//0/25/1/0/asc/2/Bernard///0/1//0/.
- W. L. Roberts, R. Sedrick, L. Moulton, A. Spencer, and N. Rifai, “Evaluation of four automated high-sensitivity C-reactive protein methods: implications for clinical and epidemiological applications,” Clinical Chemistry, vol. 46, no. 4, pp. 461–468, 2000.
- D. A. Armbruster, M. D. Tillman, and L. M. Hubbs, “Limit of detection (LOD)/limit of quantitation (LOQ): comparison of the empirical and the statistical methods exemplified with GC-MS assays of abused drugs,” Clinical Chemistry, vol. 40, no. 7, part 1, pp. 1233–1238, 1994.
- D. R. Helsel, Nondetects and Data Analysis, Wiley, New York, NY, USA, 2005.
- W. L. Roberts, L. Moulton, T. C. Law et al., “Evaluation of nine automated high-sensitivity c-reactive protein methods: implications for clinical and epidemiological applications,” Clinical Chemistry, vol. 47, no. 3, pp. 418–425, 2001.
- M. A. Larkin, G. Blackshields, N. P. Brown et al., “Clustal W and Clustal X version 2.0,” Bioinformatics, vol. 23, no. 21, pp. 2947–2948, 2007.
- J. D. Thompson, T. J. Gibson, F. Plewniak, F. Jeanmougin, and D. G. Higgins, “The CLUSTAL X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools,” Nucleic Acids Research, vol. 25, no. 24, pp. 4876–4882, 1997.
- I. A. Adzhubei, S. Schmidt, L. Peshkin et al., “A method and server for predicting damaging missense mutations,” Nature Methods, vol. 7, no. 4, pp. 248–249, 2010.
- V. Ramensky, P. Bork, and S. Sunyaev, “Human non-synonymous SNPs: server and survey,” Nucleic Acids Research, vol. 30, no. 17, pp. 3894–3900, 2002.
- Uniprot Consortium, P02741 (CRP_HUMAN), 2013, http://www.uniprot.org/uniprot/P02741.
- E. L. Kaplan and P. Meier, “Nonparametric estimation from incomplete observations,” Journal of the American Statistical Association, vol. 53, pp. 457–481, 1958.
- J. H. Ware and D. L. Demets, “Reanalysis of some baboon descent data,” Biometrics, vol. 32, no. 2, pp. 459–463, 1976.
- D. W. Hosmer and S. Lemesow, Applied Survival Analysis, Wiley, New York, NY, USA, 1999.
- J. W. Tukey, Exploratory Data Analysis, Addison-Wesley, Reading, Mass, USA, 1977.
- R. Rosenthal and R. L. Rosnow, Contrast Analysis: Focused Comparisons in the Analysis of Variance, Cambridge University Press, Cambridge, UK, 1985.
- G. W. Snedecor and W. G. Cochrane, Statistical Methods, Iowa State University Press, Ames, Iowa, USA, 6th edition, 1967.
- R. H. Friis and T. A. Sellers, Epidemiology For Public Health Practice, Jones and Bartlett, Sudbury, Canada, 4th edition, 2009.
- R. Clarke, J. R. Emberson, E. Breeze et al., “Biomarkers of inflammation predict both vascular and non-vascular mortality in older men,” European Heart Journal, vol. 29, no. 6, pp. 800–809, 2008.
- E. S. Ford, W. H. Giles, G. L. Myers, N. Rifai, P. M. Ridker, and D. M. Mannino, “C-reactive protein concentration distribution among US children and young adults: findings from the National Health and Nutrition Examination Survey, 1999-2000,” Clinical Chemistry, vol. 49, no. 8, pp. 1353–1357, 2003.
- E. M. Macy, T. E. Hayes, and R. P. Tracy, “Variability in the measurement of C-reactive protein in healthy subjects: implications for reference intervals and epidemiological applications,” Clinical Chemistry, vol. 43, no. 1, pp. 52–58, 1997.
- H. Solberg and R. Gräsbeck, “Reference values,” Advances in Clinical Chemistry, vol. 27, pp. 1–79, 1989.
- A. Agresti, Categorical Data Analysis, John Wiley & Sons, Hoboken, NJ, USA, 2nd edition, 2002.
- S. E. Fienberg, The Analysis of Cross-Classified Categorical Data, MIT Press, Cambridge, Mass, USA, 2nd edition, 1980.
- C. N. Kroll and J. R. Stedinger, “Estimation of moments and quantiles using censored data,” Water Resources Research, vol. 32, no. 4, pp. 1005–1012, 1996.
- A. Rispin, D. Farrar, E. Margosches et al., “Alternative methods for the median lethal dose (LD50) test: the up-and-down procedure for acute oral toxicity,” ILAR Journal, vol. 43, no. 4, pp. 233–243, 2002.
- R. J. A. Little, “A test of missing completely at random for multivariate data with missing values,” Journal of the American Statistical Association, vol. 83, pp. 1198–1202, 1988.
- R. C. Antweiler and H. E. Taylor, “Evaluation of statistical treatments of left-censored environmental data using coincident uncensored data sets: I. Summary statistics,” Environmental Science and Technology, vol. 42, no. 10, pp. 3732–3738, 2008.
- P. Hougaard, Analysis of Multivariate Survival Data, Springer, New York, NY, USA, 2000.
- R. J. Glynn, N. M. Laird, and D. B. Rubin, “Multiple imputation in mixture models for nonignorable nonresponse with follow-ups,” Journal of the American Statistical Association, vol. 88, no. 423, pp. 984–993, 1993.
- A. R. T. Donders, G. J. M. G. van der Heijden, T. Stijnen, and K. G. M. Moons, “Review: a gentle introduction to imputation of missing values,” Journal of Clinical Epidemiology, vol. 59, no. 10, pp. 1087–1091, 2006.
- N. J. Horton and K. P. Kleinman, “Much ado about nothing: a comparison of missing data methods and software to fit incomplete data regression models,” American Statistician, vol. 61, no. 1, pp. 79–90, 2007.
- P. S. Horn, A. J. Pesce, and B. E. Copeland, “A robust approach to reference interval estimation and evaluation,” Clinical Chemistry, vol. 44, no. 3, pp. 622–631, 1998.
- M. M. Finkelstein and D. K. Verma, “Exposure estimation in the presence of nondetectable values: another look,” American Industrial Hygiene Association Journal, vol. 62, no. 2, pp. 195–198, 2001.
- P. Wessa, “Free Statistics Software, Office for Research Development and Education, version 1. 1. 23-r7,” 2012, http://www.wessa.net/.
- M. A. Stephens, “EDF statistics for goodness of fit and some comparisons,” Journal of the American Statistical Association, vol. 69, pp. 730–737, 1974.
- L. Fu and Y. G. Wang, “Nonparametric rank regression for analyzing water quality concentration data with multiple detection limits,” Environmental Science and Technology, vol. 45, no. 4, pp. 1481–1489, 2011.
- W. G. Wood, J. Lüdemann, R. Mitusch, J. Heinrich, R. Maass, and U. Frick, “Evaluation of a sensitive immunoluminometric assay for the determination of C-reactive protein (CRP) in serum and plasma and the establishment of reference ranges for different groups of subjects,” Clinical Laboratory, vol. 46, no. 3-4, pp. 131–140, 2000.
- J. L. Schafer, “Software for multiple imputation,” 2000, http://sites.stat.psu.edu/~jls/misoftwa.html.
- I. G. N. Darmawan, “NORM software review: handling missing values with multiple imputation methods,” Evaluation Journal of Australasia, vol. 2, no. 1, pp. 51–57, 2002.
- D. C. Howell, “Multiple imputation using NORM,” 2010, http://www.uvm.edu/~dhowell/StatPages/More_Stuff/Missing_Data/MissingDataWith%20NORM/NormMissingData.html.
- D. Rubin, ,Multiple Imputation for Nonresponse in Surveys, Wiley, New York, NY, USA, 1987.
- J. L. Schafer, Analysis of Incomplete Multivariate Data, Chapman & Hall, London, UK, 1997.
- J. B. Carlin, N. Li, P. Greenwood, and C. Coffey, “Tools for analyzing multiple imputed datasets,” Stata Journal, vol. 3, no. 3, pp. 226–244, 2003.
- E. K. Harris and J. C. Boyd, “On dividing reference data into subgroups to produce separate reference ranges,” Clinical Chemistry, vol. 36, no. 2, pp. 265–270, 1990.
- P. S. Horn and A. J. Pesce, Reference IntervalS: A USer'S Guide, AACC Press, Washington, DC, USA, 2005.
- F. E. Harrell and C. E. Davis, “A new distribution-free quantile estimator,” Biometrika, vol. 69, no. 3, pp. 635–640, 1982.
- A. D. Hutson and M. D. Ernst, “The exact bootstrap mean and variance of an L-estimator,” Journal of the Royal Statistical Society B, vol. 62, no. 1, pp. 89–94, 2000.
- Clinical and Laboratory Standards Institute, Defining, Establishing, and Verifying Reference Intervals in the Clinical Laboratory, Approved Guidelines, CLSI document C28-A3, Fort Wayne, Ind, USA, 3rd edition, 2008.
- MedCalc Software, MedCalc Software, Version 11. 3, Mariakerke, Belguim, 2010, http://www.medcalc.be/download.php.
- S. L. Stockman and M. A. Scott, Fundamentals of Veterinary Clinical Pathology, Wiley, New York, NY, USA, 2nd edition, 2008.
- R. Zhang, Y. Y. Zhang, X. R. Huang et al., “C-reactive protein promotes cardiac fibrosis and inflammation in angiotensin II-induced hypertensive cardiac disease,” Hypertension, vol. 55, no. 4, pp. 953–960, 2010.
- E. J. Erlandsen and E. Randers, “Reference interval for serum C-reactive protein in healthy blood donors using the Dade Behring N Latex CRP mono assay,” Scandinavian Journal of Clinical and Laboratory Investigation, vol. 60, no. 1, pp. 37–44, 2000.
- B. Herbeth, G. Siest, and J. Henny, “High sensitivity C-reactive protein (CRP) reference intervals in the elderly,” Clinical Chemistry and Laboratory Medicine, vol. 39, no. 11, pp. 1169–1170, 2001.
- M. B. Pepys and G. M. Hirschfield, “C-reactive protein: a critical update,” Journal of Clinical Investigation, vol. 111, no. 12, pp. 1805–1812, 2003.
- O. Chenillot, J. Henny, J. Steinmetz, B. Herbeth, C. Wagner, and G. Siest, “High sensitivity C-reactive protein: biological variations and reference limits,” Clinical Chemistry and Laboratory Medicine, vol. 38, no. 10, pp. 1003–1011, 2000.
- S. P. Fortmann, E. Ford, M. H. Criqui et al., “CDC/AHA workshop on markers of inflammation and cardiovascular disease,” Circulation, vol. 110, no. 25, pp. e554–e559, 2004.
- H. Schlebusch, N. Liappis, E. Kalina, and C. Klein, “High sensitive CRP and creatinine: reference intervals from infancy to childhood,” LaboratoriumsMedizin, vol. 26, no. 5-6, pp. 341–346, 2002.
- Z. Wang and W. E. Hoy, “Population distribution of high sensitivity C-reactive protein values in Aboriginal Australians: a comparison with other populations,” Clinical Biochemistry, vol. 39, no. 3, pp. 277–281, 2006.
Copyright © 2013 John J. Ely et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.