A Proposed Approach for Joint Modeling of the Longitudinal and Time-To-Event Data in Heterogeneous Populations: An Application to HIV/AIDS’s Disease

Roustaei, Narges; Ayatollahi, Seyyed Mohammad Taghi; Zare, Najaf

doi:https://doi.org/10.1155/2018/7409284

BioMed Research International

On this page

Abstract Introduction Materials and Methods Discussion Conclusion Abbreviations Conflicts of Interest Acknowledgments References Copyright Related Articles

Research Article | Open Access

Volume 2018 | Article ID 7409284 | https://doi.org/10.1155/2018/7409284

A Proposed Approach for Joint Modeling of the Longitudinal and Time-To-Event Data in Heterogeneous Populations: An Application to HIV/AIDS’s Disease

Narges Roustaei,¹Seyyed Mohammad Taghi Ayatollahi,¹and Najaf Zare¹

Academic Editor: Momiao Xiong

Received13 Jul 2017

Revised15 Nov 2017

Accepted05 Dec 2017

Published09 Jan 2018

Abstract

In recent years, the joint models have been widely used for modeling the longitudinal and time-to-event data simultaneously. In this study, we proposed an approach (PA) to study the longitudinal and survival outcomes simultaneously in heterogeneous populations. PA relaxes the assumption of conditional independence (CI). We also compared PA with joint latent class model (JLCM) and separate approach (SA) for various sample sizes (150, 300, and 600) and different association parameters (0, 0.2, and 0.5). The average bias of parameters estimation (AB-PE), average SE of parameters estimation (ASE-PE), and coverage probability of the 95% confidence interval (CP) among the three approaches were compared. In most cases, when the sample sizes increased, AB-PE and ASE-PE decreased for the three approaches, and CP got closer to the nominal level of 0.95. When there was a considerable association, PA in comparison with SA and JLCM performed better in the sense that PA had the smallest AB-PE and ASE-PE for the longitudinal submodel among the three approaches for the small and moderate sample sizes. Moreover, JLCM was desirable for the none-association and the large sample size. Finally, the evaluated approaches were applied on a real HIV/AIDS dataset for validation, and the results were compared.

1. Introduction

In many studies, the repeated measures of a biomarker are recorded together with time to an event of interest. For example, in HIV/AIDS studies, the trajectories of CD4 counts and time-to-death are collected. In such studies, the interest often lies in understanding the relationships between the longitudinal history of a process and its effect on the risk of an event [1–9].

Classical models such as the separate analysis were performed for these types of data; consequently, the association between the longitudinal and survival outcomes is neglected because the linear mixed model for repeated measurements and the Cox model for time-to-event are conducted separately [6, 10, 11]. In addition, some practices consider the dependency between the two outcomes. Hence, the extended Cox model is used to incorporate the repeated measures as time-varying covariates [4]. In this method, time varying covariates are assumed to be observed continuously till the study terminated using this approach. In practice, this assumption usually does not stratify. Moreover, longitudinal biomarkers tend to be measured with error; thus, modeling the longitudinal measures by a mixed model accounts for this measurement error, which is neglected in the extended Cox model, thus leading to biased and inefficient estimates [4, 10, 12–14].

In recent years, joint model has been used to analyze the longitudinal and survival outcomes simultaneously to consider association between the two outcomes [1, 14, 15]. Joint model enjoys some advantages as compared to classical approaches such as Cox and linear mixed models alone and provides more powerful, accurate, efficient, and robust estimations [4, 10, 12, 16].

Most of the joint models allow subjects to just follow one pattern [5, 6, 13], and the baseline hazard is considered the same for all subjects. Thus, they become inappropriate when there are subgroups with different patterns of response profiles [13].

Joint latent class model (JLCM) is a type of joint models that assumes the population of the subjects to be heterogeneous with multiple homogenous patterns; it is known as the latent class (subpopulation, subtype, or subgroup), having its own longitudinal trajectory and survival curve [2, 5, 6, 17].

Conditional independence (CI) as a fundamental assumption of the JLCM shows that the entire association between longitudinal and survival outcomes is captured by the latent class structure. Thus, given these latent classes, the two types of outcomes are independent [17–20]. However, the CI assumption may not sufficiently show the strength of association and might underestimate the association between the longitudinal and survival processes [13]. Furthermore, to ensure the CI assumption, JLCM has to be examined for various numbers of latent classes, which may ultimately lead to choosing an inappropriate and meaningless size of classes.

We designed a simulation study to combine the joint model with the latent class framework which proposed an approach (PA) for heterogeneous population of subjects free from the CI assumption. At first, the class membership for each subject based on the latent class framework was identified for appropriate number of latent classes. Then, the joint model for longitudinal and survival processes was conducted separately in each latent class for PA. In addition, the separate approach (SA), the linear mixed model for the longitudinal data, and the extended Cox model for the survival outcome were applied separately in each latent class. Finally, we compared PA with JLCM and SA for various sample sizes and different association parameters. In addition, we focused on both the longitudinal and survival outcomes in this study.

2. Materials and Methods

2.1. Models Framework

2.1.1. Joint Latent Class Model (JLCM)

JLCM assumes that the subjects in each latent class have their own specific longitudinal trajectory and risk of the event, which is useful in many types of research with different patterns of the longitudinal and survival outcomes. In addition, JLCM can be performed for normal and nonnormal distributions and ordinal outcomes [6, 21]. This model does not require normal distribution of random-effects assumption, since it consists of several subpopulations, where this assumption is not realistic [22].

JLCM includes three components: the latent class membership, the longitudinal, and survival submodels. Given the latent class , there is no association between two processes of the longitudinal and survival outcomes; consequently, dependency between time-to-event and longitudinal processes is captured by the structure of latent class [5]. Several methods were introduced to evaluate the CI assumption: evaluation based on the posterior classification, analysis of the residuals conditional on the event, and a score test [19, 23, 24]. Among these approaches, the score test is more powerful than the other methods to assess the CI assumption [2, 5].

In practice, JLCM is applied to a number of latent classes from one to three; the appropriate number of latent classes is determined using the best Bayesian information criterion (lower BIC) and satisfactory CI assumption [6, 20].

Each subject is assigned to each latent class, which has the highest class membership probabilities [25]. A case that is wrongly classified is called misclassified on a categorical variable [13].

2.1.2. Separate Approach (SA)

Commonly, the linear mixed model is used for continuous longitudinal measurements. Also, the parametric or semiparametric survival models are used for modeling the time-to-event data [11]. In SA, the probability that a subject belongs to a latent class structure can be modeled via a latent class framework. Next, the linear mixed model for modeling the longitudinal measurements and the extended Cox model by incorporating repeated measurements into the survival data were conducted for each latent class.

2.1.3. Proposed Approach (PA)

We incorporated the latent class framework to identify its subgroups behind the observed longitudinal measurements and survival outcome. PA provides an approach that achieves appropriate number of the latent classes in heterogeneous populations without requiring the CI assumption. Appropriate number of latent classes are determined by a suitable and easier interpretation according to researcher’ comments. For PA, each subject was allocated to an appropriate class according to the highest class membership probabilities. Then, joint model was conducted for each class; additionally, in each latent class, the association between the longitudinal and time-to-event data was modeled by the entire longitudinal trajectory as a covariate in the survival submodel.

(1) Latent Class Framework. The class membership probability for a subject belonging to a latent class can be modeled via a multinomial logistic regression with vector of covariate :Let represent the latent variable with latent classes.

is the intercept for class and is the vector of class-specific parameters associated with the set covariates . Also, to ensure identifiability, and , that is, last latent class as [2, 5, 25].

In the application, parameters from latent class framework are estimated by maximizing the log likelihood function with iteration of Expected-Maximization (EM) algorithm with steps of Newton-Raphson [26, 27].

(2) Longitudinal Submodel. The longitudinal submodel is specified as a class-specific linear mixed model. Let be the total number of subjects and let be the number of repeated measurements for subject . The longitudinal submodel given to each latent class can be written asGiven the latent class , is the longitudinal outcome for subject at the time of , and represents the random effect covariate vectors at the time for subject , associated with the -vector of random effect , where is the fixed effects covariate vectors at the time , which is associated with the -vector of fixed effect. The random error term, is usually assumed to be normally distributed.

(3) Survival Submodel. The survival submodel is specified as a Cox or any parametric survival model. Given latent class , the survival submodel is specified aswhere is the baseline hazard function for class and is the covariate vector associated with the -vector parameters for the latent class .

The quantity, , is the trajectory of the longitudinal function for class to connect the longitudinal process with the survival outcome. The parameter links the longitudinal and time-to-event outcomes in each class.

2.2. Simulation Studies

We conducted this simulation study to examine bias, SE, the average bias of parameters estimation (AB-PE), the average SE of parameters estimation (ASE-PE), and coverage probability of the 95% confidence interval (CP) for three approaches (PA, JLCM, and SA) for the longitudinal and survival submodels. AB-PE shows the average of absolute bias of all parameters estimation. CP shows the proportion of time that confidence interval contains the true value.

A multinomial logistic model was considered for the latent class membership for each subject: We considered a binary and a continuous covariate, where is called a treatment effect, which was assumed as a binomial distribution with and . We assumed two latent classes (), where approximately 50% of the subjects belonged to class 1.

The longitudinal outcome was generated from a linear mixed model, where time of measurements was fixed at with a maximum of 11 measurements. The longitudinal submodel given to each latent class isTo achieve appropriate heterogeneous classes and to decrease misclassification rate, we considered the parameters with opposite direction in two classes from a previous study [13]. Thus, in the first class, we set coefficients to be and assumed subject-specific unobservable heterogeneity in class 1, . The error term had normal standard distribution. In the second class, we set coefficients to be and assumed that , and where the random intercept effect was assumed independent from the error term. The is called trajectory function for each class.

The survival submodel assumed a Cox model with a Weibull baseline hazard function. The event time was generated using an inverse cumulative hazard function [15, 28, 29]. The censored time is noninformative and is uniformly distributed random variable on 2.5+ uniform . Therefore, the observed failure time for the th subject was considered as the minimum of true event time and censored time [20, 30]. As some previous studies, the censoring rate was considered around 60% in this simulation study [13, 28].

The survival submodel was generated for each latent class as follows:The treatment effect on the time-to-event was and −0.5 in classes 1 and 2, respectively. The shape and scale parameters, , of baseline hazard function were (0.6, 0.001) and (1, 0.001) in classes 1 and 2, respectively.

Sets of simulated data were performed for three sample sizes (150, 300, and 600 as small, moderate, and large sample sizes). Similar to previous study, three association parameters between longitudinal and survival outcomes were considered for none, moderate, and considerable association, respectively [12]. The magnitude of the association parameters was assumed the same in the two classes. For each simulation, the three approaches of PA, SA, and JLCM were fitted. We ran 1000 replications for each set of simulated data.

There are several methods to estimate parameters in joint models, including ML, restricted maximum likelihood (REML), and Bayesian method [18]. In PA, Gauss-Hermite integration method for maximizing the log likelihood of the joint distribution and EM iterations algorithm or quasi-Newton iterations were used. In JLCM, ML with EM algorithm was implemented to estimate parameters. For SA approach, ML in the longitudinal submodel and REML in the survival submodel were used for parameters estimation. The JM and LCMM packages in R version 3.1.1 software were used in this study.

3. Results of Simulation Study

3.1. Effect of Sample Size

Simulations results showed that in most cases when the three approaches were used the sample size increased, while AB-PE and ASE-PE decreased, and the CP went close to nominal level of 0.95. Tables 1–3 and Figures 1 and 2 present detailed information.

3.2. Effect of Association between the Longitudinal and Survival Outcomes

3.2.1. None-Association

For JLCM, the model with the best BIC and the CI assumption satisfied included two latent classes () for the moderate and large sample sizes, while in the small sample size, was the best-fit model. For the small sample size, results were reported for the two classes, since we can compare the models together. The average misclassification rates for the two latent classes for sample sizes of 150, 300, and 600 were 24%, 5%, and 1.4%, respectively.

AB-PE and ASE-PE for the longitudinal submodel of PA and SA for the small sample size were the same and the lowest among the three approaches. Additionally, PA had lower ASE-PE than SA for the moderate and large sample sizes. For the large sample size, AB-PE of the longitudinal submodel for the three approaches was the same, and JLCM had the lowest ASE-PE (Figure 1).

AB-PE for the treatment effect on time-to-event in the PA and SA was approximately the same for the small and moderate sample sizes. In addition, PA had better CP as compared with SA. Besides, for the large sample size, AB-PE and ASE-PE for the treatment effect on the survival submodel of JLCM were the lowest (Figure 2) and had a good CP amongst the three approaches.

The average of absolute bias and SE for the association parameter of PA and SA was approximately the same, and by increasing the sample size, bias and its SE decreased.

Bias, SE, and CP estimated parameters for the three approaches are presented in Table 1. AB-PE and their ASE-PE for the longitudinal and survival submodels are shown in Figures 1 and 2.

3.2.2. Moderate Association

For JLCM, the average misclassification rate for the two latent classes in the small sample size was approximately 20%, which was greater than the other sample sizes.

PA had the smallest AB-PE and ASE-PE for the longitudinal submodel among the three approaches for small and moderate sample sizes. As for the large sample size, PA and JLCM had the same AB-PE, but JLCM had smaller ASE-PE in comparison with the other approaches for the longitudinal submodel (Figure 1).

AB-PE for the treatment effect on the survival submodel of PA was lower than JLCM and SA for the all sample sizes. In addition, ASE-PE of PA was the lowest among the three approaches for the small and moderate sample sizes. Furthermore, JLCM had the lowest ASE-PE, and SA had the highest AB-PE among the three approaches for the large sample size (Figure 2).

Figures 1 and 2 show the results. In addition, by increasing the sample size, CP for PA and JLCM were close to 0.95 (Table 2).

3.2.3. Considerable Association

JLCM with one to three numbers of latent classes was performed. For the moderate and large sample sizes, the three appropriate numbers of latent classes were detected based on the best BIC, and satisfaction CI assumption, and for the small sample size, one latent class was preferred. We reported the estimation of parameters for the two classes in order to compare the three approaches together. The average misclassification rates for the two latent classes were 47%, 26%, and 10%, for the small, moderate, and large sample sizes, respectively.

PA had the lowest AB-PE and ASE-PE and plausible CP for the longitudinal outcome, as well as the treatment effect on the survival submodel for the three sample sizes. In the three approaches, if sample size increases, AB-PE and ASE-PE decrease (Figures 1 and 2) and the CP get closer to nominal level of 0.95. In addition, bias of association parameter for PA and SA was negative in two classes. Moreover, the average absolute bias of association parameter for SA was higher than PA. The average of CP for PA, JLCM, and SA was 0.970, 0.837, and 0.833, for the large sample size, respectively. For bias, SE, and CP information of parameters estimation, refer to Table 3.

4. Empirical Example

4.1. The Data and Methods Description

The number of new HIV infections has declined by 38% worldwide from 2001 to 2013, followed by a significant decline in AIDS-related deaths [31]. According to the World Health Organization [32] report, 36.7 million people will be living with HIV/AIDS by the end of 2015 [32].

Among infectious diseases, the HIV/AIDS studies are a good example to be used in joint modeling of the longitudinal and survival processes. There are some literatures available that have used the joint modeling on such data [3, 6, 33, 34]. In HIV/AIDS studies, CD4 cells are considered as a sign of disease progression in HIV-infected people. CD4 cells help to coordinate the immune system’s response to certain microorganisms such as viruses; a low CD4 count is an indication of a higher risk of infection [6, 33, 35].

In this study, the HIV/AIDS dataset from Community Programs for Clinical Research on AIDS (CPCRA) was used [36], and a total of 467 patients infected with HIV were included in this study. The two outcomes were the longitudinal measurements of CD4, recorded at different time points: at the study entry, 2, 6, 12, and 18 months, and the time-to-death outcome. In CPCRA study, patients received two treatments, Zalcitabine (ddC) or Didanosine (ddI), randomly. Only a brief description of the dataset used in this study was mentioned here, since they have been fully described elsewhere [36].

In the present study, the HIV/AIDS dataset was used as an example to evaluate PA. To predict the class membership, an intercept-only-model or different covariates such as baseline hemoglobin (Hgb), treatment, and gender were considered from the literature [6, 13, 18]. In this study, based on Hgb and the treatment covariates, the class membership probability for each patient was identified via latent class framework. Then the patients were divided into two latent classes based on their highest posterior class membership probabilities. The number of latent classes was chosen in a way that there were enough observations in each latent class for easier classification, consistency with our simulation-based study, and easier interpretation.

PA and SA for modeling the influence of effective covariates on CD4 count and time-to-death were conducted in each class. In addition, we fitted JLCM for longitudinal CD4 measures and time-to-death with the number of latent classes varying from 1 to 3.

Due to the skewed distribution of CD4 cell level, and the presence of zero values, log(CD4+1) was used as the longitudinal outcome. The baseline hazard functions were estimated by Weibull distribution.

We used latent GOLD software ver. 4.5 to identify the probability of the class membership for each subject and influential covariates on classes.

4.2. Results of Application Data

The results of latent class framework, using the PA and SA, showed that Hgb was significant ( value < 0.001), while the treatment ( value = 0.170) was an insignificant covariate on the subtype classification. Based on the classification, 51% of the patients were in the first class.

The PA on HIV/AIDS Dataset. In both classes, CD4 values decreased with time. The estimates of association parameters between CD4 and the time-to-death were significant and negative in both classes. Treatment had a significant effect on time-to-death in the second class. The effective covariates on the longitudinal and survival submodels are presented in Table 4.

The Kaplan-Meier survival plot and the mean of log(CD4+1) stratified by posterior classification are presented in Figures 3 and 4. Patients in the second class had a better survival rate and higher log(CD4+1) values.

The SA on HIV/AIDS Dataset. In the longitudinal submodel, time effect in the first class occurrence reduces CD4 cells significantly. Hgb had a small, but significant effect on CD4 in the first class and a strong significant effect in the second class. In the survival submodel, the treatment had a significant effect on time-to-death in the second class. The estimated association parameters between CD4 and the time-to-death were significant and negative in both classes. The results for SA are presented in Table 4.

The JLCM on HIV/AIDS Dataset. BIC calculated for two latent classes was 4280.560, which was smaller than one class (4417.210) and three classes (4304.480). Also, the CI assumption was not rejected ( value = 0.250) for this model; thus, the model with two latent classes was preferred. The probability of belonging to the latent classes was not significantly associated with the treatment ( value = 0.370) and Hgb ( value = 0.566).

In the longitudinal submodel, the time effect was negative and significant in the two latent classes. In the survival submodel, the treatment in the second class and Hgb in the first class were significantly associated with risk of death for HIV/AIDS patients (Table 4).

Overall, ASE-PE for the longitudinal submodel was 0.024, 0.024, and 0.039 for the PA, SA, and JLCM, respectively. Furthermore, ASE-PE for the survival submodel among the three approaches were 0.106, 0.394, and 0.343 for the PA, SA, and JLCM, respectively.

5. Discussion

5.1. Discussions about Simulation Results

According to the simulations results, in most cases, in the three approaches when the sample size increased, AB-PE and ASE-PE decreased, and CP got closer to nominal level of 0.95. This finding is consistent with a simulation-based study for a parametric latent class joint model of the longitudinal and survival outcome [2].

Our main finding occurred when there was a considerable association () between two processes. PA provided lower AB-PE and ASE-PE than JLCM and SA for the three sample sizes; hence, PA yielded unbiased and more efficient estimation of parameters than JLCM and SA for the longitudinal and survival submodels. The results of a similar study are consistent with those of PA for heterogeneous populations [13]. However, PA used the full longitudinal trajectory to connect the longitudinal and survival data, whereas in the similar study, only the shared random effect was used. This study showed that the model worked well in estimating longitudinal and survival parameters in a sample size of 400 and for the considerable association between the two processes.

To the best of our knowledge, no comparison has been made between JLCM and other approaches for the heterogeneous populations. However, to compare with similar studies, we used the ones that had assumed that the subjects exhibited one pattern. For comparison between PA and SA, the results are consistent with previous studies that had conducted simulation-based studies where there was a strong association between the two outcomes. Their results showed that the joint modeling that utilizes information from both outcomes tends to produce almost unbiased estimates and smaller SEs of all the parameters than separate model [8, 37, 38]. Furthermore, since AB-PE and ASE-PE in JLCM were higher than PA, it seems that JLCM cannot contain the strength of association entirely by latent structures. In addition, the number of latent classes in JLCM could not be estimated directly and for some sample sizes, the appropriate number of classes is selected according to lower BIC, and acceptance of the CI assumption was not consistent with the true size of classes. Therefore, it led to biased estimation of parameters, while PA achieved an appropriate number of latent classes directly with no need for the CI assumption and BIC criterion. In addition, the association parameter for SA was underestimated in comparison with PA. This result concurs with those of a similar study which showed that using the longitudinal outcome as a time-varying covariate into the survival model is not recommended, due to severe underestimation of the association parameter [15].

When there was the moderate correlation between the longitudinal and survival processes, PA was preferred over JLCM and SA for the small and moderate sample sizes. In addition, the average misclassification rate for the small sample size in JLCM was high; hence, AB-PE and ASE-PE were increased. Furthermore, for the large sample size, the average misclassification rate for JLCM was low; thus, AB-PE of the longitudinal submodel for JLCM and PA was the same and JLCM was more efficient.

For the case of none-association between the longitudinal and survival processes, results of the longitudinal submodel of PA were similar to SA in the small sample size. Our finding is consistent with a similar study for none-association between the longitudinal and survival data [39]. Also, PA was more efficient than SA in the moderate and large sample sizes. For the small and moderate sample sizes, the results of the effect of the treatment on the time-to-event of PA and SA were found to be similar. This result is consistent with similar studies that had shown when there was no association between the longitudinal and survival data; the longitudinal information did not improve the estimation of the treatment effect on the survival outcome [12, 39]. Moreover, JLCM was unbiased and more efficient than the other two approaches in the large sample size. Computationally, JLCM was faster and easier than the time consuming PA. In addition, for the large sample size, the misclassification rate was the lowest; hence, the entire association between the longitudinal and survival outcomes can be considered with the latent structure. Therefore, in this case, JLCM was more desirable than the two other approaches.

We believe that our PA can address the heterogeneity and consider the association structure behind the longitudinal and survival processes. One of the advantages of using PA was its capability to reduce AB-PE and ASE-PE by increasing the sample size and intensity of the association parameter. However, it leads to increased computation and time required to fit the model, which is one of the disadvantages of PA.

Finally, this study had some limitations that have to be addressed. First, in this study, we used the same magnitude parameters with opposite direction to consider the two heterogeneous classes. Second, association parameters for the two classes were the same. Third, this study was limited to two latent classes and continuous longitudinal and single event data. Further researches have to use PA with various options for the survival and longitudinal processes such as a nonlinear mixed model for the longitudinal data and a parametric or recurrent survival model. Moreover, we used ML estimation, while the Bayesian inference can be an alternative approach for estimation of parameters. Also, further simulation studies can be performed to evaluate statistical properties for PA including the statistical power.

5.2. Discussions about HIV/AIDS Results

The results showed that Hgb was a significant covariate in classifying subjects via latent class framework, concurring with the results of a study on this dataset [13].

According to the results of the application, the time effect was significant in each class for CD4 longitudinal outcome in PA and JLCM. This study produced results which corroborate the findings of a great deal of the previous works that used this dataset [3, 13]. There were no statistically significant differences between the two treatments (ddC and ddI), on CD4 longitudinal outcome in the two latent classes in the three approaches. This result agrees with the findings of similar studies that had investigated the effect of the treatment on CD4 longitudinal outcome [11, 13]. In addition, Hgb had a significant positive effect on CD4 values in both classes in PA and SA. This result matches those observed in earlier studies on this dataset [13].

As for the survival submodel, the treatment was a significant factor on time-to-death in the second class in the three approaches. Patients in the second class had a better survival rate when given ddC. Furthermore, Hgb was not a significant factor of the death rate in the two classes for PA and SA. This finding is consistent with a similar study where Hgb was imported into the model [13]. In JLCM, Hgb was a significant factor of death rate in the first class but did not have a significant effect in the second class.

The estimated association parameters ( and ) between CD4 and time-to-death were negative and significant in both classes for PA and SA. This implies that a higher CD4 count is associated with a lower death rate or a reduced number of CD4 significantly increases the risk of death in patients [10, 40].

Overall, the results of PA in this study confirm those of the previous studies on this dataset and with the biomedical literature [11, 13, 14, 18]. Moreover, PA and SA had the same ASE-PE approximately for the longitudinal submodel that are consequently more efficient than JLCM. In addition, PA had lower ASE-PE for the survival submodel; hence, PA is more efficient than the other two approaches for indicating the influence covariates on time-to-death in patients with HIV/AIDS. The results of the three approaches on CPCRA data confirm our result in the simulation study when there was a considerable association parameter between the longitudinal outcome and time-to-event in the large sample size.

The application study on CPCRA data shows the advantages of our PA. Therefore, by using appropriate latent class joint model, we can assign treatment ddC to patients with a higher chance of being classified into the second class based on their baseline hemoglobin (Hgb), thereby increasing the survival rate. In other cases, when the treatments have side effects, we could utilize an appropriate latent class joint modeling to identify a subgroup of patients that are most likely to have side effects. Hence, we can assign treatments in a personalized manner to avoid such subgroup, which can further benefit the patients.

6. Conclusion

This simulation-based study provided an approach for the joint model, by considering the association between the longitudinal and time-to-event data for heterogeneous populations which does not require the CI assumption. This study concluded that for the three approaches when the sample size increased, AB-PE and ASE-PE decreased to some extent, and CP reached the nominal level of 0.95. Finally, when there were a considerable association and the large sample size, PA was preferred.

Abbreviations

JLCM:	Joint latent class model
PA:	Proposed approach
SA:	Separate approach
BIC:	Bayesian information criterion
CI:	Conditional independence
CP:	Coverage probability
SE:	Standard error
AB-PE:	Average bias of parameters estimation
ASE-PE:	Average SE of parameters estimation
EM:	Expected-Maximization
ML:	Maximum likelihood
REML:	Restricted maximum likelihood
CPCR:	Community Programs for Clinical Research on AIDS
ddC:	Zalcitabine
ddI:	Didanosine.

Conflicts of Interest

The authors declare that they have no conflicts of interest regarding the publication of this paper.

Acknowledgments

The present paper was extracted from Ph.D. dissertation of Ms. Narges Roustaei and was supported by Shiraz University of Medical Sciences, Shiraz, Iran (Grant no. 94-10580). The authors would like to thank Mr. Jamshid Jamali, Mr. Saeid Ghanbari, and Ms. Zahra Amini Farsani for their constructive comments. The authors wish to thank Mr. Hossein Argasi at the Research Consultation Center (RCC) of Shiraz University of Medical Sciences for his invaluable assistance in editing this manuscript.

References

Q. Chen, R. C. May, J. G. Ibrahim, H. Chu, and S. R. Cole, “Joint modeling of longitudinal and survival data with missing and left-censored time-varying covariates,” Statistics in Medicine, vol. 33, no. 26, pp. 4560–4576, 2014.
View at: Publisher Site | Google Scholar | MathSciNet
J. Han, E. H. Slate, and E. A. Pena, “Parametric latent class joint model for a longitudinal biomarker and recurrent events,” Statistics in Medicine, vol. 26, no. 29, pp. 5285–5302, 2007.
View at: Publisher Site | Google Scholar | MathSciNet
L. Liu and X. Huang, “Joint analysis of correlated repeated measures and recurrent events processes in the presence of death, with application to a study on acquired immune deficiency syndrome,” Journal of the Royal Statistical Society: Series C (Applied Statistics), vol. 58, no. 1, pp. 65–81, 2009.
View at: Publisher Site | Google Scholar | MathSciNet
S. Li, “Joint modeling of recurrent event processes and intermittently observed time-varying binary covariate processes,” Lifetime Data Analysis, vol. 22, no. 1, pp. 145–160, 2016.
View at: Publisher Site | Google Scholar
C. Proust-Lima, M. Sene, J. M. Taylor, and H. Jacqmin-Gadda, “Joint latent class models for longitudinal and time-to-event data: a review,” Statistical Methods in Medical Research, vol. 23, no. 1, pp. 74–90, 2014.
View at: Publisher Site | Google Scholar | MathSciNet
C. Brombin, C. Di Serio, and P. M. V. Rancoita, “Joint modeling of HIV data in multicenter observational studies: A comparison among different approaches,” Statistical Methods in Medical Research, vol. 25, no. 6, pp. 2472–2487, 2016.
View at: Publisher Site | Google Scholar
D. Rizopoulos, G. Verbeke, and G. Molenberghs, “Multiple-imputation-based residuals and diagnostic plots for joint models of longitudinal and survival outcomes,” Biometrics, vol. 66, no. 1, pp. 20–29, 2010.
View at: Publisher Site | Google Scholar
M. Sudell, R. Kolamunnage-Dona, and C. Tudur-Smith, “Joint models for longitudinal and time-to-event data: A review of reporting quality with a view to meta-analysis,” BMC Medical Research Methodology, vol. 16, no. 1, article no. 168, 2016.
View at: Publisher Site | Google Scholar
A. Chakrabortya and K. Dasb, “Inferences for joint modelling of repeated ordinal scores and time to event data,” Computational and Mathematical Methods in Medicine, vol. 11, no. 3, pp. 281–295, 2010.
View at: Publisher Site | Google Scholar
L. Wu, W. Liu, G. Y. Yi, and Y. Huang, “Analysis of longitudinal and survival data: Joint modeling, inference methods, and issues,” Journal of Probability and Statistics, Article ID 640153, 2012.
View at: Publisher Site | Google Scholar
X. Guo and B. P. Carlin, “Separate and joint modeling of longitudinal and event time data using standard computer packages,” The American Statistician, vol. 58, no. 1, pp. 16–24, 2004.
View at: Publisher Site | Google Scholar | MathSciNet
J. G. Ibrahim, H. Chu, and L. M. Chen, “Basic concepts and methods for joint models of longitudinal and survival data,” Journal of Clinical Oncology, vol. 28, no. 16, pp. 2796–2801, 2010.
View at: Publisher Site | Google Scholar
Y. Liu, L. Liu, and J. Zhou, “Joint latent class model of survival and longitudinal data: An application to CPCRA study,” Computational Statistics & Data Analysis, vol. 91, pp. 40–50, 2015.
View at: Publisher Site | Google Scholar
H. J. Lim, P. Mondal, and S. Skinner, “Joint modeling of longitudinal and event time data: application to HIV study,” Journal of Medical Statistics and Informatics, vol. 1, no. 1, p. 1, 2013.
View at: Google Scholar
M. J. Sweeting and S. G. Thompson, “Joint modelling of longitudinal and time-to-event data with application to predicting abdominal aortic aneurysm growth and rupture,” Biometrical Journal, vol. 53, no. 5, pp. 750–763, 2011.
View at: Publisher Site | Google Scholar | MathSciNet
L. M. Chen, J. G. Ibrahim, and H. Chu, “Sample size and power determination in joint modeling of longitudinal and survival data,” Statistics in Medicine, vol. 30, no. 18, pp. 2295–2309, 2011.
View at: Publisher Site | Google Scholar | MathSciNet
H. Lin, B. W. Turnbull, C. E. McCulloch, and E. H. Slate, “Latent class models for joint analysis of longitudinal biomarker and event process data: application to longitudinal prostate-specific antigen readings and prostate cancer,” Journal of the American Statistical Association, vol. 97, no. 457, pp. 53–65, 2002.
View at: Publisher Site | Google Scholar | MathSciNet
D. Rizopoulos, Joint models for longitudinal and time-to-event data: With applications in R, CRC Press, Boca Raton, Fl, USA, 2012.
H. Jacqmin-Gadda, C. Proust-Lima, J. M. G. Taylor, and D. Commenges, “Score test for conditional independence between longitudinal outcome and time to event given the classes in the joint latent class model,” Biometrics, vol. 66, no. 1, pp. 11–19, 2010.
View at: Publisher Site | Google Scholar
C. Proust-Lima and J. M. G. Taylor, “Development and validation of a dynamic prognostic tool for prostate cancer recurrence using repeated measures of posttreatment PSA: A joint modeling approach,” Biostatistics, vol. 10, no. 3, pp. 535–549, 2009.
View at: Publisher Site | Google Scholar
C. Proust-Lima and B. Liquet, “lcmm: an R package for estimation of latent class mixed models and joint latent class models,” in Proceedings of the The R User Conference, useR! 2011, University of Warwick, Coventry, UK, August, 2011.
View at: Google Scholar
C. Proust-Lima, L. Letenneur, and H. Jacqmin-Gadda, “A nonlinear latent class model for joint analysis of mutivariate longitudinal data and a binary outcome,” Statistics in Medicine, vol. 26, no. 10, pp. 2229–2245, 2007.
View at: Publisher Site | Google Scholar | MathSciNet
C. Proust-Lima, P. Joly, J.-F. Dartigues, and H. Jacqmin-Gadda, “Joint modelling of multivariate longitudinal outcomes and a time-to-event: A nonlinear latent class approach,” Computational Statistics & Data Analysis, vol. 53, no. 4, pp. 1142–1154, 2009.
View at: Publisher Site | Google Scholar
H. Lin, C. E. McCulloch, and R. A. Rosenheck, “Latent pattern mixture models for informative intermittent missing data in longitudinal studies,” Biometrics, vol. 60, no. 2, pp. 295–305, 2004.
View at: Publisher Site | Google Scholar
C. Proust-Lima, V. Philipps, and B. Liquet, “Estimation of extended mixed models using latent classes and latent processes: the R package lcmm,” Journal of Statistical Software, vol. 78, no. 2, pp. 1–56, 2017.
View at: Publisher Site | Google Scholar
J. A. Hagenaars and A. L. McCutcheon, Applied Latent Class Analysis, Cambridge University Press, 2002.
J. Petersen, K. Bandeen-Roche, E. Budtz-Jørgensen, and K. G. Larsen, “Predicting latent class scores for subsequent analysis,” Psychometrika, vol. 77, no. 2, pp. 244–262, 2012.
View at: Publisher Site | Google Scholar | MathSciNet
P. C. Austin, “Generating survival times to simulate Cox proportional hazards models with time-varying covariates,” Statistics in Medicine, vol. 31, no. 29, pp. 3946–3958, 2012.
View at: Publisher Site | Google Scholar | MathSciNet
R. Bender, T. Augustin, and M. Blettner, “Generating survival times to simulate Cox proportional hazards models,” Statistics in Medicine, vol. 24, no. 11, pp. 1713–1723, 2005.
View at: Publisher Site | Google Scholar | MathSciNet
D. Rizopoulos, “Dynamic Predictions and Prospective Accuracy in Joint Models for Longitudinal and Time-to-Event Data,” Biometrics, vol. 67, no. 3, pp. 819–829, 2011.
View at: Publisher Site | Google Scholar
D. Gökengin, F. Doroudi, J. Tohme, B. Collins, and N. Madani, “HIV/AIDS: Trends in the Middle East and North Africa region,” International Journal of Infectious Diseases, vol. 44, pp. 66–73, 2016.
View at: Publisher Site | Google Scholar
WHO organization., “HIV/AIDS,” 2017, http://www.who.int/hiv/data/en.
View at: Google Scholar
M. Farahani, V. Novitsky, R. Wang et al., “Prognostic value of HIV-1 RNA on CD4 trajectories and disease progression among antiretroviral-naive HIV-infected adults in Botswana: A joint modeling analysis,” AIDS Research and Human Retroviruses, vol. 32, no. 6, pp. 573–578, 2016.
View at: Publisher Site | Google Scholar
S. L. Brilleman, M. J. Crowther, M. T. May, M. Gompels, and K. R. Abrams, “Joint longitudinal hurdle and time-to-event models: an application related to viral load and duration of the first treatment regimen in patients with HIV initiating therapy,” Statistics in Medicine, vol. 35, no. 20, pp. 3583–3594, 2016.
View at: Publisher Site | Google Scholar | MathSciNet
R. Song, H. I. Hall, T. A. Green, C. L. Szwarcwald, and N. Pantazis, “Using CD4 Data to Estimate HIV Incidence, Prevalence, and Percent of Undiagnosed Infections in the United States,” Journal of Acquired Immune Deficiency Syndromes, vol. 74, no. 1, pp. 3–9, 2017.
View at: Publisher Site | Google Scholar
D. I. Abrams, A. I. Goldman, C. Launer et al., “A comparative trial of didanosine or zalcitabine after treatment with zidovudine in patients with human immunodeficiency virus infection,” The New England Journal of Medicine, vol. 330, no. 10, pp. 657–662, 1994.
View at: Publisher Site | Google Scholar
R. M. Elashoff, G. Li, and N. Li, “An approach to joint analysis of longitudinal measurements and competing risks failure time data,” Statistics in Medicine, vol. 26, no. 14, pp. 2813–2835, 2007.
View at: Publisher Site | Google Scholar | MathSciNet
R. Henderson, P. Diggle, and A. Dobson, “Joint modelling of longitudinal measurements and event time data,” Biostatistics, vol. 1, no. 4, pp. 465–480, 2000.
View at: Publisher Site | Google Scholar
Ö. Asar, J. Ritchie, P. A. Kalra, and P. J. Diggle, “Joint modelling of repeated measurement and time-to-event data: An introductory tutorial,” International Journal of Epidemiology, vol. 44, no. 1, pp. 334–344, 2015.
View at: Publisher Site | Google Scholar
Y.-K. Tseng, F. Hsieh, and J.-L. Wang, “Joint modelling of accelerated failure time and longitudinal data,” Biometrika, vol. 92, no. 3, pp. 587–603, 2005.
View at: Publisher Site | Google Scholar | MathSciNet

Copyright

Copyright © 2018 Narges Roustaei et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

2263

Downloads

1583

Citations