Abstract

Background and Aims. Benzodiazepines have been widely used for long periods of time despite their adverse effects. The acute effects on cognition are well established. However, less is known about the long-term effects. This study critically reviewed existing evidence of the association between long-term exposure to benzodiazepines and risk of cognitive decline in adults. Methods. A systematic review with narrative synthesis was conducted. PubMed and PsycINFO databases were searched using combinations of keywords related to “benzodiazepines” and “cognitive function” from database inception to 12 February 2018 to identify prospective longitudinal studies. The records were evaluated for relevance according to the inclusion and exclusion criteria. Results. Fourteen studies involving 2145 long-term benzodiazepine users were included. Meta-analysis was not undertaken because the combined result would not be meaningful as the included studies differed in several key aspects such as frequency and duration of benzodiazepine use, follow-up periods, cognitive domains, cognitive tests, scoring systems, and statistical analysis. The definition of long-term benzodiazepine use was problematic in all the studies. The exposure was determined by measures which were assumed to represent the whole period in-between the follow-ups. Only 3 of the 14 studies provided support for an association between long-term benzodiazepine use and cognitive decline with a small to medium effect size. However, these three studies used different methods to assess the strength of this association. Global cognitive functioning, verbal memory, intelligence, psychomotor speed, and speed of processing were the cognitive domains affected which also varied across these three studies. Conclusions. Little evidence of an association between long-term benzodiazepine use and a higher risk of cognitive decline among the general adult population was found. However, discrepancies among the results and inconsistencies regarding the cognitive domains affected and methodological limitations prevent definite conclusions. Therefore, future research with prospective studies specially designed would be of great value.

1. Introduction

The prevalence of long-term use of benzodiazepine (BZD) worldwide, including the BZD related drugs called Z-drugs, is approximately 3% in the general population despite the repeated warnings over their safety and lack of evidence of long-term effectiveness [13]. The acute cognitive effects of BZDs are well established [3, 4]. However, the long-term impact on cognition is still an area of debate, despite epidemiological evidence which suggests that exposure to BZD might be a risk factor for cognitive impairment and cognitive decline [3, 5, 6]. Cognitive impairment refers to impairments in one or more cognitive functions according to a cognitive performance cutoff point and measured at a single time point whereas cognitive decline involves any deterioration in cognitive function over time according to change in cognitive performance between baseline and follow-up [7, 8]. Cognitive decline is more difficult to assess properly than cognitive impairment as its definition is sometimes arbitrary depending on the cognitive test and the statistical approach used [7]. For this reason, cognitive decline associated with long-term BZD use has been assessed by few prospective longitudinal studies.

Verdoux et al. [9] explored the association between long-term BZD use and cognitive decline through a literature review of prospective studies with unselected subjects from general population and concluded that there were many inconsistencies across the studies. Methodological issues such as differences in sampling method, nonspecification of parameters of BZD use, variation in the definition of BZD long-term exposure, cognitive assessment, follow-up characteristics, potential confounders, and statistical analyses might have compromised the comparability of findings [9, 10].

Also, a series of meta-analyses of the effects of long-term BZD use (≥1 year) on cognition were conducted by Barker et al. in 2004 [11, 12] and updated by Crowe and Stranks in 2018 [13]. Crowe and Stranks corroborated the findings of Barker et al. that there is an association between current [11] or previous [12] long-term BZD use and cognitive impairment. However, the findings might be difficult to generalise as most studies included in these meta-analyses were conducted on populations of problematic BZD users requiring specialised treatment, and thus the impairments measured could be related to the return of premorbid symptoms rather than reflecting the effects of chronic BZD treatment [9, 14]. In addition, these three meta-analyses address the concept of cognitive impairment indicating a cross-sectional deleterious effect of BZDs as opposed to the concept of cognitive decline which involves the longitudinal effects of BZDs on cognitive change with age [7].

Considering that the existing review [9] is somewhat out of date and no systematic review has been conducted on epidemiological studies that addressed the association between long-term BZD use and cognitive decline, the question remains open. Taken together, the potential burden of impaired cognitive functioning on society, the discrepancy of findings, and the methodological limitations of the studies, an updated review of prospective longitudinal studies addressing this issue is warranted. The aim of this review was to evaluate and integrate current data available by systematically reviewing studies that examined the association between cognitive decline and long-term exposure to BZDs in the general population.

2. Methods

This is a systematic literature review of prospective longitudinal studies that assessed the risk of cognitive decline in long-term BZD users compared with nonusers. It was conducted in line with the PRISMA (Preferred Reporting Items for Systematic reviews and Meta-Analyses) guidelines (Supplementary Materials Appendix S1) [15].

2.1. Data Source

The target population was defined as the general population (adults aged 18 or older) with no restriction as to the setting. Studies involving individuals with schizophrenia or other psychotic disorder as well as those with a comorbid substance use disorder other than BZD use were excluded. These conditions have been associated with cognitive impairment and therefore might have affected the results and the generalisability of the findings [16, 17]. The exposure was long-term BZD use defined as the use of a BZD for six months or longer during a time period of one year regardless of whether the use was daily or infrequent [1]. For the purpose of this review, BZDs included the Z-drugs. The comparator (control) group consisted of individuals who were BZD nonusers. The primary outcome of interest investigated was cognitive decline assessed using cognitive function tests [11].

2.2. Inclusion and Exclusion Criteria

Inclusion criteria were as follows: (1) prospective longitudinal studies that investigated the effects of long-term BZD use on cognitive decline; (2) administration of cognitive tests at a minimum of two time points with no minimum period of follow-up required; (3) presence of a control group of BZD nonusers; (4) human studies with adults aged 18 years or over; (5) studies published in English in peer-reviewed journals.

The exclusion criteria were as follows: (1) retrospective or cross-sectional studies; (2) studies with children or adolescents (<18 years old); (3) studies involving participants with schizophrenia or other psychotic disorder; (4) studies involving participants with a substance use disorder other than BZD; (5) BZD use for less than six months or not specified; (6) studies that focused on the effects of withdrawing from BZD; (7) studies with dementia as the only outcome; (8) specific information about the cognitive tests utilized not included; (9) control group which included subjects with prior long-term use of BZD.

2.3. Data Selection

Electronic searches were conducted in the PubMed and PsycINFO databases (via APA PsycNET platform) to identify potential studies to be included in the review. The searches were carried out for scientific papers with no publication date restrictions. Searches were based on combinations of keywords related to “benzodiazepines” and “cognitive function,” with separate searches for Z-drugs (details are given in Supplementary Materials Appendices S2 and S3). The last searches in both databases were performed on 12 February 2018. The references of the selected articles were also screened for further relevant studies. The database searches yielded a total of 978 records, of which 693 were retrieved through PubMed search (680 in Search 1; 13 in Search 2) and 285 through PsycINFO search (282 in Search 1; 3 in Search 2). One-hundred twenty-five records were identified as duplicates and removed. The remaining 853 records were initially evaluated for relevance by review of the title or abstract. Eight-hundred and eleven records met exclusion criteria and were excluded. The full text of the remaining 42 articles and of the four additional articles obtained by hand searching references were assessed against the inclusion and exclusion criteria to determine eligibility. Fourteen studies that fulfilled inclusion criteria were included in the systematic review [7, 10, 1829]. Figure 1 shows the PRISMA flow diagram used to summarize the study selection process [15]. The search strategy was determined by both reviewers. The screening of titles, abstracts, and full texts was conducted by reviewer D.N. and approved by reviewer L.G.

2.4. Data Analysis

A data extraction form was developed and used to record summary information on the study design, participant characteristics, and outcomes of cognitive assessments for studies that met the criteria for inclusion. The methodological quality and risk of bias assessments of the included studies were based on the Joana Briggs Institute Critical Appraisal Checklist for Cohort Studies [30]. These assessments were conducted by reviewer D.N. and approved by reviewer L.G.

Meta-analysis was not undertaken because the combined result would not be meaningful as the outcome measures and the detail of study design of the included studies were too different. The studies differed in several key aspects such as definition of long-term BZD use, duration of BZD use, follow-up periods, cognitive domains, cognitive tests, scoring systems, and statistical analysis. Thus, a narrative synthesis [31] was conducted to provide an overview of the association between BZD use and cognitive decline.

3. Results

The included studies involved 2145 long-term BZD users. A description of the main features of each study can be found in Appendix S4 (Supplementary Materials). Also, a list of the excluded studies [14, 3262] with reasons for exclusion is shown in Appendix S5 (Supplementary Materials).

3.1. Characteristics of Included Studies

All the studies were conducted in developed countries and had a prospective longitudinal design. They used data either from large epidemiological studies or from national databases and took place basically in outpatient setting. Except for one prospective case-control study [28], the studies were all prospective cohort studies, mostly population-based cohort studies which allow the estimation of distributions of relevant variables in a defined population [63].

Assessment of BZD use was based on self-report, pharmacist records, prescription forms, or inspection of drug containers. There was considerable variability across the studies in the described frequency of BZD from daily use to regular use or even irregular use, and there was a lack of clarity in definition of these terms. Chronic BZD use was typically defined as continued use from baseline to the follow-up endpoint and at all the measurements. It differed from previous or temporary use in not extending to the end of follow-up and from new use reported only at the follow-up endpoint. The estimated duration of use varied from 1 year to 9 years. The follow-up evaluation times ranged from 2 to 22 years and were between 3 and 10 years in 85% of the studies.

The cognitive domains were assessed by 30 cognitive tests as shown in Table 1, whereas the items that were extracted from each study such as study attributes, subject characteristics, and pattern of BZD use including frequency and estimated duration are summarised in Table 2.

3.2. Results of Comparisons

Only 3 of the 14 studies provided support for an association between long-term BZD use and cognitive decline [10, 20, 24]. Paterniti et al. [20] used logistic regression models to assess whether long-term BZD users were more likely to develop cognitive decline compared with nonusers. They demonstrated that long-term BZD use was significantly associated with a higher risk of decline for global cognitive functioning (OR = 1.9, 95% CI = 1.0, 3.6), psychomotor speed (OR = 2.5, 95% CI = 1.5, 4.4), and speed of processing (OR = 2.3, 95% CI = 1.3, 4.1) compared with nonusers. Further, Bierman et al. [10] performed multilevel analyses to investigate the effect of long-term BZD use on cognitive decline. They calculated the effect size (F2) and found that long-term BZD users showed decline in global cognitive functioning (F2 = 0.012, ), intelligence (F2 = 0.007, ), and verbal memory (delayed recall: F2 = 0.003, ; retention: F2 = 0.004, ) compared with nonusers. Finally, multivariate analyses stratified on gender by Boef-Cazou et al. [24] revealed that BZD use significantly affected verbal memory, namely, delayed free recall, but only in women. The amount of change found in women long-term BZD users compared with nonusers was indicated by the β value given with the standard deviation (β = −2.13 ± 0.67, ). The results of statistical analyses from all the included studies are reported with the cognitive domains assessed, the follow-up evaluation times, and whether cognitive decline was found or not in Table 3.

On the other hand, the examination of moderators revealed that no sociodemographic variable or specific parameter of BZD use was consistently associated with an increased risk of cognitive decline. Paterniti et al. [20] found no predictors of cognitive decline. Bierman et al. [10] reported that both duration and cumulative exposure to BZD were positively correlated with cognitive decline, but duration of BZD use had a greater effect on cognitive performance than the dosage. Hanlon et al. [19] demonstrated a duration-response as well as a dose-response relationship for cognitive impairment (but not for cognitive decline) and BZD use. Conversely, other studies did not confirm this positive correlation. In the study by Gallacher et al. [26], greater duration of BZD use was not associated with greater risk of cognitive decline as participants with an estimated duration of BZD use of 4 years or less were almost twice as likely to have dementia as those with more than 4 years taking BZDs. Similarly, Gray et al. [27] reported that higher BZD use was not associated with more rapid cognitive decline. They found a small increase in the risk of dementia in people with minimal exposure to BZDs but not with the highest level of exposure.

3.3. Methodological Quality

The details of the appraisal tool applied to each study are shown in Appendix S4 (Supplementary Materials). Considering a study to be of good quality when the proportion of “Yes” answers is at least 65%, of low quality when the score is below 40%, and of moderate quality when it is between 40% and 64%, twelve studies were of moderate quality [7, 10, 1820, 2227, 29], one of good quality [28], and another one of low quality [21] (Table 2).

In all the studies, BZD users and nonusers were recruited from the same population and correctly assigned to their groups. Also, the confounding factors were identified as shown in Table 4. Similarly, strategies to deal with the confounders and regression analysis techniques were stated and seemed appropriate in nearly all the studies.

The most frequently identified methodological weakness was related to the BZD exposure which was not measured in a valid and reliable way in any of the studies. The measurements were determined by indirect means and typically referred to the use at interview or over short periods before ranging from 2 weeks to 2 months. In all the studies, it was assumed that the use lasted up to the following assessment and thus it provided only an estimate of the likely duration of BZD use [26]. As a result, the overall validity of the measurements was compromised. Other common methodological flaws were related to (1) baseline differences between BZD users and controls [7, 18, 2027, 29] as shown in Table 4, (2) lack of strategies to address incomplete follow-up [7, 10, 1827], (3) assessment of cognitive function limited to global cognitive functioning [18, 19, 2325, 29], (4) follow-up times not considered long enough for cognitive decline to occur [1921, 25, 28, 29], and (5) cognition not normal at baseline [7, 2022, 26].

4. Discussion

4.1. Main Findings

This systematic review identified 14 prospective longitudinal studies addressing the association of long-term BZD use and cognitive decline. Overall, the methodological quality of the studies was similar. Twelve studies were of moderate quality, one of good quality [28], and another one of low quality [21]. The large majority of the studies reviewed failed to demonstrate a higher risk of cognitive decline in long-term BZD users. Only 3 of the 14 studies provided support for an association between long-term benzodiazepine use and cognitive decline with a small to medium effect size [10, 20, 24]. These studies were of moderate quality as well as most of the studies which failed to demonstrate a higher risk for cognitive decline. Global cognitive functioning, verbal memory, intelligence, psychomotor speed, and speed of processing were the cognitive domains affected. These three studies used different methods to assess the strength of the association between long-term BZD use and cognitive decline. The effect sizes that Bierman et al. [10] determined (F2 = 0.012, for global cognitive functioning; F2 = 0.003, , and F2 = 0.004, for verbal memory; and F2 = 0.007, for fluid intelligence) were deemed to be small according to Cohen’s convention [64] about F2 values (0.02 is small, 0.15 medium, and 0.35 large). On the other hand, Paterniti et al. [20] computed odds ratios (ORs) rather than calculating effect sizes. In order to assess the magnitude of their findings, the equivalence of OR and effect size, namely, Cohen’s d, should be estimated. Cohen’s d, the standardized mean difference between two group means, is generally accepted as an indication of a small (d = 0.2), medium (d = 0.5), and large (d = 0.8) effect size [64,65]. Based on formulas for converting between the ORs and d, taken from Borenstein et al. [66] and made available for use by DeCoster [67], the ORs reported by Paterniti et al. [20] were converted to d (global cognitive functioning: OR = 1.9, d = 0.35; speed of processing: OR = 2.3, d = 0.45; and psychomotor speed: OR = 2.5, d = 0.50) and corresponded to a small to medium effect size.

4.2. Discrepancies among the Studies

The studies in this review controlled for many, but not all, confounding factors (Table 4). Therefore, confounders might provide one possible explanation for the discrepancies among them. For instance, the three studies that found cognitive decline to be associated with long-term BZD use did not adjust for current psychiatric diagnosis or psychiatric history which could account for the cognitive decline [10, 20, 24]. Also, use of psychotropic drugs other than BZDs was either not controlled for [10, 24] or controlled for as a single variate [20] rather than considering each class of drugs separately. Therefore, the impact of a specific class of drug on cognition could have been masked [25]. Other potential confounders not adjusted for were smoking [10] and depressive and anxious symptoms [24]. Moreover, the possibility of reverse causality should be considered in these three studies as the presence of cognitive impairment due to other causes could not be ruled out and might have contributed to long-term BZD use.

4.3. Possible Reasons for the Negative Findings

In addition to methodological limitations which will be further addressed, the depletion of susceptible effect should be considered [68]. With regard to the BZDs, it has been postulated that physicians would stop prescribing the drug to those susceptible to, and unable to tolerate, the cognitive effects. As a result, those who kept using the BZDs would correspond to those not predisposed to develop cognitive decline nor dementia associated with long-term BZD use. The depletion of susceptible effect could not be excluded in some studies [18, 22, 25, 27].

4.4. Implications for Clinical Practice

Although most studies in the current review have not provided evidence of a greater risk of cognitive decline associated with long-term benzodiazepine use, there is also no strong evidence for an absence of cognitive decline either. Therefore, physicians should take a cautious approach to prescribing BZDs, especially for elderly patients, avoiding higher doses and longer duration of treatment than those generally recommended, despite the fact that a quantitative exposure-effect relationship has not yet been established.

4.5. Limitations

The current review was subject to several limitations. First, a selection bias might have occurred in some studies, especially with the elderly. The most debilitated ones were more prone to not be included by virtue of nonresponse or loss to follow-up (e.g., due to death), resulting in a possible underestimation of cognitive decline. Second, the definition of long-term BZD users was problematic in all the studies. It was based on measures at fixed intervals which were assumed to represent the whole period in between the follow-ups. In this way, it is possible that intermittent and chronic BZD users as well as intermittent and nonusers were mixed in the same group. Third, information on BZD use history such as dose, frequency, duration, time since last dose, and half-life was not available in most studies. Fourth, despite the relatively large cohorts, the final sample size of long-term BZD users was usually small, thus limiting the power of analysis. Moreover, the samples were nonclinical, and therefore the amount, frequency, and duration of use might not have reached the threshold for a longitudinal effect on cognitive decline. Fifth, many studies utilized cognitive tests that were more appropriate for screening gross cognitive deficits than detecting subtle neuropsychological changes over time. As a result, cognitive decline might have been underestimated. Sixth, the included studies were not able to control for all confounding factors, and the confounders that were adjusted for varied across the studies, making comparisons more difficult. Seventh, the follow-up was short (<5 years) in several studies, and thus the time frame might not have been sufficient for cognitive decline to be detected. Finally, relevant studies might have been missed due to restricting the eligibility criteria to peer-reviewed publications in English. Similarly, having no studies from developing countries, considering basic economic country conditions, might affect the generalisability of the findings.

4.6. Implications for Research

Future research about the long-term cognitive effects of BZDs should favor prospective studies with a case-control design (i.e., comparison of long-term users and matched nonusers). The source population should be nonclinical, given that several medical conditions may be accompanied by decrements in cognitive function. Given that the major gap in the research conducted thus far has been to find a way to better characterize degree of BZD exposure across studies, it is crucial to collect detailed information on BZD use including whether the use continued with no interruptions between the follow-up evaluations. Further, a comprehensive neuropsychological test battery should be utilized. Finally, the duration of the follow-up period should be at least 5 years which is considered a sufficient period to detect adverse outcomes associated with BZD exposure [29].

5. Conclusions

This systematic review found little evidence of an association between long-term BZD use and progressive cognitive decline. Only 3 of the fourteen studies demonstrated that long-term BZD use was associated with a higher risk of cognitive decline among the general adult population. However, discrepancies among the results and inconsistencies regarding the cognitive domains affected and methodological limitations involving the included studies such as selection bias, problematic definition of long-term BZD users, lack of information on BZD use history, lack of adjustment for some confounding factors, small sample size of long-term BZD users, and insufficient follow-up times in some studies prevent definite conclusions. Therefore, future research with prospective studies specially designed would be of great value.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This review was undertaken in partial fulfilment of a master’s degree with the International Programme in Addiction Studies (IPAS) jointly delivered by the Virginia Commonwealth University, King’s College London, and University of Adelaide. D.N. is grateful to have been awarded a partial scholarship by the IPAS. He thanks his supervisor (L.G.) and all the professors for their encouragement and support during the programme.

Supplementary Materials

Appendix S1: PRISMA checklist. Appendix S2: PubMed search strategy. Appendix S3: PsycINFO search strategy via APA PsycNET Platform. Appendix S4: characteristics of included studies. Appendix S5: characteristics of excluded studies. (Supplementary Materials)