Table of Contents Author Guidelines Submit a Manuscript
Canadian Respiratory Journal
Volume 2016, Article ID 1690482, 6 pages
Research Article

Identifying Primary Spontaneous Pneumothorax from Administrative Databases: A Validation Study

1Division of Thoracic Surgery, Department of Surgery, Western University, London, ON, Canada N6A 3K7
2Department of Epidemiology and Biostatistics, Western University, London, ON, Canada N6A 3K7
3Division of Pediatric Surgery, Department of Surgery, Western University, London, ON, Canada N6A 3K7
4London Health Sciences Centre, London, ON, Canada N6A 5W9
5Division of Urology, Department of Surgery, Western University, London, ON, Canada N6A 3K7

Received 5 January 2016; Accepted 12 May 2016

Academic Editor: Elisa Giovannetti

Copyright © 2016 Eric Frechette et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


Introduction. Primary spontaneous pneumothorax (PSP) is a disorder commonly encountered in healthy young individuals. There is no differentiation between PSP and secondary pneumothorax (SP) in the current version of the International Classification of Diseases (ICD-10). This complicates the conduct of epidemiological studies on the subject. Objective. To validate the accuracy of an algorithm that identifies cases of PSP from administrative databases. Methods. The charts of 150 patients who consulted the emergency room (ER) with a recorded main diagnosis of pneumothorax were reviewed to define the type of pneumothorax that occurred. The corresponding hospital administrative data collected during previous hospitalizations and ER visits were processed through the proposed algorithm. The results were compared over two different age groups. Results. There were 144 cases of pneumothorax correctly coded (96%). The results obtained from the PSP algorithm demonstrated a significantly higher sensitivity (97% versus 81%, ) and positive predictive value (87% versus 46%, ) in patients under 40 years of age than in older patients. Conclusions. The proposed algorithm is adequate to identify cases of PSP from administrative databases in the age group classically associated with the disease. This makes possible its utilization in large population-based studies.

1. Introduction

Primary spontaneous pneumothorax (PSP) is a relatively frequent clinical entity, reported to be between 6 and 24 cases per 100,000 persons annually [1]. It occurs most commonly in males and has an age distribution peak between 20 and 30 years of age [2, 3]. It is caused by the rupture of lung bulla or blebs. Its treatment varies from simple observation to drainage or surgery in cases of recurrence or massive lung collapse [4]. PSP must be differentiated from SP, which arises from an underlying chronic lung disease, such as COPD, cystic fibrosis, or lymphangioleiomyomatosis, or occurs following chest trauma, as a result of a medical procedure, or some acute medical condition such as status asthmaticus or lung abscess [5]. The peak incidence from a SP has been suggested to arise at a later age, past 55 years of age [6]. Although the initial management of the different types of pneumothorax might be similar, they have different etiology, physiopathology, and principles of long-term treatment [7].

Administrative hospital data are potentially powerful tools, allowing epidemiological study of diseases, but the accuracy of the data must be confirmed. The tenth edition of the International Classification of Diseases (ICD-10-CA) was introduced in Canada in 2000, and it includes codes allowing the identification of patients seeking medical treatment for pneumothorax [8]. A subdivision of the code J93 further differentiates between spontaneous tension pneumothorax, other spontaneous pneumothorax, other pneumothorax, and pneumothorax (unspecified) [9]. Unfortunately, the classification does not clearly allow the identification of PSP, which would be necessary for the development of large population-based studies on the subject using administrative data from emergency room visits and hospital discharge summaries.

The objective of this study was to validate the accuracy of an administrative data algorithm designed to identify PSP cases from the recorded emergency room and hospital discharge summary.

2. Materials and Methods

2.1. Data Source and Quality
2.1.1. Reference Standard

The research protocol was reviewed and approved by the Health Sciences Research Ethics Board at the University of Western Ontario (00000940-105409). A total of 150 consecutive patients who consulted the emergency department of a single major university hospital serving Southwestern Ontario, Canada, from January 2003 to March 2010 with a main ICD-10-CA diagnosis of pneumothorax (J930, J931, J938, or J939) were identified from the hospital registry recorded in the National Ambulatory Care Reporting System (NACRS). These codes include the complete range of disease severity. Patient charts were abstracted by two independent physicians blinded to the recorded administrative data (KG, AS). The reviewers confirmed or rejected the diagnosis of pneumothorax from the documents contained in the charts, including the radiology reports and images if needed. All cases of pneumothorax were defined as PSP or SP. Where there was disagreement between reviewers, the diagnosis was obtained following a review by a third physician (EF).

2.1.2. Administrative Data

For each patient all information from index date and previous visits Discharge Abstract Database (DAD) and NACRS recorded locally was obtained. The information collected for the analysis included the admission and discharge dates and all ICD-10-CA diagnostic codes. The data from the different visits and databases were pooled for each individual and submitted to the algorithm.

2.2. Administrative Data Algorithm

The development of the algorithm was based on the same concepts used clinically to differentiate between PSP and other types of pneumothorax. To identify potential PSP, the algorithm was designed to exclude SP cases using these three steps:(1)Patients with a hospital admission record in DAD in the previous 30 days were excluded unless their main diagnosis was pneumothorax, as they could represent cases of iatrogenic pneumothorax.(2)Patients with a chronic condition associated with SP were excluded. We identified these patients based on all DAD and NACRS recorded locally for 14 conditions present on the index visit date or earlier.(3)We also excluded the patients presenting with an acute condition potentially causing SP in the 30 days prior to the index visit.The complete list of acute and chronic conditions used in the algorithm, with corresponding codes, is detailed in Table 1. The results obtained from the algorithm and from the chart review were then compared. The results were stratified over two age groups, patients under 40 years and patients 40 years or older, this threshold representing the midpoint of the two distribution peaks for PSP and SP.

Table 1: Conditions identified as potential causes of secondary pneumothorax with corresponding ICD-10-CA codes.
2.3. Statistical Methods

The chart review results were considered as the gold standard reference for the validation of the proposed algorithm. Comparing the results from the chart review and administrative data algorithm, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated, including the 95% confidence interval. Significance levels were obtained from Fisher’s exact method using SAS software version 9.3. The kappa statistic was calculated as a measure of agreement between reviewers’ results. Additional calculations of sensitivity and specificity were performed to evaluate the ability of the algorithm to define PSP when limiting the diagnosis to patients below different age thresholds set between 30 and 50 years of age.

3. Results

Of the 150 patients included in this study, 95 were under the age of 40 years and 55 were aged 40 years or over. The chart review could not identify any evidence of pneumothorax in the charts of 6 patients, while 96% had a pneumothorax corresponding appropriately to the main diagnosis. These included 90 cases of PSP, representing 60% of patients. Eighty-two percent of these (74 cases) were found in patients under 40 years, while 18% were identified in older patients. There was an agreement of 88% between the two reviewers, corresponding to a kappa statistic of 0.76, and a third review was necessary in 18 cases.

The review of the administrative data revealed that the code J939 (pneumothorax, unspecified) was the most commonly used to describe the main diagnosis; however in younger patients, the code J931 was also commonly used. The reason to suspect a SP from the administrative data was most commonly related to an associated chronic condition; however in some patients more than one potential cause was identified. These results are detailed in Table 2.

Table 2: Distribution of patients with a main diagnosis code of pneumothorax according to age group.

The ability of the proposed algorithm to identify cases of PSP is detailed in Table 3, which summarizes diagnostic accuracy. Overall, sensitivity of the algorithm for PSP was 94%, being higher in younger patients (97%) than in older patients (81%), . Similarly, a higher PPV was obtained when the algorithm was applied to the younger patients population, 87%, compared to 46% for the patients aged 40 years or more (). The overall specificity of the algorithm was 57%, and no significant difference was identified between the younger and older patient groups. Similarly, there was no significant difference between the two groups NPVs, the overall value estimated to 87%.

Table 3: Diagnostic performance of the administrative data algorithm for the identification of primary spontaneous pneumothorax cases (estimate and 95% confidence interval).

In order to evaluate the appropriateness of the 40-year threshold used in the study, we performed additional modifications on the algorithm for different age levels varying between 30 and 50 years. We calculated the sensitivity and specificity of the algorithm applied to the 150 patients when limiting the diagnosis of PSP to patients younger than each age point. Between the levels of 30 to 50 years, the sensitivity varied from 59% to 88% and the specificity from 82% to 63% (Figure 1).

Figure 1: Sensitivity and specificity of the algorithm’s modifications applied to the 150 patients and restricting the diagnosis of primary spontaneous pneumothorax to patients below the age level detailed on the axis.

4. Discussion

PSP can happen spontaneously without any underlying cause, usually in young and previously healthy individuals. It differs from SP, which represents a proportion of about half of all pneumothoraces [3, 1012].

All pneumothoraces will require similar initial management; however, once urgent treatment is completed, the care given to patients presenting with either PSP or SP will differ [6, 13]. Treatment of the underlying cause of SP will be necessary and could include chest wall stabilization, optimization of the COPD or asthma medication, and antimicrobial treatment of pneumonia or lung abscess. For PSP, aside from smoking cessation and recommendation to avoid certain activities, most individuals will require no further treatment, but surgery could be offered to patients who are at a higher risk of recurrence or complications [4, 7, 14].

There is a need for large epidemiological studies on the subject of PSP to help define predictors of recurrence and lead to treatment recommendation tailored to this otherwise healthy and potentially productive population. Population-based studies from the United Kingdom and France have been published in the last few years and have produced significant knowledge of the subject; however, these included both primary and SP. Hallifax and Rahman noted an unexpectedly low proportion of SP (14%) in a report from the French national healthcare database, which has not been fully explained and suggests that a proportion of the cases might not be classified appropriately [15, 16]. In the present review, 54 patients presented with a SP, representing 38% of cases.

The 10th version of the ICD, used in Canada as in many other jurisdictions, does not include a subclassification allowing the identification of the different types of spontaneous pneumothorax. This situation is problematic and limits the ability of researchers to target PSP specifically. In the United States, a different version of the ICD is used, ICD-10-CM (Clinical Modification), which includes a subdivision of the J931 code corresponding to PSP: J9311 [17]. However, it is difficult to anticipate the effect of this subclassification on eventual epidemiological studies because of the presence of three other J93 codes which could be used for PSP. In our study, only 38% of patients were coded J931; the majority (57%) of the charts were labeled with the code J939: pneumothorax, unspecified.

Code-validation is necessary before conducting population-based studies to ensure adequacy of the patient cohorts [18]. In Canada, the Institute for Clinical Evaluative Sciences (ICES) reported in 2006 that the pneumothorax code J93 is amongst the most accurately used (95%) of the main diagnosis codes, and this correlates to the results of this review which identified a miscode rate of only 4% [8]. However, to the authors’ knowledge, our report represents the first attempt to differentiate between PSP and SP using hospital data. Defining PSP as only the absence of associated codes for underlying lung disease was not considered sufficient in the design of this study and three exclusion criteria were created with the goal of obtaining a better case definition. Based on the results of the administrative data algorithm, these exclusion criteria identified a significant proportion of patients experiencing SP, particularly in younger patients for which the presence of a chronic condition was responsible for only 42% of cases defined as SP.

PSP is known to occur in a younger population than SP, and multiple studies have suggested that peak incidence occurs between 20 and 30 years of age. When creating the inclusion criteria for a population-based study oriented towards this age group, an option would be to use patient age to minimize the number of misdiagnoses that would be associated with inclusion of older patients. This study’s dichotomization of younger versus older patients allowed observation of the effect of this strategy. The ability of the proposed algorithm to accurately identify cases of PSP cases was significantly better when limited to the younger group of patients. However, using this method would exclude cases of true PSP occurring in older patients (16 individuals in this study) and limit the participation to about 82% of potential patients. However, these individuals might also have a nondiagnosed underlying lung condition.

The proposed algorithm demonstrated a good diagnostic accuracy in defining PSP cases amongst patients under 40 years of age. The 97% sensitivity rate found in this age group suggests that almost all PSP cases can be identified with the use of administrative data. And as demonstrated by the 87% PPV, the identification of a PSP case from the algorithm will be correct in a very high proportion of patients. These findings support the use of this method of identification when conducting population-based epidemiological studies of PSP based on administrative data using ICD-10 codes. These findings also suggest that limiting the diagnosis of PSP to the absence of recorded concurrent lung conditions might be insufficient and could be improved by the use of more restrictive exclusion criteria.

The limitations of the proposed algorithm are related to its lower-than-expected specificity. This is a direct consequence of a high number of patients falsely considered to have a PSP, particularly in the older population. This suggests either that the strict criteria used in this study for PSP definition might not be sufficient or, more likely, that the recording of secondary diagnosis into NACRS and DAD databases is occasionally incomplete. Thus, it would be prudent to limit the use of the algorithm to the suggested target population of PSP, as its accuracy might be insufficient when attempting to build a cohort of patients with SP.

5. Conclusions

PSP is a clinical entity distinct from SP for which there is a paucity of epidemiological studies. The ICD-10 codes used in most countries do not allow its direct identification. An algorithm was created, which uses the information available in NACRS and DAD to overcome this deficiency. Of 150 patients presenting with a diagnosis of pneumothorax, we validated this algorithm to have a sensitivity of 97% and a PPV of 87% when used in a population of young patients known to be at higher risk of developing a PSP. These results support the use of administrative data to identify patients consulting for PSP and give researchers the tools required to conduct epidemiological studies on large population-based data. Such studies will improve the understanding of the disease, its treatment, and related outcomes.


Funding sources had no involvement in the design, collection, or analysis of data and the opinions, results, and conclusions reported in this paper are those of the authors and are independent from the funding sources. No endorsement by the Ontario MOHLTC is intended or should be inferred.

Competing Interests

None of the authors have any actual or potential competing interests to declare.

Authors’ Contributions

All authors contributed to the design of the study, interpretation of study results, and writing of the paper. Dr. Eric Frechette performed all data analysis.


Members of the Surgical Investigators Group at ICES Western (Chris Bailey, Muriel Brackstone, Sumit Dave, Luc Dubois, Steve Latosinsky, Danielle MacNeil, Jacob McGee, Dave Nagpal, Stephen E. Pautler, and Kelly Vogt) served as scientific and clinical advisors. The authors would like to thank Dr. Richard Inculet and Dr. Dalilah Fortin for their support and input throughout the study process. They would like also to thank Mrs. Debra Lewis and Mrs. Darlene Palmer for their help during the data collection process. This study was supported by an Internal Research Grant from the Department of Surgery, Western University.


  1. J.-M. Tschopp, O. Bintcliffe, P. Astoul et al., “ERS task force statement: diagnosis and treatment of primary spontaneous pneumothorax,” European Respiratory Journal, vol. 46, no. 2, pp. 321–335, 2015. View at Publisher · View at Google Scholar · View at Scopus
  2. D. Gupta, A. Hansell, T. Nichols, T. Duong, J. G. Ayres, and D. Strachan, “Epidemiology of pneumothorax in England,” Thorax, vol. 55, no. 8, pp. 666–671, 2000. View at Publisher · View at Google Scholar · View at Scopus
  3. L. J. Melton III, N. G. G. Hepper, and K. P. Offord, “Incidence of spontaneous pneumothorax in Olmsted county, Minnesota: 1950 to 1974,” American Review of Respiratory Disease, vol. 120, no. 6, pp. 1379–1382, 1979. View at Google Scholar · View at Scopus
  4. G. Massard, P. Thomas, and J.-M. Wihlm, “Minimally invasive management for first and recurrent pneumothorax,” Annals of Thoracic Surgery, vol. 66, no. 2, pp. 592–599, 1998. View at Publisher · View at Google Scholar · View at Scopus
  5. C. V. Ruckley and R. J. McCormack, “The management of spontaneous pneumothorax,” Thorax, vol. 21, no. 2, pp. 139–144, 1966. View at Publisher · View at Google Scholar · View at Scopus
  6. F. Tanaka, M. Itoh, H. Esaki, J. Isobe, Y. Ueno, and R. Inoue, “Secondary spontaneous pneumothorax,” The Annals of Thoracic Surgery, vol. 55, no. 2, pp. 372–376, 1993. View at Publisher · View at Google Scholar · View at Scopus
  7. A. MacDuff, A. Arnold, and J. Harvey, “Management of spontaneous pneumothorax: British Thoracic Society pleural disease guideline 2010,” Thorax, vol. 65, no. 2, pp. ii18–ii31, 2010. View at Publisher · View at Google Scholar · View at Scopus
  8. D. N. Juurlink, Institute for Clinical Evaluative Sciences in Ontario. Canadian Institute for Health Information Discharge Abstract Database, vol. 69, Institute for Clinical Evaluative Sciences, Toronto, Canada, 2006.
  9. Canadian Institute for Health Information, “ICD-10-CA,”
  10. W. R. Primrose, “Spontaneous pneumothorax: a retrospective review of aetiology, pathogenesis and management,” Scottish Medical Journal, vol. 29, no. 1, pp. 15–20, 1984. View at Google Scholar · View at Scopus
  11. P. Ferraro, G. Beauchamp, F. Lord, C. Emond, and E. Bastien, “Spontaneous primary and secondary pneumothorax: a 10-year study of management alternatives,” Canadian Journal of Surgery, vol. 37, no. 3, pp. 197–202, 1994. View at Google Scholar · View at Scopus
  12. D. Weissberg and Y. Refaely, “Pneumothorax: experience with 1,199 patients,” Chest, vol. 117, no. 5, pp. 1279–1285, 2000. View at Publisher · View at Google Scholar · View at Scopus
  13. M. Noppen and M. H. Baumann, “Pathogenesis and treatment of primary spontaneous pneumothorax: an overview,” Respiration, vol. 70, no. 4, pp. 431–438, 2003. View at Publisher · View at Google Scholar · View at Scopus
  14. M. H. Bauman, C. Strange, J. E. Heffner et al., “Management of spontaneous pneumothorax: an American College of Chest Physicians Delphi Consensus Statement,” Chest, vol. 119, no. 2, pp. 590–602, 2001. View at Publisher · View at Google Scholar · View at Scopus
  15. A. Bobbio, A. Dechartres, S. Bouam et al., “Epidemiology of spontaneous pneumothorax: gender-related differences,” Thorax, vol. 70, no. 7, pp. 653–658, 2015. View at Publisher · View at Google Scholar · View at Scopus
  16. R. J. Hallifax and N. M. Rahman, “Epidemiology of pneumothorax—finally something solid out of thin air,” Thorax, vol. 70, no. 10, pp. 921–922, 2015. View at Publisher · View at Google Scholar · View at Scopus
  17. Centres for Medicare and Medicaid Services, ICD-10, September 2015,
  18. Y. Lacasse, J.-M. Daigle, S. Martin, and F. Maltais, “Validity of chronic obstructive pulmonary disease diagnoses in a large administrative database,” Canadian Respiratory Journal, vol. 19, no. 2, pp. e5–e9, 2012. View at Google Scholar · View at Scopus