Pathological and Immunological Developments in Behcet’s DiseaseView this Special Issue
Diagnosis/Classification Criteria for Behcet's Disease
Historical Background. The ISG criteria for Behcet's, created in 1990, have excellent specificity, but lack sensitivity. The International Criteria for Behcet's Disease (ICBD) was created in 2006, as replacement to ISG. The aim of this study was to compare their performance. ISG and ICBD Criteria. For ISG oral aphthosis is mandatory. The presence of any two of the following (genital aphthosis, skin lesions, eye lesions, and positive pathergy test) will diagnose/classify the patient as BD. For ICBD, vascular lesions were added, while oral aphthosis is no more mandatory. Getting 3 or more points diagnose/classify the patient as BD (genital aphthosis 2 points, eye lesions 2 points, and the remaining each one point). Performance and Comparison of ISG and ICBD. Their sensitivity, specificity, and accuracy (percent agreement), were tested in three independent cohort of patients from Far-East (China), Middle-East (Iran), and Europe (Germany). The sensitivity for ISG was respectively 65.4%, 78.1%, 83.7% and for ICBD 87%, 98.2%, and 96.5%. The specificity for ISG was 99.2%, 98.8%, 89.5% and for ICBD 94.1%, 95.6%, and 73.7%. The accuracy for ISG was 74.2%, 85.5%, 85.5% and for ICBD 88.9%, 97.3%, and 89.5%. Conclusion. ICBD has better sensitivity, and accuracy than ISG.
1. Historical Background
Although Behcet’s Disease (BD) is relatively a young disease (described in 1937), it has already 16 sets of diagnosis/classification criteria. The first of them was proposed by Curth in 1946, less than 10 years after the description of the disease . It was followed by Hewitt et al. in 1969 , Mason and Barnes in 1969 , Hewitt et al. revised in 1971 , Japan in 1972 , Hubault and Hamza in 1974 , O'Duffy in 1974 , Chen in 1980 , Dilsen et al. in 1986 , Japan revised in 1988 , International Study Group (ISG) in 1990 , Iran in 1993 , Classification Tree in 1993 , Dilsen revised in 2000 , Korea in 2003 [15, 16], and the International Criteria for Behcet’s Disease (ICBD) in 2006 [17–19].
The ISG criteria were created in 1990 to bring a consensus on one set of criteria by the collaboration of France, Iran, Japan, Tunisia, Turkey, UK, and USA. With the sensitivity of ISG criteria being low [12, 20–25], during the first International Workshop of Behcet’s Disease in Kuhtai (Austria), it was decided to create an international team to evaluate the performance of ISG criteria and to compare it with the existing BD criteria and revise it if necessary.
The ITR-ICBD team was founded in 2004 with the participation of 27 countries (Austria, Azerbaijan, China, Egypt, France, Germany, Greece, India, Iran, Iraq, Israel, Italy, Japan, Jordan, Libya, Morocco, Pakistan, Portugal, Russia, Saudi Arabia, Singapore, Spain, Taiwan, Thailand, Tunisia, Turkey, and USA). The International Criteria for Behcet’s Disease (ICBD) were presented to the International Conference of Behcet’s Disease in Lisbon (Portugal) in 2006. Originally it had two formats, like the Iranian criteria. Later, it was decided to keep only the traditional format [17–19]. The ICBD were presented to the 2007 World Congress of Dermatology in Argentina and to 2009 ACR congress of Rheumatology in the USA .
2. ISG and ICBD Criteria
The ISG criteria  use 5 items. Two items are mucous membrane manifestations. They are oral aphthosis (OA) and genital aphthosis (GA). The third item is skin manifestations, comprising pseudofolliculitis (PF) and erythema nodosum (EN). The forth item is ocular manifestations. They are anterior uveitis (AU), posterior uveitis (PU), and retinal vasculitis (RV). The fifth item is the presence of pathergy phenomenon (PP). It is detected by the pathergy test [26–30]. In ISG criteria, the presence of OA is mandatory. Two other items from the 4 remaining (GA, skin, eye, PP) are necessary to classify a patient as having BD.
For the international criteria, the ICBD [17–19], vascular manifestations (VMs) have been added to the 5 items of ISG criteria, because they are one of the characteristics of BD, and were used in many criteria before the advent of ISG (Mason and Barnes, Hewitt, Hubault and Hamza, Dilsen, Japan revised, and Dilsen revised criteria). VM is defined as superficial phlebitis, deep vein thrombosis, large vein thrombosis, arterial thrombosis, and aneurysm. Therefore, ICBD use six items: OA, GA, skin (PF, EN), eye lesions (AU, PU, RV), VM, and PP. In the ICBD, genital aphthous lesions and eye lesions have more diagnostic value than the others. They get each 2 points. The other 4 items (OA, skin, VM, PP) get one point each. A patient has to get 3 or more points to be diagnosed/classified as having BD.
3. Performance and Comparison of ISG and ICBD
Many ways and methods can be used to evaluate the performance of a criteria set. The most common used are sensitivity, specificity, and accuracy. Other methods are the positive predictive value, the negative predictive value, the positive likelihood ratio, the negative likelihood ratio, the diagnostic odds ratio, and Youden’s index [35–39].
Sensitivity is the number of BD patients correctly classified (diagnosed) by the criteria. It is expressed as percentage (number of diagnosed BD patients, divided by the total number of BD patients, and then multiplied by 100) . The sensitivity of ISG in their cohort of 886 patients was 92% . The 95% confidence interval (95% CI) was 90% to 93.6%. The sensitivity of ICBD in their cohort of 2556 BD patients was 96.1% (95% CI 95.3–96.8). By chi-square test the difference between the two sets of criteria is statistically significant (χ2= 23.439, ). The sensitivity of ISG in the ICBD cohort of patients was 82.4% (95% CI80.9–83.9).
It is important to look at the sensitivity of the two criteria in independent cohort of patients. Three studies validated the ICBD in their cohort of patients: Germany in 2008 , China in 2008 , and Iran in 2010 . The sensitivity of ISG was, respectively, 83.7% (95% CI 74.3–90.1), 65.4% (95% CI 60.2–70.5), and 78.1% (95% CI 77–79.1). The sensitivity of ICBD was, respectively, 96.5% (95% CI 89.7–99.2), 87% (95% CI 82.8–90.2), and 98.2% (95% CI 97.8–98.5).
Specificity is the number of non-BD patients, correctly recognized as not having BD. It is expressed as percentage (number of non-BD patients correctly recognized as not having BD, divided by the total number of non-BD patients, then multiplied by 100) . The specificity of ISG criteria in their own cohort of patients was 97% (95% CI 90.8–99.3). However, the number of control patients was only 97, and all other control patients having oral aphthosis were discarded from the original cohort of control patients . The specificity of ICBD in their cohort of patients was 88.7% (95% CI 86.8–90.4). The specificity of ISG and ICBD in Germany, China, and Iran was, respectively, 89.5%, 99.2% and 98.8% (ISG), and 73.7%, 94.1%, and 95.6% (ICBD). Table 2 shows the specificity of different criteria in different studies.
Accuracy or percent agreement is the ability of the criteria to correctly recognize BD patients from the non-BD patients. It is also expressed by percentage (number of diagnosed BD patients + number of non-BD patients correctly recognized as not having BD, divided by the total number of BD patients + total number of non-BD patients, and then multiplied by 100) . The accuracy of ISG in their own cohort of patients was 92% (95% CI 90.1–93.5). The accuracy of ICBD in their own cohort of patients was 93.8% (95% CI 93–94.5). The accuracy of ISG and ICBD in Germany, China, and Iran was, respectively, 85.5%, 7402% and 85.5% (ISG), and 89.5%, 88.9%, and 97.3% (ICBD). Table 3 shows the accuracy of different criteria in different studies.
Positive predictive value (PPV) demonstrates the probability that the positive test be true positive. PPV is more influenced by specificity than sensitivity. A criteria set with 90% sensitivity and 90% specificity will have a PPV of 90. If sensitivity increases to 95, PPV will improve to 90.5%, while if specificity increases to 95%, PPV will improve to 94.8%. PPV is also greatly influenced by the prevalence of the disease. Taking the above example, the PPV remains the same (90) in a dedicated BD clinic, where 50% of patients have BD and 50% are controls (patients mimicking BD but are not true BD). In the general population, with a prevalence of 80 for 100,000 inhabitants, the PPV becomes only 0.72. Therefore the results calculated in a specific setting cannot be used in another setting . The PPV was higher for ISG than ICBD criteria in the 3 independent set of patients; however, the difference was very small in the Iranian patients, only 2.8% (Table 4).
Negative predictive value (NPV) indicates the probability of a negative test to be a true negative. The NPV also is influenced by the prevalence of the disease. On the contrary of PPV, the NPV is more influenced by sensitivity than specificity. It is also highly influenced by the prevalence of the disease .
Positive likelihood ratio (PLR) demonstrates the odds of having the disease. If PLR is superior to 5, it means that the test is related to the disease. It is highly influenced by specificity, as is the PPV. It is why the PLR is much higher for ISG criteria than ICBD (Table 4). Higher PLR for ISG means that, if ISG is positive, the chance of having BD is very high, but unfortunately ISG was negative in around 18% of subjects, in the 3 independent sets (Table 1).
Negative likelihood ratio (NLR) shows the odds of not having the disease. It is highly influenced by the sensitivity, as for the NPV. It has therefore better values for ICBD than for ISG criteria (Table 4). The high NLR for ICBD means that, if ICBD are negative, there are little chances for the patient to have BD (only 2% error rate for the Iranian patients: Table 4).
Diagnostic odds ratio (DOR) is a new way to show how much a test is reliable, like combining the PLR and NLR results. If DOR is 1, it means the test (criteria) does not discriminate between the patient and the control. The power of discrimination increases with higher values of DOR. The DOR of ISG is 294 and of ICBD is 1185 in the Iranian patients, demonstrating the high discriminative power of ICBD over ISG (Table 4).
Youden’s index (YI) is a rather old (1950) and simple calculation, combining the results of sensitivity and specificity, to show the performance of the diagnosis criteria. The result goes from zero to one. The more the result approaches 1, the higher the performance of the test is. The ideal is one, meaning a sensitivity and a specificity of 100%. A sensitivity and a specificity of 90% will give a YI of 0.8. The YI of ISG is inferior to ICBD in China and Iran (Table 4).
ICBD are the latest diagnosis/classification criteria, created by the participation of 27 countries from different parts of the world. The large number of Behcet’s disease patients and control patients, from inside and outside of the Silk Road, assures the variability needed to create an international criteria that can work in any country with different ethnicities. The validation of the criteria in the Far East, Middle-East, and Europe demonstrates its validity.
H. O. Curth, “Recurrent genito-oral aphthosis with hypopion (Behcet's syndrome),” Archives of Dermatology, vol. 54, pp. 179–196, 1946.View at: Google Scholar
J. Hewitt, J. P. Escande, P. H. Laurent, and L. Perlemuter, “Criteres de prevision du syndrome de Behcet,” Bulletin de la SocieteFrancaise de Dermatology et de Syphiligraphie, vol. 76, pp. 565–568, 1969.View at: Google Scholar
R. M. Mason and C. G. Barnes, “Behçet's syndrome with arthritis,” Annals of the Rheumatic Diseases, vol. 28, no. 2, pp. 95–103, 1969.View at: Google Scholar
J. Hewitt, J. P. Escande, and S. Manesse, “Revision of the diagnostic criteria of Behcet's syndrome,” La Presse Medicale, vol. 79, no. 20, p. 901, 1971.View at: Google Scholar
Behcet's Disease Research Committee of Japan, “Behcet's disease guide to the diagnosis of Behcet's disease (1972),” Japanese Journal of Ophthalmology, vol. 18, pp. 291–294, 1974.View at: Google Scholar
A. Hubault and M. Hamza, “La maladie de Behçet en 1974,” in L'actualité Rhumatologique 15, S. de Sezeet al, Ed., pp. 43–55, Expension Scientifique, Paris, France, 1974.View at: Google Scholar
J. D. O'Duffy, “Critères proposés pour le diagnostique de la maladie de Behçet et notes therapeutiques,” Revue de Medecine, vol. 36, pp. 2371–2379, 1974.View at: Google Scholar
S. P. Chen and X-Q. Zhang, “Some special clinical manifestations of Behcet's disease—report of illustrative cases and review of literature (author's transl),” Chinese jJournal of Internal Medicine, vol. 19, no. 1, pp. 15–22, 1980.View at: Google Scholar
N. Dilsen, M. Konice, and O. Aral, “Our diagnostic criteria of Behcet's disease—an overview,” in Recent Advances in Behcet's Disease, T. Lehner and C. G. Barnes, Eds., vol. 103 of International Congress and Symposium Series, pp. 177–180, London Royal Society of Medicine Services, London, UK, 1986.View at: Google Scholar
Y. Mizushima, “Recent research into Behcet's disease in Japan,” International Journal of Tissue Reactions, vol. 10, no. 2, pp. 59–65, 1988.View at: Google Scholar
International Study Group for Behcet's Disease, “Criteria for diagnosis of Behcet's disease,” Lancet, vol. 335, no. 8697, pp. 1078–1080, 1990.View at: Google Scholar
F. Davatchi, F. Shahram, M. Akbarian et al., “Accuracy of existing diagnosis criteria for Behcet's disease,” in Behcet's Disease, B. Wechsler and P. Godeau, Eds., vol. 1037 of Excerpta Medica International Congress Series, pp. 225–228, Amsterdam, The Netherlands, 1993.View at: Google Scholar
F. Davatchi, F. Shahram, M. Akbarian et al., “Classification tree for the diagnosis of Behcet's disease,” in Behcet's Disease, B. Wechsler and P. Godeau, Eds., vol. 1037 of Excerpta Medica International Congress Series, pp. 245–248, Amsterdam, The Netherlands, 1993.View at: Google Scholar
N. Dilsen, “About diagnostic criteria for Behcet's disease: our new proposal,” in Behcet's Disease, D. Bang, E. S. Lee, and S. Lee, Eds., pp. 101–104, Design Mecca Publishing, Seoul, Korea, 2000.View at: Google Scholar
H. K. Chang and S. Y. Kim, “Survey and validation of the criteria for Behcet's disease recently used in Korea: a suggestion for modification of the international study group criteria,” Journal of Korean Medical Science, vol. 18, no. 1, pp. 88–92, 2003.View at: Google Scholar
H. K. Chang, S. S. Lee, H. J. Bai et al., “Validation of the classification criteria commonly used in Korea and a modified set of preliminary criteria for Behçet's disease: a multi-center study,” Clinical and Experimental Rheumatology, vol. 22, no. 4, pp. S21–S26, 2004.View at: Google Scholar
International Team for the Revision of the International Criteria for Behcet's Disease, “Evaluation of the International Criteria for Behcet's disease (ICBD),” Clinical and Experimental Rheumatology, vol. 24, supplement 42, p. S13, 2006.View at: Google Scholar
International Team for the Revision of the International Criteria for Behcet's Disease, “Revision of the International Criteria for Behcet's Disease (ICBD),” Clinical and Experimental Rheumatology, vol. 24, supplement 42, pp. S14–S15, 2006.View at: Google Scholar
F. Davatchi, M. Schirmer, C. Zouboulis, S. Assad-Khalil, and K. T. Calamia, “on behalf International Team for the Revision of the International Criteria for Behcet's Disease,“Evaluation and Revision of the International Study Group Criteria for Behçet's Disease”,” in Proceedings of the American College of Rheumatology Meeting, Boston, Mass, USA, November 2007, abstract 1233.View at: Google Scholar
Y. Dong, Q. Yao, and M. Wang, “Behcet's disease in China,” in Proceedings of the 8th APLAR Congress of Rheumatology, p. S14, Melbourne, Australia, April 1996.View at: Google Scholar
APLAR subcommittee for Behcet's Disease, “APLAR evaluation of Behcet's disease diagnosis criteria,” APLAR Journal of Rheumatology, vol. 1, pp. 237–240, 1998.View at: Google Scholar
T. Prokaeva, Z. Alekberova, T. Reshetnjak et al., “Evaluation of Behcet's disease diagnosis criteria: study from Russia,” in Behcet's Disease, D. Bang, E. Lee, and S. Lee, Eds., pp. 598–603, Design Mecca Publishing, Seoul, Korea, 2000.View at: Google Scholar
K. T. Calamia and F. Davatchi, “Sensitivity of diagnosis criteria in United States patients with Behcet's disease,” in Behcet's Disease, D. Bang, E. Lee, and S. Lee, Eds., pp. 121–124, Design Mecca Publishing, Seoul, Korea, 2000.View at: Google Scholar
F. Davatchi, F. Shahram, M. Akbarian et al., “Behcet's disease-analysisof 3443 cases,” APLAR Journal of Rheumatology, vol. 1, pp. 2–5, 1997.View at: Google Scholar
P. Mansoori, C. Chams, F. Davatchi et al., “Pathergy phenomenon in Behcet's disease, new aspects,” in Rheumatology APLAR 1992, A. R. Nasution, J. Darmawan, and H. Isbagio, Eds., pp. 111–113, Churchill Livingstone, Tokyo, Japan, 1992.View at: Google Scholar
C. Chams-Davatchi, F. Davatchi, F. Shahram et al., “Longitudinal study of the Pathergy phenomenon in Behcet's disease,” in Behcet's Disease, M. Hamza, Ed., pp. 356–358, Pub Adhoua, Tunisia, 1992.View at: Google Scholar
A. Altenburg, N. G. Bonitsis, N. Papoutsis, M. Pasak, L. Krause, and C. C. Zouboulis, “Evaluation of diagnostic criteria including ICBD (2006) in Adamantiades-Behcet's disease patients in Germany,” Clinical Experimental Rheumatology, vol. 26, supplement 50, p. S3, 2008.View at: Google Scholar
Z. Zhang, W. Zhou, Y. Hao, Y. Wang, and Y. Dong, “Validation of the International criteria for Behcet's disease (ICBD) in China,” Clinical Experimental Rheumatology, vol. 26, supplement 50, pp. S6–S7, 2008.View at: Google Scholar
F. Davatchi, B. Sadeghi Abdollahi, and F. Shahram, “Validation of the international criteria for Behcet's dDisease in Iran,” International Journal of Rheumatic Disases, vol. 13, pp. 55–60, 2010.View at: Google Scholar
International Study Group for Behcet's Disease, “Evaluation of diagnostic (“Classification”) criteria in Behcet's disease: toward internationally agreed criteria,” in Behcet's Disease Basic and Clinical Aspects, J. D. O'Duffy and E. Kokmen, Eds., pp. 11–39, Marcel decker, New York, NY, USA, 1991.View at: Google Scholar
M. Szklo and F. J. Neto, Epidemiology, Beyond the Basics, Jones and Bartlett, Sudbury, Mass, USA, 2nd edition, 2007.
B. R. Kirshwood and J. A. S. C. Sterne, Essential Medical Statistics, Blackwell Science LTD, Malden, Mass, USA, 2nd edition, 2003.
W. J. Youden, “Index for rating diagnostic tests,” Cancer, vol. 3, no. 1, pp. 32–35, 1950.View at: Google Scholar