Abstract

Objective. We performed a systematic review and network meta-analysis (NMA) to compare the diagnostic value of seven different imaging modalities for the detection of neuroblastic tumors in diverse clinical settings. Methods. PubMed, Embase, Medline, and the Cochrane Library were searched to identify eligible studies from inception to Sep 29, 2020. Quality assessment of included studies was appraised with Quality Assessment of Diagnostic Accuracy Studies. Firstly, direct pairwise meta-analysis was conducted to calculate the pooled estimates of odds ratio (OR) and 95% confidence interval (CI) of the sensitivity, specificity, NPV, PPV, and DR. Next, NMA using Bayesian methods was performed. The superiority index was assessed to quantify the rank probability of a diagnostic test. The studies performed SPECT/CT or SPECT were analyzed separately from the ones only performed planar imaging. Results. A total of 1135 patients from 32 studies, including 7 different imaging modalities, were eligible for this NMA. In the pairwise meta-analysis, 18F-FDOPA PET/CT had a relatively high value of all the outcomes (sensitivity: 10.195 [5.332–19.493]; specificity: 17.906 [5.950–53.884]; NPV: 16.819 [7.033–40.218]; PPV: 11.154 [4.216–29.512]; and DR 5.616 [3.609–8.739]). In the NMA, 18F-FDOPA PET/CT exhibited relatively high sensitivity in all subgroups (all data: 0.94 [0.87–0.98]; primary tumor: 0.89 [0.53–1]; bone/bone marrow metastases: 0.96 [0.83–1]; and primary tumor and metastases (P + M): 0.92 [0.80–0.97]), the highest specificity in the subgroup of P + M (0.85 [0.61–0.97]), and achieved the highest superiority index in the subgroups of all data (8.57 [1–15]) and P + M (7.25 [1–13]). Conclusion. 18F-FDOPA PET/CT exhibited the best diagnostic performance in the comprehensive detection of primary tumor and metastases for neuroblastic tumors, followed by 68Ga-somatostatin analogs, 123I-meta-iodobenzylguanidine (MIBG), 18F-FDG, and 131I-MIBG tomographic imaging.

1. Introduction

Neuroblastic tumors (NTs) are the most common extracranial solid tumors of children, which are derived from the primitive neural crest. NTs include neuroblastoma, ganglioneuroblastoma, and ganglioneuroma. Nearly 48% of neuroblastomas present with metastasis at the time of diagnosis [1, 2]. Therefore, accurate identification of all lesions is of importance for staging and establishing therapy protocol [3].

Imaging, especially nuclear medicine functional imaging, plays an indispensable role in the diagnosis, staging, surgical planning, response assessment, and follow-up of NTs. Since neuroblastic tumor cells specifically express the noradrenaline transporter, iodine radioisotope-labelled meta-iodobenzylguanidine (MIBG), a noradrenaline analogue, becomes an ideal tracer for imaging of the tumor lesions. MIBG was labelled with 131I at the beginning. Nowadays, 123I-labelled MIBG (123I-MIBG) is the mainstay of radiopharmaceutical in the diagnosis and management of NTs. Considering of the limitation in small lesions and prolonged acquisition time (24–48 hours) of the MIBG scan, positron emission tomography (PET) imaging is increasingly being applied in current clinical practice. In particular, when the tumor uptake of MIBG is weak or negative, 18F-fluorodeoxyglucose (FDG) PET imaging is recommended as a second-line imaging by the International Neuroblastoma Risk Group (INRG) guidelines [3] and the European Association of Nuclear Medicine (EANM) 2018 guidelines [2]. Various PET tracers have been utilized for imaging in neuroblastoma patients, including the metabolic compounds such as 18F-FDG and L-3,4-dihydroxy-6-18F-fluorophenylalanine (FDOPA), as well as the receptor-mediated compounds such as 68Ga-DOTA peptides and somatostatin analogues (SSAs) [46]. Traditional imaging, computed tomography (CT) and magnetic resonance imaging (MRI), also has an essential role in the staging and evaluation of surgical risks for the disease [7].

In the EANM 2018 guidelines for neuroblastoma, discussion is still ongoing on the effectiveness of various imaging modalities and the applicability of different tracers in diverse clinical settings. An increasing number of studies reported the utility of different imaging in the diagnosis of NTs. However, considering the radiation burden and imaging acquisition inconvenience to pediatric patients, head-to-head studies are few and most of them are of small sample sizes. Although there have been a few previous meta-analyses [810] of the diagnostic value of imaging modalities for NTs, there were some limitations in their data grouping. Moreover, all of them were conventional meta-analysis that evaluated one single imaging technique or simply compared two imaging modalities. Network meta-analysis (NMA) extends conventional meta-analysis, which is a novel synthesis of evidence. In contrast to the conventional pairwise meta-analysis, NMA draws together evidence from both direct and indirect comparison of multiple tests simultaneously [11]. NMA can calculate the effect size and quantify the rank probability of each diagnostic test between groups through indirect study comparison, even if there is no direct head-to-head study. Moreover, NMA with an arm-based model provides more natural variance-covariance matrix structures which make it more appropriate than the traditional meta-analysis. Therefore, we conducted an NMA and comprehensive systematic review to directly and indirectly compare the diagnostic value of all enrolled imaging modalities in NTs.

2. Materials and Methods

This systematic literature review and meta-analysis was performed according to the “Preferred Reporting Items for Systematic Reviews and Meta-Analyses” (PRISMA) guidelines and was registered in PROSPERO (CRD42020206862).

2.1. Search Strategy

A systematic literature search was conducted based on the Population, Interventions, Comparator, Outcome, and Study design (PICOS) principle. PubMed, the Cochrane Library, Embase, and Medline were searched from inception to Sep 29, 2020.

2.2. Inclusion and Exclusion Criteria

Inclusion criteria were as follows: (1) population: patients diagnosed with NTs, (2) intervention: any type of imaging modality was performed, (3) comparator: compared to each other, (4) outcomes: sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV), and (5) study type: diagnostic accuracy study.

Exclusion criteria were as follows: (a) case reports, reviews, meeting abstracts, or comments; (b) non-human studies or non-English articles; (c) sample size < 10; (d) lack of essential data, including true positive (TP), true negative (TN), false positive (FP), and false negative (FN) values; (e) study enrolling recurrent or refractory patients, and (f) overlapping patients reported. Once articles enrolling overlapping patients were identified, the recently published article with more patients was included.

2.3. Data Extraction

Two researchers independently performed the literature searching, screening, and data extracting. The differences were discussed until reaching consensus. The following information was collected: basic information of studies (first author, publication year, study period, original country, study design, and follow-up time), patient characteristics (sample size, age, and gender), type of lesions (P, primary tumor; BM, bone and bone marrow metastases; and P + M, primary tumor and metastases), type of imaging modality, standard reference, and raw diagnostic data (TP, FP, TN, and FN).

2.4. Quality Assessment

The quality of enrolled studies was appraised with Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) [12] by two authors. Discrepancies between the authors were resolved by discussion. The QUADAS-2 includes four domains: patient selection, reference standard, index text, and flow and timing. Each domain is assessed in terms of risk of bias, and the first 3 domains are also appraised in terms of concerns regarding applicability. RevMan (Version 5.3.5, the Cochrane Collaboration, Oxford, UK) was used to conduct the assessment.

2.5. Statistical Analysis and Data Synthesis

Traditional pairwise meta-analysis was conducted to calculate the pooled estimates of odds ratio (OR) and 95% confidence interval (CI) of sensitivity, specificity, NPV, PPV, and detection rate (DR) of various imaging modalities. Heterogeneity was assessed by the χ2 test and I2 statistics. A fixed-effect model would be applied if and/or I2< 50%. Otherwise, a random-effect model would be conducted. Subgroup analyses were conducted based on diverse clinical settings. The publication bias was assessed by Deeks’ funnel plot asymmetry test. Traditional meta-analyses were performed using STATA (version 15.0, StataCorp, College Station, TX).

Next, the evidence network structure was performed with package gemtc (v 0.8–8) in R software (Version 4.0.3, Comprehensive R Archive Network). Each node stands for a different diagnostic test, and the thickness of lines between nodes represents the number of studies that directly compared the two tests. Then, Bayesian NMA was performed by an arm-based model, which was developed by Nyaga et al. [13]. We run three chains in parallel until there was convergence. Trace plots were applied to visually check whether the distributions of the three simulated chains were mixed properly and were stationary. The superiority index [13] was estimated to quantify the rank probability of a diagnostic test. The larger the superiority index was, the more accurately a test was expected to predict the targeted condition compared to other screening tests. A two-sided value  < 0.05 was considered statistically significant in all statistical tests. All NMA were performed using R software, package rstan (v 2.21.2), package loo (v 2.3.1), and package plyr (v 1.8.6).

3. Results

3.1. Literature Search Results

A total of 1,094 studies were initially retrieved. After excluding irrelevant articles (n = 826) and duplicated records (n = 38), the remaining 230 studies were further assessed. A total of 51 studies were evaluated for eligibility by full-text review, after excluding non-English articles (n = 24), non-human studies (n = 24), irrelevant studies (n = 73), reviews (n = 31), cases (n = 14), meeting abstracts or comments (n = 4), and the studies with incomplete data (n = 9). After full-text review, irrelevant studies (n = 2), studies with insufficient data (n = 8) or ineligible reference standard (n = 3), studies focusing on recurrent or refractory patients (n = 2), and inaccessible full text (n = 4) were ruled out. Finally, thirty-two diagnostic studies [1445] met the inclusion criteria (Figure 1).

3.2. Characteristics of Included Studies and Quality Assessment

A total of 1135 patients from 32 studies, including 7 different imaging modalities, were eligible for this NMA. Nine (28.1%) [14, 15, 23, 25, 28, 30, 31, 35, 39] of 32 studies were prospective design. Nineteen studies [14, 15, 17, 18, 20-24, 26, 29, 31, 36, 3942, 44, 45] included at least two tests (imaging methods), and 14 (73.7%) of them were head-to-head studies. Twenty (62.5%) [15, 18, 2026, 29-31, 33, 34, 36, 40-43, 45] of 32 studies investigated 123I-MIBG imaging, nine (28.1%) studies [14, 17, 26, 27, 36, 37, 39, 42, 44] focused on 131I-MIBG, eleven (34.4%) studies [16-20, 23, 24, 3840, 42] performed 18F-FDG-PET or PET/CT, three (9.4%) studies [15, 21, 22] addressed 18F-FDOPA, six (18.8%) studies [18, 21, 24, 29, 44, 45] evaluated CT or MRI, two (6.3%) studies [14, 41] assessed 68Ga- SSAs, and one (3.1%) study [31] inquired into 111In-pentetreotide. The remaining four (12.5%) studies [33, 35, 36, 45] mixed 131I-MIBG and 123I-MIBG together, and all of them acquired planar imaging only. All 18F-FDOPA studies and one of the 68Ga-SSAs [14] acquired imaging with an integrated PET/CT scanner. Among the FDG studies, ten of them conducted with PET/CT, while the remaining one [24] performed with a dedicated PET scanner. Among 123I-MIBG studies, 7 studies only performed planar imaging, 8 studies acquired single-photon-emission computed tomography (SPECT) imaging, and 5 studies applied SPECT/CT. In the nine 131I-MIBG studies, only 2 studies performed with the SPECT/CT scanner, whereas the other 7 studies acquired planar imaging. In the six CT or MRI studies, 3 studies conducted with MRI, 2 studies combined the results of CT and MRI together, and the last one performed CT. The study focused on 111In-pentetreotide-only-acquired planar imaging. The principal characteristics of all included studies were displayed in Table 1. QUADAS-2 score results from each study are presented in Figure 2.

3.3. Outcome of Pairwise Meta-Analysis

A direct paired comparison of the 7 different imaging modalities of the diagnostic value for NTs was performed. The estimated OR and 95% CI of the sensitivity, specificity, NPV, PPV, and DR are summarized in Table 2. In both sensitivity and specificity, 18F-FDOPA had a relatively good performance of all outcomes (sensitivity: 10.195 [5.332–19.493]; specificity: 17.906 [5.950–53.884]; NPV: 16.819 [7.033–40.218]; PPV: 11.154 [4.216–29.512]; and DR 5.616 [3.609–8.739]).

When stratified according to clinical settings, 123I-MIBG imaging showed the highest DR for the diagnosis of primary tumor (9.486 [3.484–25.826]). For the detection of bone and bone marrow metastases, CT or MRI exhibited the highest DR (7.170 [4.503–11.415]), followed by 18F-FDOPA (5.283 [2.325–12.005]). Regarding the detection of primary tumor and metastases (P + M), 131I-MIBG exhibited the highest DR (22.563 [8.020–63.480]) and 18F-FDOPA had the second highest DR (5.943 [3.480–10.149]) (Table 3).

3.4. Evidence Network

The evidence network included seven imaging modalities. The result revealed the number of studies investigating 123I-MIBG was the highest. Studies comparing 123I-MIBG with 18F-FDG were the most, followed by 131I-MIBG, CT or MRI and 18F-FDOPA imaging (Figure 3).

3.5. Outcomes of Network Meta-Analysis and Ranking of Diagnostic Tests

The results of the NMA for the seven imaging methods are shown in Figure 4. 18F-FDOPA exhibited relatively high sensitivity in all subgroups (all data: 0.94 [0.87–0.98]; primary tumor: 0.89 [0.53–1]; BM: 0.96 [0.83–1]; and P + M: 0.92 [0.80–0.97]) and the highest specificity in the subgroup of P + M (0.85 [0.61–0.97]). 131I-MIBG exhibited the highest specificity in the subgroups of all data (0.93 [0.79–0.99]), primary tumor (0.87 [0.61–0.99]), and BM (0.93 [0.63–1]). According to the superiority index, 18F-FDOPA achieved the highest value in the subgroups of all data (8.57 [1–15]) and P + M (7.25 [1–15]). 131I- and 123I-MIBG subgroup had the highest superiority index in the diagnosis of primary tumor (6.06 [0.33–15]). 123I-MIBG ranked the highest superiority index in the subgroup of BM (6.27 [1–15]).

After removing the studies only conducted with planar imaging and traditional imaging (Figure 5), 18F-FDOPA PET/CT achieved the highest sensitivity (0.94 [0.87–0.98]) and superiority index (5.06 [1–9]), and 68Ga-SSAs PET or PET/CT reached the highest specificity (0.89 [0.42–1]).

3.6. Publication Bias

The results of assessment of publication bias demonstrated symmetrical distribution in all subgroups except the tomographic imaging subgroup (). Deeks’ funnel plots were presented in the supplementary materials (Figures S15)

4. Discussion

The current NMA reveals that 18F-FDOPA PET/CT should be the imaging modality of choice for the comprehensive detection of primary tumor and metastases in NTs. 123I-MIBG SPECT or SPECT/CT present satisfactory performance for the diagnosis of both primary tumor and BM.

The EANM 2018 guidelines focus on SPECT (123MIBG) and PET (PET/CT with 18F-FDG, 18F-FDOPA, and 68Ga-DOTA peptides) tracers currently used in clinical practice. These guidelines presented general indications, advantages, and limitations along with recommendations on imaging protocols, interpretation of findings, and reporting results for nuclear medicine imaging in neuroblastoma. However, discussion regarding the clinical settings that may benefit most from the use of one tracer over the others is still ongoing. It is well known that the accumulation and decarboxylation of L-DOPA in neuroendocrine tumors (NETs) make it an excellent tracer for catecholamine metabolism in NETs, including NTs. 18F-FDOPA has already been applied in the diagnosis of pheochromocytoma (PCCs) [46, 47] and recommended as a first-line PET/CT tracer for the detection of medullary thyroid carcinoma [48]. The EANM 2018 guidelines also suggested 18F-FDOPA may currently be the best PET tracer alternative to 123I-MIBG for the assessment of NTs [2]. It showed a remarkable performance in the diagnosis of neuroblastoma in the current meta-analysis as well as other existing research [15, 22, 49]. In the present NMA, 18F-FDOPA PET/CT exhibited relatively higher sensitivity in all clinical settings, the highest sensitivity, and specificity in the subgroup of P + M, which ranked the first according to superiority index. Therefore, 18F-FDOPA PET/CT may become a promising diagnostic tool for neuroblastoma in the future. 68Ga-SSAs can bind to specific somatostatin receptors on the cell surface of NETs, which is also an imaging tracer of choice in NETs [50]. 68Ga-SSAs PET or PET/CT ranked the second in the diagnostic value of NTs. Nevertheless, there are only 2 studies focused on this radiopharmaceutical. The study from Pezhman [14] suggested 68Ga-SSAs PET/CT was superior to 131I-MIBG SPECT/CT in providing valuable information for both primary staging and follow-up in patients with neural crest tumors, including NTs and PCCs. Another study [41] did not enroll TN patients, and the specificity cannot be evaluated. So, the results should be interpreted cautiously. Interestingly, both 18F-FDOPA and 68Ga-SSAs are relatively new tracers for NETs, and a number of studies which did comparison between them in the detection of neuroendocrine tumors and other diseases have been reported [51, 52]. However, the study that head to head compares 18F-FDOPA and 68Ga-SSAs in patients with neuroblastic tumors has not been found. Thus, further study is expected.

123I-MIBG imaging, the most commonly utilized molecular imaging modality for the identification and prognostic evaluation of neuroblastoma, exhibited a relatively high value in both sensitivity and specificity for the detection of primary tumor and metastases. In particular, 123I-MIBG achieved the highest superiority index in the detection of BM. After excluding the studies only performed with planar imaging, 123I-MIBG SPECT or SPECT/CT exhibited a higher superiority index compared with 18F-FDG-PET or PET/CT and 131I-MIBG SPECT or SPECT/CT, which is consistent with the EANM guidelines of 2018 [2]. Thus, we propose that it is highly necessary to perform 123I-MIBG imaging with SPECT/CT.

The current NMA showed that 18F-FDG was inferior to 18F-FDOPA, 123I-MIBG, or 131I-MIBG in the separate evaluation of both primary tumor and BM. 18F-FDG, a glucose analogue, concentrates at lesions with increased glucose metabolism, including most tumors and inflammation or infection. Therefore 18F-FDG is less specific for NTs than MIBG or FDOPA. Notably, 18F-FDG displayed a higher superiority index compared with MIBG when it was employed to comprehensively evaluate the primary tumor and metastases in the whole body. That may be related with the higher resolution of PET than SPECT. The study of Henriette [24] reported the false negative results of 123I-MIBG were due to small lesion size (mean lesion diameter 1.7 cm) and low uptake. Combined with our results, it suggested that 18F-FDG could play a complementary role of MIBG when the lesion is small or non-MIBG avid. Additionally, beyond lesion recognition,18F-FDG may be helpful in tumor staging, treatment evaluation, and prognostic assessment of neuroblastoma [40, 42, 53]. Higher FDG uptake was observed in patients with higher-stage MYCN amplification [54] or advanced stage [55]. Maximum standardized uptake value (SUVmax) was all significantly higher in patients with worse overall survival [54].

131I-MIBG imaging exhibited moderate diagnostic characteristics based on the superiority index. It may be limited by the unfavorable imaging characteristics of the isotope 131I. Nowadays, considering higher radiation dose of 131I-MIBG compared with 123I-MIBG, many studies [4, 56] recommend that diagnostic 131I-MIBG was indicated only when 123I-MIBG is unavailable or 131I-MIBG therapy is contemplated. Preliminary studies of 124I-MIBG [57, 58] as well as 18F-meta-fluorobenzylguanidine (18F-MFBG) [59, 60] and MIBG variants [61] (18F-fluoropropylbenzylguanidine, 18F-FPBG) are ongoing. Those imaging agents are proved to improve image quality and demonstrate promising performance in the diagnosis of NTs.

CT and MR are widely available and routinely used in clinical practices, whereas they only showed moderate sensitivity and low specificity in the detection of BM in the NMA. The high incidence of false positive findings was probably related with the fact that the traditional imaging modality cannot distinguish posttherapy bone marrow changes from residual tumor. In recent years, increasing studies suggested that whole-body “diffusion-weighted imaging with background body signal suppression” is feasible for assessment of the primary lesions [62, 63] and lymph node metastases [18] of NTs. However, according to our results, it should be carefully used in NT patients due to its low specificity in the identification of skeleton lesions. Regarding the diagnostic value of 111In-pentetreotide, only one study [31] was included, and the data of this study are incomplete. Therefore, more high-quality studies are expected.

Three previous meta-analyses of the diagnostic value of different imaging modalities for NTs were identified. All of them evaluated one single imaging technique [8, 9] or simply compared two imaging modalities [10]. Moreover, the studies conducted with SPECT/CT and planar imaging were not analyzed separately in the meta-analysis from the work of Jia X et al. [10], which was irrational. Because there is a huge difference between planar imaging and SPECT/CT in imaging resolution, there is little comparability between them. Li et al. [9] attempted to assess the diagnostic accuracy of PET(CT) in patients with neuroblastoma in their meta-analysis, but they did not calculate the data of FDG-PET(CT) and FDOPA PET(CT) separately. Therefore, the reference value of this study for clinical practice is fairly limited.

This NMA has several limitations. Firstly, subgroup analyses were not conducted based on lesion-based analysis versus patient-/scan-based analysis because in the subgroup of primary tumor, all included studies were patient-/scan-based analysis. In the other two subgroups, most of the enrolled studies were lesion-based analysis, and patient-/scan-based analyses were performed in only 9 studies for the evaluation of 6 imaging modalities. Secondly, due to the small number of included studies for each imaging modality, subgroup analyses were not performed according to other variables such as study design, patients’ baseline characteristics, interval between injection and acquisition, and other imaging protocols. Thirdly, CT and MR were not analyzed separately. In the included studies, half (3/6) of the included studies mixed CT and MR together to compare with nuclear medicine imaging. Only two studies investigated the performance of MR, and just one study reported the data of CT. Finally, the estimated specificity of several groups displayed a wide range of 95% credible intervals (0–1), such as 18F-FDOPA in the BM and P subgroups, 111In-pentetreotide in the P subgroup, and 68Ga-SSAs and 131I-MIBG in the P + M subgroup. This demonstrates that the specificity was unavailable because some studies did not enroll TN cases. Further direct comparative studies with standardized data would be necessary.

5. Conclusions

In conclusion, 18F-FDOPA PET or PET/CT exhibited the best diagnostic performance in the comprehensive detection of primary tumor and metastases in NTs, followed by 68Ga-SSAs, 123I-MIBG, 18F-FDG, and 131I-MIBG tomographic imaging. 123I-MIBG SPECT or SPECT/CT present satisfactory performance for the diagnosis of both primary tumor and BM. Further comparative studies with standardized data are expected in the future.

Abbreviations

NTs:Neuroblastic tumors
PET/CT:Positron-emission tomography/computed tomography
SPECT/CT:Single-photon-emission computed tomography/ computed tomography
CT:Computed tomography
MRI:Magnetic resonance imaging
18F-FDG:18F-fluorodeoxyglucose
MIBG:Meta-iodobenzylguanidine
FDOPA:L-3,4-dihydroxy-6-18F-fluorophenylalanine
SSAs:Somatostatin analogues
NMA:Network meta-analysis
INRG:International Neuroblastoma Risk Group
EANM:European Association of Nuclear Medicine
PPV:Positive predictive value
NPV:Negative predictive value
DR:Detection rate
TP:True positive
TN:True negative
FP:False positive
FN:False negative
OR:Odds ratio
CI:Confidence interval
PRISMA:Preferred Reporting Items for Systematic Reviews and Meta-Analyses
QUADAS-2:Quality Assessment of Diagnostic Accuracy Studies
P:Primary tumor
BM:Bone and bone marrow metastases
P + M:Primary tumor and metastases
NETs:Neuroendocrine tumors
PCCs:Pheochromocytoma
18F-MFBG:18F-meta-fluorobenzylguanidine
18F-FPBG:18F-fluoropropylbenzylguanidine.

Data Availability

The data used in this study are available on reasonable request to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Acknowledgments

Jigang Yang was supported by the National Key R&D Program of China (2020YFC0122004), National Natural Science Foundation of China (Nos. 81971642 and 81771860), and Beijing Natural Science Foundation (No. 7192041). The authors thank Jiajian Qin (PR China) for his help with code editing and modifications.

Supplementary Materials

The results of publication bias of the subgroup of all data (Figure S1), BM (Figure S2), P (Figure S3), P + M (Figure S4), and tomographic imaging (Figure S5) are given. (Supplementary Materials)