PET-CT for assessing mediastinal lymph node involvement in patients with suspected resectable non-small cell lung cancer

Mia Schmidt-Hansen*, David R. Baldwin, Elise Hasler, Javier Zamora, Víctor Abraira, Marta Roquéi Figuls

*Corresponding author for this work

Research output: Contribution to journalReview articlepeer-review

161 Citations (Scopus)

Abstract

Background: A major determinant of treatment offered to patients with non-small cell lung cancer (NSCLC) is their intrathoracic (mediastinal) nodal status. If the disease has not spread to the ipsilateral mediastinal nodes, subcarinal (N2) nodes, or both, and the patient is otherwise considered fit for surgery, resection is often the treatment of choice. Planning the optimal treatment is therefore critically dependent on accurate staging of the disease. PET-CT (positron emission tomography-computed tomography) is a non-invasive staging method of the mediastinum, which is increasingly available and used by lung cancer multidisciplinary teams. Although the non-invasive nature of PET-CT constitutes one of its major advantages, PET-CT may be suboptimal in detecting malignancy in normal-sized lymph nodes and in ruling out malignancy in patients with coexisting inflammatory or infectious diseases. Objectives: To determine the diagnostic accuracy of integrated PET-CT for mediastinal staging of patients with suspected or confirmed NSCLC that is potentially suitable for treatment with curative intent. Search methods: We searched the following databases up to 30 April 2013: The Cochrane Library, MEDLINE via OvidSP (from 1946), Embase via OvidSP (from 1974), PreMEDLINE via OvidSP, OpenGrey, ProQuest Dissertations & Theses, and the trials register www.clinicaltrials.gov. There were no language or publication status restrictions on the search. We also contacted researchers in the field, checked reference lists, and conducted citation searches (with an end-date of 9 July 2013) of relevant studies. Selection criteria: Prospective or retrospective cross-sectional studies that assessed the diagnostic accuracy of integrated PET-CT for diagnosing N2 disease in patients with suspected resectable NSCLC. The studies must have used pathology as the reference standard and reported participants as the unit of analysis. Data collection and analysis: Two authors independently extracted data pertaining to the study characteristics and the number of true and false positives and true and false negatives for the index test, and they independently assessed the quality of the included studies using QUADAS-2. We calculated sensitivity and specificity with 95% confidence intervals (CI) for each study and performed two main analyses based on the criteria for test positivity employed: Activity > background or SUVmax = 2.5 (SUVmax = maximum standardised uptake value), where we fitted a summary receiver operating characteristic (ROC) curve using a hierarchical summary ROC (HSROC) model for each subset of studies. We identified the average operating point on the SROC curve and computed the average sensitivities and specificities. We checked for heterogeneity and examined the robustness of the meta-analyses through sensitivity analyses. Main results: We included 45 studies, and based on the criteria for PET-CT positivity, we categorised the included studies into three groups: Activity > background (18 studies, N = 2823, prevalence of N2 and N3 nodes = 679/2328), SUVmax = 2.5 (12 studies, N = 1656, prevalence of N2 and N3 nodes = 465/1656), and Other/mixed (15 studies, N = 1616, prevalence of N2 to N3 nodes = 400/1616). None of the studies reported (any) adverse events. Under-reporting generally hampered the quality assessment of the studies, and in 30/45 studies, the applicability of the study populations was of high or unclear concern. The summary sensitivity and specificity estimates for the 'Activity > background PET-CT positivity criterion were 77.4% (95% CI 65.3 to 86.1) and 90.1% (95% CI 85.3 to 93.5), respectively, but the accuracy estimates of these studies in ROC space showed a wide prediction region. This indicated high between-study heterogeneity and a relatively large 95% confidence region around the summary value of sensitivity and specificity, denoting a lack of precision. Sensitivity analyses suggested that the overall estimate of sensitivity was especially susceptible to selection bias; reference standard bias; clear definition of test positivity; and to a lesser extent, index test bias and commercial funding bias, with lower combined estimates of sensitivity observed for all the low 'Risk of bias' studies compared with the full analysis. The summary sensitivity and specificity estimates for the SUVmax = 2.5 PET-CT positivity criterion were 81.3% (95% CI 70.2 to 88.9) and 79.4% (95% CI 70 to 86.5), respectively.In this group, the accuracy estimates of these studies in ROC space also showed a very wide prediction region. This indicated very high between-study heterogeneity, and there was a relatively large 95% confidence region around the summary value of sensitivity and specificity, denoting a clear lack of precision. Sensitivity analyses suggested that both overall accuracy estimates were marginally sensitive to flow and timing bias and commercial funding bias, which both lead to slightly lower estimates of sensitivity and specificity. Heterogeneity analyses showed that the accuracy estimates were significantly influenced by country of study origin, percentage of participants with adenocarcinoma, (18F)-2-fluoro-deoxy-D-glucose (FDG) dose, type of PET-CT scanner, and study size, but not by study design, consecutive recruitment, attenuation correction, year of publication, or tuberculosis incidence rate per 100,000 population. Authors' conclusions: This review has shown that accuracy of PET-CT is insufficient to allow management based on PET-CT alone. The findings therefore support National Institute for Health and Care (formally 'clinical') Excellence (NICE) guidance on this topic, where PET-CT is used to guide clinicians in the next step: either a biopsy or where negative and nodes are small, directly to surgery. The apparent difference between the two main makes of PET-CT scanner is important and may influence the treatment decision in some circumstances. The differences in PET-CT accuracy estimates between scanner makes, NSCLC subtypes, FDG dose, and country of study origin, along with the general variability of results, suggest that all large centres should actively monitor their accuracy. This is so that they can make reliable decisions based on their own results and identify the populations in which PET-CT is of most use or potentially little value.

Original languageEnglish
Article numberCD009519
JournalCochrane Database of Systematic Reviews
Volume2014
Issue number11
DOIs
Publication statusPublished - 13 Nov 2014

Bibliographical note

Funding Information:
Originally, 170 participants were randomised to the PET-CT group. Of those, 21 participants were excluded from the analyses (1 participant refused study investigations after randomisation, and 20 participants did not receive the reference standard) This study was supported by the Ontario Ministry of Health and Long-term Care (grant 06126) and the Canadian Institutes of Health Research (grant MCT-78777) and by Cancer Care Ontario Adverse events: none reported

Funding Information:
From a tuberculosis-endemic area The Research Grant of Korea Foundation for Cancer Research (grant number CB-2011-02-01) and the Basic Science Research Program through the National Research Foundation of Korea, funded by the Ministry of Education, Science and Technology (grant number 2011-0022379), supported the study Adverse events: not reported

Funding Information:
All participants were accounted for in the results. All participants received the reference standard. There were no uninterpretable results Supported by the Research Fund of Shandong Provincial Health Bureau of China (grant 2009HZ088) and by the Research Fund of Shandong Cancer Hospital and Institute (no. 2009 to 11) Adverse events: not reported

Funding Information:
N stage was determined in 150/165 participants, with thoracotomy results in 118, mediastinoscopy results in 26, and supraclavicular nodal biopsy results in 6 N-stage was available for 150/165 participants. It was unclear why N-stage is not available for the remaining 15 participants This study was supported by Samsung Medical Center Clinical Research Development Program grant #CRDP CRS 106-41-2. The authors stated that they have no financial relationship to disclose Adverse events: not reported

Funding Information:
This work was supported by a National Cancer Center grant. The authors reported to the American College of Chest Physicians that no significant conflicts of interest exist with any companies/ organisations whose products or services may be discussed in their article Adverse events: not reported

Funding Information:
The study was supported by the National Science and Technology Major Projects (2009ZX09501-026), Doctoral Fund of Ministry of Education of China (20070023041) and Beijing Hope Run Special Fund (grant LC2007A02) Adverse events: None reported

Funding Information:
This study was supported by Research Fund of Shandong Provincial Health Bureau, Shandong Province, China Adverse events: not reported

Funding Information:
The 21st Century COE program ’Biomedical Imaging Technology Integration Program’ funded by the Japan Society for the Promotion of Science supported the study Adverse events: none reported

Funding Information:
The study was supported partly by a Grant-in-Aid for Scientific Research from the Ministry of Education, Culture, Sport, Science and Technology, Japan (21591828) Adverse events: not reported

Funding Information:
This study was supported by grant NSC-97-2314-B-182A-101-MY3 from the National Science Council of Taiwan Adverse events: none reported Node SUV cut-off 3.15: TP = 13, FN = 6, FP = 20, TN = 63 Node/aorta SUV ratio cut-off 1.37: TP = 17, FN = 2, FP = 33, TN = 50 Node/liver SUV ratio cut-off 1.02: TP = 16, FN = 3, FP = 28, TN = 55 It was the visual interpretation results currently included under test data

Funding Information:
≥ 4 for qualitative assessment and SUVmax ≥ 2 for quantitative assessment (Tables 1 and 2 and Figures 3 and 4). These cut-off values were not prespecified Pathologic findings in resected specimens. Cervical mediastinoscopy (supplemented by left anterior mediastinotomy if the tumour was in the left upper lobe) if CT findings suggested invasion of the superior mediastinum or enlarged nodes plus complete standard mediastinal nodal sampling (N = 23) or thoracotomy plus systematic ipsilateral hilar and mediastinal sampling at the locations specified by the regional lymph node classifications of the American Joint Committee on Cancer and the Union for International Cancer Control (N = 227) All participants were accounted for in the results. All participants received the reference standard. There were no uninterpretable results Philips Healthcare and a grant-in-aid for scientific research from the Japanese Ministry of Education, Culture, Sports, Science and Technology provided the study with financial, technical support, or both. None of the authors was an employee of Philips Healthcare, and the authors had full control over the data for the duration of this study 2 2-by-2s (qualitative and quantitative measurement): qualitative: TP = 14, FP = 0, FN = 7, TN = 229 quantitative: TP = 16, FP = 0, FN = 5, TN = 229 Adverse events: none reported

Publisher Copyright:
© 2014 The Cochrane Collaboration.

ASJC Scopus subject areas

  • Pharmacology (medical)

Fingerprint

Dive into the research topics of 'PET-CT for assessing mediastinal lymph node involvement in patients with suspected resectable non-small cell lung cancer'. Together they form a unique fingerprint.

Cite this