Abstract

Background. The lung is one of the most common sites of metastasis in gastric cancer. Our study developed two nomograms to achieve individualized prediction of overall survival (OS) and cancer-specific survival (CSS) in patients with gastric cancer and lung metastasis (GCLM) to better guide follow-up and planning of subsequent treatment. Methods. We reviewed data of patients diagnosed with GCLM in the Surveillance, Epidemiology, and End Results (SEER) database from 2010 to 2015. The endpoints of the study were the OS and CSS. We used the “caret” package to randomly divide patients into training and validation cohorts in a 7 : 3 ratio. Multivariate Cox regression analysis was performed using univariate Cox regression analysis to confirm the independent prognostic factors. Afterward, we built the OS and CSS nomograms with the “rms” package. Subsequently, we evaluated the two nomograms through calibration curves, receiver operating characteristic (ROC) curves, and decision curve analysis (DCA). Finally, two web-based nomograms were built on the basis of effective nomograms. Results. The OS analysis included 640 patients, and the results of the multivariate Cox regression analysis showed that grade, chemotherapy, and liver metastasis were independent prognostic factors for patients with GCLM. The CSS analysis included 524 patients, and the results of the multivariate Cox regression analysis showed that the independent prognostic factors for patients with GCLM were chemotherapy, liver metastasis, marital status, and tumor site. The ROC curves, calibration curves, and DCA revealed favorable predictive power in the OS and CSS nomograms. We created web-based nomograms for OS (https://zhenghh.shinyapps.io/aclmos/) and CSS (https://zhenghh.shinyapps.io/aslmcss/). Conclusions. We created two web-based nomograms to predict OS and CSS in patients with GCLM. Both web-based nomograms had satisfactory accuracy and clinical usefulness and may help clinicians make individualized treatment decisions for patients.

1. Introduction

Gastric cancer (GC) is one of the most common malignant tumors of the gastrointestinal tract, accounting for the third and fifth causes of cancer deaths in men and women worldwide, respectively [1]. According to the 2018 Global Cancer Center statistics [1], there were approximately one million new cases of GC and approximately 780,000 GC-related deaths worldwide. Although radical surgery is currently effective in treating localized GC, recurrence or metastasis still occurs in 25% to 40% of patients after surgery [24]. According to relevant studies, the lung is a frequent metastatic organ in patients with GC [5], and the incidence of lung metastasis (LM) after GC surgery ranges from 1.3% to 3.8% [610]. Moreover, there is a lack of mature therapy standards for gastric cancer and lung metastasis (GCLM), and the 5-year survival rate of patients with GCLM is <5% [11]. At this stage, few studies have reported prognostic factors regarding the survival of patients with GCLM. Therefore, establishing a prediction model for patients with GCLM is clinically significant.

The treatment of GCLM has been recently diversified [1215]; however, the poor surgical outcome and complications associated with lung-occupying lesions in patients with GCLM lead to worse prognosis. Kong et al. [16] reported that the median survival of patients with GCLM is only four months. Moreover, studies have shown that the prognostic influences of GCLM generally include tumor histological grade, T stage, concurrent pulmonary metastases, primary lesions not subjected to surgery, bilateral pulmonary metastases, combined extrapulmonary metastases, and chemotherapy [17]. Regrettably, no studies have combined the relevant variables to assess the prognosis of GCLM.

A nomogram is a simple, multivariate visualization tool in oncology for predicting and quantifying individual patient survival, to aid clinical decision-making and promote precision medicine [1821]. In addition, the web-based nomogram, also known as “predictive probability web page calculator,” is a web page based on Shiny. This nomogram is a product of the electronic era, and the user just has to select the appropriate variable and click “Predict” to draw the probability of occurrence of the corresponding characteristics of patients, which is convenient and more practical [22]. Consequently, we aimed to devise two web-based nomograms to predict the overall survival (OS) and cancer-specific survival (CSS) in patients with GCLM based on the Surveillance, Epidemiology, and End Results (SEER) database.

2. Materials and Methods

2.1. Data Source and Inclusion Criteria

In this study, our data were obtained by downloading the SEER∗Stat software version 8.3.6. The SEER database is a public database, exempt from medical ethics review, and does not require informed consent. Strict inclusion and exclusion criteria were also developed, and the nadir criteria are listed below. The inclusion criteria were as follows: (I) patients diagnosed with GCLM between 2010 and 2015; (II) demographic variables, including age, race and gender, marital status, and insurance status; and (III) available tumor characteristics, including histological grade, T stage, N stage, brain metastasis, bone metastasis, and liver metastasis. The exclusion criterion was incomplete information. Next, we randomized the patients into training (70%) and validation cohorts (30%). In this study, patients in the training and validation cohorts were used to develop and validate the nomograms, respectively.

2.2. Clinicopathological Factors

Clinicopathological factors for the following variables were extracted: age (<60 and ≥60 years), race (white, black, and other), sex (female and male), histologic type (adenocarcinoma, signet ring cel1, intestinal type, other), T stage (T1, T2, T3, and T4), N stage (NO, N1, and N3), grade (grade I, grade II, grade III, and grade IV), bone metastasis (yes or no), liver metastasis (yes or no), brain metastasis (yes or no), primary site (cardia, fundus, body, gastric antrum, lesser, greater, other), radiotherapy (yes or no), chemotherapy (yes or no), surgery (yes or no), marital status (yes or no), and insurance (yes or no). OS and CSS were considered endpoint times. OS and CSS were, respectively, defined as the time from diagnosis to death from all causes and the time from cancer diagnosis to death.

2.3. Statistical Analysis

All statistical analyses were performed using the R software (version 4.0.2). value <0.05 (both sides) was considered statistically significant. We obtained relevant prognostic factors through univariate Cox regression analysis and obtained independent prognostic factors through multivariate Cox regression analysis on the basis of univariate Cox regression analysis. The prognostic nomograms for OS and CSS were created separately using the “rms” package, according to the independent prognostic factors. In addition, ROC curves for the prognostic nomograms were established. The area under the curve (AUC) was used to evaluate the discriminative power of the nomograms. In addition, calibration curves and decision curve analysis (DCA) for nomograms were established. Finally, we divided all patients into high- and low-risk groups according to the median risk score and tested the prognostic value of the nomograms using Kaplan-Meier (K-M) analysis.

3. Results

3.1. Flowchart

A detailed workflow is shown in Figure 1.

3.2. Characteristics of the Study Population

For the OS analysis, a total of 640 patients were included, 448 patients in the training cohort and the remaining 192 patients in the validation cohort. Among the 640 patients, the number of male patients (69.69%) was higher than that of the female patients (30.31%). A total of 484 patients (75.63%) were white, 75 patients (11.72%) were black, and 81 patients (12.65%) were classified as “other.” Of these patients, 219 were below 60 years of age and 421 were 60 years old or older. The baseline clinicopathological characteristics of patients in the OS group are shown in Table 1.

A total of 524 patients for the CSS analysis were enrolled; 368 patients were included in the training cohort, and the remaining 156 patients were included in the validation cohort. Of the 524 patients, 69.08% were male and 30.92% were female patients. Most of the patients (70.05%) were classified as white. Finally, 197 patients were below 60 years of age, and 327 patients were 60 years old or older. The baseline clinical pathological characteristics of patients in the CSS group are shown in Table 2.

3.3. Prognostic Factors for Patients with GCLM

For grouping status of OS, the detailed information of patients with GCLM in the OS group is shown in Table 3. Univariate Cox regression analysis demonstrated that grade II, liver metastasis, radiotherapy, and chemotherapy were OS-related prognostic factors. Multivariate Cox regression analysis showed that grade I1l ( value = 0.018, hazard ratios (HR) = 1.896, 95% confidence interval (CI) = 1.118–3.214), liver metastasis ( value <0.001, HR = 1.440, 95% CI = 1.179–1.760), and chemotherapy ( value <0.001, HR = 0.292, 95% CI = 0.235–0.363) were independent prognostic factors in patients with GCLM.

For grouping status of CSS, more details of the patients with GCLM in the CSS group are listed in Table 4. Univariate Cox regression analysis revealed that race, T2, liver metastasis, primary site, chemotherapy, and marital status were CSS-related prognostic factors. Multivariate COX regression analysis revealed that liver metastasis ( value <0.001, HR = 1.524, 95% CI = 1.217–1.909), primary site (greater, value = 0.001, HR = 2.315, 95% CI = 1.395–3.814), chemotherapy ( value <0.001, HR = 0.398, 95% CI = 0.317–0.501), and marital status ( value = 0.039, HR = 0.778, 95% CI = 0.629–0.988) were independent prognostic factors for GCLM.

3.4. Establishment of Nomogram

Prognostic nomograms of OS were established according to three independent prognostic factors (Figure 2(a)). Prognostic nomograms of CSS were created according to four independent prognostic factors (Figure 2(b)).

3.5. Verification of Nomogram

(i)ROC of OS: The AUCs at 3, 6, and 12 months were 0.753, 0.799, and 0.732, respectively, in the training cohort (Figures 3(a)3(c)). In the validation cohort, the AUCs at 3, 6, and 12 months were 0.855, 0.755, and 0.686, respectively (Figures 3(d)3(f)). The time-dependent ROC curves revealed that the AUC value fluctuated at approximately 0.8 from one month to 12 months (Figures 3(g) and 3(h)).(ii)ROC of CSS: The AUCs at 3, 6, and 12 months were, respectively, 0.820, 0.766, and 0.760, respectively, in the training cohort (Figures 4(a)4(c)). The AUCs at 3, 6, and 12 months were separately 0.894, 0.764, and 0.720, respectively, in the validation cohort (Figures 4(d)4(f)). The time-dependent ROC curves also demonstrated that the AUC value fluctuated at approximately 0.8 from one month to 12 months (Figures 4(g) and 4(h)).(iii)Calibration curves: The calibration curves at 3, 6, and 12 months for the OS probabilities were in good correspondence with the OS predicted with the nomograms to the actual results (Figures 5(a)5(f)). The calibration curves for the CSS probabilities at 3, 6, and 12 months also suggested the same better consistency among the CSS forecasted with the nomogram and the actual results (Figures 6(a)6(f)).(iv)DCA curves: DCA curves confirmed that nomograms can better predict OS (Figures 7(a)7(f)) and CSS (Figures 8(a)8(f)) in patients with GCLM. In addition, K-M survival curves revealed that, for OS (Figures 9(a) and 9(b)) and CSS (Figures 9(c) and 9(d)), patients from the higher risk group had a more unfavorable prognosis than those from the lower risk group.

3.6. Establishment of Two Web-Based Nomograms

Based on the above results, we constructed a probabilistic calculator OS (https://zhenghh.shinyapps.io/aclmos/) and CSS (https://zhenghh.shinyapps.io/aslmcss/) based on a dynamic network, which predicts the OS and CSS of patients with GCLM based on previous nomograms (Figure 10(a)). For example, the CSS of a patient with GCLM, who is a married woman with liver metastases, occurs in the gastric body and without chemotherapy. The survival curve of this patient is shown in Figure 10(b). Survival rates and 95% confidence intervals at three months (Figure 10(c), black line), six months (Figure 10(c), blue line), and 12 months (Figure 10(c), red line) can also be observed at the operation interface. In addition, specific numbers are summarized to improve the prediction accuracy (Figure 10(d)). The OS of patients with GCLM can be predicted in the same way.

4. Discussion

GC is a malignant tumor of the gastrointestinal tract with a low early diagnosis rate, low surgical resection rate, and high mortality rate [23]. The majority of patients with GC are in the advanced stage at the time of consultation, and 32.6% have distant metastases [24]. Interestingly, the incidence of LM is 14.9% [24]. LM typically indicates advanced tumors, and when not detected and treated in time, the prognosis is extremely poor. In our study, we created two nomograms to predict the prognosis of patients with GCLM. These two nomograms performed well in predicting OS and CSS in patients with GCLM, allowing more precise individualized clinical decision-making and surveillance. Finally, we built two web-based nomograms based on the nomograms. This prediction model can facilitate the prediction of the survival probability of patients with GCLM at a specific time. Clinicians can also arrange personalized treatment plans based on the prediction results.

As we know, survival statistics of GCLM are not optimistic. Therefore, clinicians can identify the risk and protective factors of GCLM, which can result in a good prognosis for patients with GCLM. A number of potential biomarkers that are involved in cadherin-catenin interaction, integrin signaling, and cancer stem cell identification in gastrointestinal cancers have been observed [25]. However, these biomarkers are difficult to measure, have low sensitivity, are expensive, and have few clinical applications. Therefore, it is necessary to actively identify other clinical features related to prognosis in patients with advanced GCLM. In 2019, Wenjie et al. [26] found that age, race, primary site, T stage, and N stage are independently related to CSS in patients with lymph node-positive GC. Studies have shown that the fat content in high muscle tissue is associated with CSS in patients with locally advanced GC [27]. However, so far, few studies have focused on GCLM, and no corresponding nomogram has been established to assess the survival and prognosis of these patients. Previous studies have confirmed that the prognostic factors of liver cancer are quite different from those of liver cancer with bone metastasis [2831]. Therefore, it is not possible to evaluate the survival of patients with GCLM solely through the prognostic factors of GC, due to possible biases and errors. In this study, we screened the relevant independent prognostic factors of patients with GCLM. More meaningfully, this study integrates these multiple prognostic factors and visual graphs to predict the survival of patients with GCLM through nomograms, which is a practical tool widely used in oncology [32]. The web-based nomograms were based on further upgraded results.

We found that liver metastasis is an independent risk factor for OS and CSS in patients with GCLM. There are two possible reasons for this. First, the liver contains a rich blood supply, tumor metastasis is rapid, patients are already advanced when symptoms appear, and most of them miss the time of surgery. Second, for patients with GCLM and hepatocellular carcinoma (HCC), the prognosis is worse because the patients are lethargic and weak, and their immunity is reduced, typically when they develop complications associated with advanced HCC (such as jaundice, ascites, peritonitis, and hepatic encephalopathy). Chemotherapy was found to be an independent protective factor for OS and CSS. This result confirmed the importance and necessity of chemotherapy in patients with GCLM. The National Comprehensive Cancer Network guidelines clearly state that chemotherapy is recommended for the treatment of patients with unresectable or metastatic GC [33]. A study reported median OS times of 8.6 and 7.9 months for patients with advanced GC treated with cisplatin combined with S-1 (CS) versus cisplatin combined with 5-FU (CF) regimens, respectively () [34]. Standardized chemotherapy not only relieves the patients’ clinical symptoms but also prolongs the survival time. Hence, it is worthwhile to focus on the possibility of liver metastasis in patients with GCLM. To obtain an excellent prognosis, doctors could prefer chemotherapy for the clinical treatment of patients with GCLM. In addition, we incorporated marital status into our study. The results of this study showed that married patients with GCLM had better clinical prognosis than those who were unmarried. It has been shown that marriage plays a humanistic role during the treatment of oncology patients and that care plays a crucial role in influencing tumor progression [35].

However, there are some limitations to our study. First, although we have set strictly incorporated exclusion standards, the deletion of patients is missing and may cause statistical bias. Second, there is no detailed treatment information in the SEER database, such as specific chemotherapy modalities and surgical procedures. Third, the SEER database has limited coverage, and some important factors such as smoking, alcohol consumption, family history of tumor, and other factors that may affect patient prognosis were not assessed.

5. Conclusions

In conclusion, this study revealed that grade, liver metastasis, and chemotherapy were independent prognostic factors for OS, where the risk factors were grade and liver metastasis, and chemotherapy was a protective factor. Liver metastasis, primary site, chemotherapy, and marital status were independent prognostic factors for CSS, where liver metastasis and primary site were risk factors, and chemotherapy and marital status were protective factors. We created two easy-to-use visual web-based nomograms with several clinical and pathological factors to quantitatively predict OS and CSS in patients with GCLM. Moreover, our model may help physicians develop individualized postoperative follow-up strategies.

Abbreviations

AUC:Area under the curve
CSS:Cancer-specific survival
DCA:Decision curve analysis
GC:Gastric cancer
GCLM:Gastric cancer and lung metastasis
LM:Lung metastasis
OS:Overall survival
ROC:Receiver operating characteristic
SEER:Surveillance, Epidemiology, and End Results.

Data Availability

The dataset from the SEER database which was generated and/or analyzed during the current study is available in the SEER dataset repository (https://seer.cancer.gov/).

Ethical Approval

The study’s approval was waived by the local ethics committee, as the SEER data were publicly available and deidentified.

Conflicts of Interest

The authors declare that there are no conflicts of interest with respect to the research, authorship, and/or publication of this article.

Authors’ Contributions

Honghong Zheng and Zhehong Li contributed equally to this work. HH Z and ZH L conceived and designed the study. Honghong Zheng and Jianjun Li performed the literature search. Zhehong Li and Shuai Zheng generated the figures and tables. Jianjun Li and Shuai Zheng analyzed the data. Honghong Zheng and Zhehong Li wrote the manuscript, and Enhong Zhao critically reviewed the manuscript. Enhong Zhao supervised the study. All experiments and methods met the standards of relevant guidelines and regulations. All authors have read and approved the manuscript.

Acknowledgments

This study is funded by the 2020 Hebei Provincial Finance Department Grant for Outstanding Talents in Clinical Medicine (no. 2020397).