Abstract

Background. The dismal prognosis of gastric signet ring cell carcinoma (GSRC) is a global problem. The current study is conducted to comprehensively evaluate clinicopathological features and survival outcomes in GSRC patients stratified by anatomic subsites. Then, predictive nomograms are constructed and validated to improve the effectiveness of personalized management. Method. The patients diagnosed with GSRC were recruited from the online SEER database. The influence of anatomic subsites on overall survival (OS) and cancer-specific survival (CSS) was evaluated using multivariate Cox regression and Kaplan-Meier analysis. Then, we employed propensity score matching (PSM) technique to decrease selection bias and balance patients’ epidemiological factors. Predictive nomograms were constructed and validated. Sensitivity analysis was performed to validate the conclusion. Results. Multivariate Cox regression demonstrated that the patients with overlapping gastric cancer (OGC) suffered the highest mortality risk for OS (HR, 1.29; 95% CI, 1.23-1.36; ) and CSS (HR, 1.33; 95% CI, 1.28-1.37; ). Age, TNM stage, tumor localization, tumor size, surgery, and chemotherapy presented a highly significant relationship with OS and CSS. Following subgroup and PSM analysis, OGC patients were confirmed to have the worst OS and CSS. Then, nomograms predicting 6-month, 12-month, and 36-month survival were constructed. The area under the curve (AUC) value in ROC was 0.775 (95% CI, 0.761-0.793) for 6-month survival, 0.789 (95% CI, 0.776-0.801) for 12-month survival, and 0.780 (95% CI, 0.765-0.793) for 36-month survival in the OS group, while in the CSS group, it was 0.771 (95% CI, 0.758-0.790) for 6-month survival, 0.781 (95% CI, 0.770-0.799) for 12-month survival, and 0.773 (95% CI, 0.762-0.790) for 36-month survival. Conclusion. We identified anatomic subsites as a predictor of survival in those with GSRC. Patients with OGC suffered the highest mortality risk. The proposed nomograms allowed a relatively accurate survival prediction for GSRC patients.

1. Introduction

Gastric cancer (GC) is the fifth most frequently diagnosed malignancy and the third greatest cause of cancer-associated death worldwide [1]. Adenocarcinoma occupies the majority of GC [2]. Gastric signet ring cell carcinoma (GSRC) is a rare subtype of gastric adenocarcinoma, which is related to aggressive malignancy behavior and poor prognosis [3]. It is reported that the occurrence rate of GSRC has gradually risen in the past three decades in the United States [4].

Anatomically, the stomach is classified into two prime subsites: the proximal section, which is composed of the cardia and fundus, and the distal section, which includes the body, antrum, and pylorus. Some studies also include overlapping section, which denote that the tumor develops across two or more anatomic subsites [5]. Recently, it is demonstrated that cardia, noncardia, and overlapping GC have divergent biological features and predisposing factors, which should be considered separately to investigate GC behavior [6]. And subsite-specific analysis can promote targeting prevention and therapy. However, no risk stratification by anatomic subsites has been made in the patients diagnosed with GSRC before.

Herein, a population-based research was conducted to investigate the clinicopathological features and survival outcomes in GSRC incidence, stratified by anatomic subsites. And one-to-one propensity score matching (PSM) was made to examine the effect of tumor localization on GSRC prognosis. Besides, nomogram models to predict personal prognosis were constructed and validated based on multi-institution and multipopulation data from the Surveillance, Epidemiology, and End Result (SEER) database.

2. Materials and Methods

2.1. Patient Selection

Patients were extracted from the SEER 18 regions’ database (Incidence-SEER 18 Regs Research Data (with additional treatment fields), Nov 2017 Sub (1975-2016 varying)) using SEERSat software (Version 8.3.5) [7]. We designed the following inclusion criteria: (1) years at diagnosis; (2) histology ICD-O-3 (International Classification of Diseases for Oncology, 3rd edition) confined only to signet ring cell carcinoma (8490); and (3) patients with complete demographic, clinicopathological, treatment, and follow-up information. The detailed patient selection workflow is shown in Figure 1.

2.2. Clinicopathological Variables

Clinical features including tumor localization, age, race, marital status, gender, median household income, TNM grade, insurance status, tumor grade, T stage, N stage, M stage, tumor size, regional nodes examined, distal organic metastasis, treatment methods, and prognostic information were extracted for each patient. Based on the ICD-O-3 codes, anatomic subsites were characterized as follows: cardia (C16.0), fundus (C16.1), body (C16.2), antrum (C16.3), pylorus (C16.4), lesser curvature (C16.5), greater curvature (C16.6), overlapping (C16.8), and unspecified (C16.9), which was consistent with prior study [5]. Overlapping gastric cancer denoted that the tumor developed across two or more anatomic subsites. To avoid the inaccurate definition, only the vertical position classification (C16.0 to C16.4, C16.8) was included. As a result, the patients with tumor in cardia and fundus were divided into the proximal gastric cancer (PGC) group, while the tumors locating in corpus, antrum, and pylorus were included in the distal gastric cancer (DGC) group, and overlapping lesion of the stomach was in the overlapping gastric cancer (OGC) group. Age was categorized as 18-49 years, 50-59 years, 60-69 years, 70-79 years, and ≥80 years. Race was divided into black, white, American Indian/Alaska Native (AI), and Asian or Pacific islander (API). Median house income was categorized as quartile 1, quartile 2, quartile 3, and quartile 4 from bottom to top. TNM staging system was based on the 7th edition of the American Joint Committee on Cancer (AJCC). Radiation therapy and chemotherapy were classified into “yes” and “no/unknown.” The study was exempted by institutional review boards due to the lack of subject identifiers and interventions.

2.3. Statistical Analysis

The categorical variables were tested using the chi-square test. The primary endpoints were overall survival (OS) and cancer-specific survival (CSS). Kaplan-Meier (KM) curves and log-rank test were used to estimate survival distribution. Cox proportional hazard models were applied to perform univariate and multivariate analyses. The proportional hazard assumption was assessed using Schoenfeld residuals and was met for all models (Figures S1 and S2).

PSM was a novel statistical method which could minimize the heterogeneity and mimic randomized controlled trials [8]. It was performed to reevaluate the impact of anatomic subsites using one-to-one nearest-neighbor matching and a caliper width of 0.01. Standardized difference (SD) was employed to examine the changes in covariate before and after PSM. denoted significant balances in the baseline variables [9].

The predictive ability of nomograms was assessed by calibration curves and concordance index (-index) [10, 11]. In the calibration plot, 1000 bootstrap resamples were conducted to investigate the consistence of the predicted and observed probabilities of survival. Besides, the receiver operating characteristic (ROC) curves were plotted to show the prediction power of the constructed model, and the area under the curve (AUC) value was listed. Higher AUC presented a stronger prediction power. Then, a sensitivity analysis was performed to validate the conclusion.

The statistical analyses were based on R software, version 4.0.3 (https://www.r-project.org) using packages of tableone, rms, survival, survminer, ggplot2, cobalt, pROC, and Matchit. A two-tailed was indicated statistically significant.

3. Results

3.1. Clinicopathological Characteristics

A total of 2039 patients with GSRC from 2010 to 2015 were recruited in the SEER database. The clinicopathological features in each group are presented in Table 1.

The patients in the PGC group tended to be in the older age groups of 60-69 (31.1%) and 70-79 (20.5%). The PGC group had more white patients (82%), while there were more black (14.3%) and API (21.4%) patients in the DGC group. Male and female proportion was the highest in the PGC group (71.2%) and the DGC group (53.1%), respectively. Socioeconomic status was divided into quartile 1 (<$51030), quartile 2 ($51031-$61237), quartile 3 ($61238-$74330), and quartile 4 (>$74331). Compared to the OGC and PGC groups, the patients in the DGC group tended to have earlier stage (26.5%), T stage (27.4%), N stage (45.4%), M stage (76.2%) and smaller tumor size (17.3%). The OGC group presented the highest bone metastasis proportion (6.0%), but more liver metastasis (6%) and lung metastasis (4.5%) were found in the PGC group. The proportion of patients that underwent surgery presented the largest (67.1%) in the DGC group. And PGC patients had more intentions to receive radiation (49.3%) and chemotherapy (78.8%).

3.2. Effects of Tumor Localization on OS and CSS

The OS and CSS of GSRC patients were evaluated by KM analysis. Significant differences in OS and CSS were found based on tumor localization () (Figure 2).

Univariate analysis demonstrated that tumor localization, age, race, marital status, median household income, TNM stage, tumor size, regional node examined, bone metastasis, liver metastasis, lung metastasis, surgery, radiation, and chemotherapy were significantly associated with OS (Table 2) and CSS (Table 3) (all ).

The outcomes of multivariate Cox regression analysis demonstrated that the patients with DGC suffered relatively low risk for OS (OGC: HR, 1.29; 95% CI, 1.23-1.36; ; PGC: HR, 1.15; 95% CI, 1.09-1.22; ) (Table 2) and CSS (OGC: HR, 1.33; 95% CI, 1.28-1.37; ; PGC: HR, 1.18; 95% CI, 1.10-1.23; ) (Table 3). The result also presented that patients with OGC suffered the highest mortality risk. Age, TNM stage, tumor size, surgery, and chemotherapy presented high levels of correlation with OS and CSS.

To decrease the impact of confounding factors, all GSRC patients were stratified based on clinical characteristics. It was identified that tumor localization was an independent prognostic factor of OS (Figure 3) and CSS (Figure 4) in the subgroups stratified by gender, surgery, radiation, chemotherapy, and T stage (all ).

3.3. Survival Analysis after 1 : 1 PSM

A one-to-one PSM was conducted to minimize the influence of potential confounders. Two matched groups were produced: a PGC and OGC cohort and a DGC and OGC cohort. The clinical baselines between both cohorts were balanced (Table 4). SD in most variables were less than 0.1, which indicated good balancing performance (Figure 5). OGC patients presented worse OS and CSS in the PGC-OGC cohort and the DGC-OGC cohort after PSM (Figure 6).

3.4. Construction and Validation of the Nomogram

These six significant independent variables were applied to construct the prognostic nomograms to predict the 6-month, 12-month, and 36-month OS and CSS of GSRC patients: age, TNM stage, tumor size, tumor localization, surgery, and chemotherapy (Figures 7(a) and 8(a)).

The calibration curves for 6-month, 12-month, and 36-month OS and CSS showed good consistence between the predicted and observed probabilities of survival (Figures 7(b) and 8(b)). To measure the accuracy of the nomograms, the -index of 0.751 (95% CI, 0.733-0.764) for OS and 0.764 (95% CI, 0.742-0.789) for CSS was concluded. Furthermore, the ROC curves regarding the predictive ability of 6-month, 12-month, and 36-month survival were constructed (Figures 7(c) and 8(c)). And the resulting AUC values were calculated. In the OS group, it was 0.775 (95% CI, 0.761-0.793) for 6-month survival, 0.789 (95% CI, 0.776-0.801) for 12-month survival, and 0.780 (95% CI, 0.765-0.793) for 36-month survival, respectively, while in the CSS group, it was 0.771 (95% CI, 0.758-0.790) for 6-month survival, 0.781 (95% CI, 0.770-0.799) for 12-month survival, and 0.773 (95% CI, 0.762-0.790) for 36-month survival, respectively.

3.5. Sensitivity Analysis

Considering that the tumor size was a high-risk factor and had a high proportion of unknown values, a sensitivity analysis was carried out to validate the conclusion. A total of 1354 patients were included with specified tumor size, and then, univariate and multivariate Cox proportional hazard was carried out, presenting that the patients with OGC suffered the highest risk for OS (HR, 1.27; 95% CI, 1.14-1.42; ) (Table S1) and CSS (HR, 1.25; 95% CI, 1.11-1.38; ) (Table S2). Moreover, in the KM analysis, patients with OGC suffered the worst survival in OS () and CSS () (Figure S3). Nomograms based on the six significant independent variables were constructed, and the relative calibration curves and ROC curves showed good consistence and predictive ability (Figure S4). These results confirm the conclusions above.

4. Discussion

GSRC is a highly malignant type of GC, with a reported 5-year survival rate of only 15.9% [12]. And it was identified that tumor location was correlated with GC behavior and patients’ survival. Nevertheless, survival analysis of GSRC based on the tumor location continues to be scarce. Hence, it is urgent to make an in-depth study on the role of tumor location and establish a predictive model to guide better clinical practice. This was the first research to investigate the effect of tumor location on GSRC prognosis using PSM in the SEER database. The results confirmed the concept that PGC, DCG, and OGC were different malignant entities, which should be considered separately to improve GSRC incidence and verify driving risk factors.

Gender was an important factor influencing the occurrence of GC. In the research, the total ratio of males to females was 1.2 : 1, with a higher ratio (2.47 : 1) in the PGC patients, which might attribute to the unhealthy diet and habits in men, such as smoking or alcohol abusing [13]. In addition, this research showed that the PGC group presented to be more frequent in aging population, which was similar to previous Chinese reports [14, 15]. However, no correlation was found between age and tumor site in two European studies. The distinction might be partly due to the discrepancy of ethnic lines [16, 17]. In addition, there was a relatively higher frequency of AJCC stage IV (38.2%), N3 stage (30.5%), M1 stage (38.2%) patients in the OGC group, which suggested a more aggressive malignant behavior of OGC.

In multivariate Cox regression analysis, age, TNM stage, tumor size, tumor localization, surgery, and chemotherapy were identified as prognostic factors. The patients with OGC suffered the highest risk for OS (HR, 1.29; 95% CI, 1.23-1.36; ) and CSS (HR, 1.33; 95% CI, 1.28-1.37; ). Besides, it was well established that older age had lower survival time because of more comorbidities than the younger patients [18]. Furthermore, it was found that GSRC patients who received chemotherapy suffered lower risk, which was consistent with previous research [19]. Our results further supported former findings of larger tumor size as an independent prognostic role negatively correlated with GSRC patient survival. It was reported that larger tumor might present higher probability of invasive growth and lymph node metastasis [20, 21]. In most malignancy, histological grade was one of the indicators which determine prognosis. However, in this research, no significant correlation was found. Since approximately 97% of the GSRC patients were in the grade III/grade IV histological classification, grade failed to be a risk factor for determining patients’ prognosis.

The prognosis in PGC and DGC still remained controversial. Majority of reports had demonstrated a significant poorer survival in PGC patients compared with DGC patients [14, 2224], while no significant difference was found in other research [16, 25]. Katsuhiko et al. even reported a longer OS in patients with PGC [26]. The distinction might be associated with different staging and histology in different research [23, 24]. Furthermore, esophageal cancer was included into PGC in several studies, which led to confounding differences. However, few studies concerned the role of OGC. In our study, the subtypes of GC were confined to GSRC, and the anatomic subsites were clearly defined as DGC, OGC, and PGC. So the results were more convincing.

Before PSM, the results presented the best survival of DGC and the worst survival of OGC in OS and CSS. To minimize the influence of potential confounders, PSM was performed to balance the clinical baselines between both cohorts. We further confirmed that those who were diagnosed with OGC suffered worse OS and CSS in the PGC-OGC cohort and the DGC-OGC cohort. Thomassen et al. founded that between 1995 and 2011 in the Netherlands, primary cancer of overlapping location was associated with higher odds for gastric cancer peritoneal carcinomatosis and worse survival [27], which indicated that OGC presented more invasive features than DGC and PGC in part of GC patients. The overlapping tumor developed across two or more anatomic subsites, presenting more aggressive malignancy behavior than the one-site-confined tumor. The Cancer Genome Atlas (TCGA) Research Network recently identified four subtypes with different molecular profiles to classify GC: Epstein-Barr virus-positive, microsatellite unstable, genomically stable, and tumors with chromosomal instability [28]. The correlation between four molecular GC subtypes and different anatomical sites was observed, which presented that the majority of tumor arising at the proximal section were associated with chromosomal instability [29]. So the molecular profile of OGC should raise concern to explain the mechanism of invasiveness.

Nomograms had been regarded as efficient tools in clinical practice, which couldpredict numerical probabilities for individual patients by incorporating critical prognostic factors [30]. Many nomograms had demonstrated superiority over the traditional TNM staging system in predicting survival in multiple malignancies [3134]. Several studies had constructed prognostic nomogram of patients with GSRC; however, these researches had either limited population selection or absence of accurate tumor location. Wei et al. only included locally advanced (stage II and stage III) GSRC and constructed CSS prognostic nomogram, finding that patients who received postoperative radiotherapy had a better prognosis than surgery alone [35]. Guo et al. enrolled GSRC patients from 2004 to 2015 and constructed nomogram, but tumor location was not included in the clinical variables [36]. Wang et al. enrolled nonelderly GSRC patients, and primary site was considered in the study. But the classification was ambiguous; even nonvertical position (lesser curvature) and body position were classified together. So the primary site did not present as an independent risk factor [37]. This was the first study to include all GSRC patients from 2010 to 2015 and clearly define anatomic subsites as DGC, OGC, and PGC. On the basis of the multivariate analysis results, age, TNM stage, tumor size, tumor localization, surgery, and chemotherapy were integrated into this predictive model. The calibration curves showed good consistence between the predicted and observed probability of survival. And the AUC values ranged from 0.773 to 0.789 in ROC curves, which showed high accuracy in predicting 6-month, 12-month, and 36-month OS and CSS. So the nomogram could present great prognostic efficiency among GSRC patients with different tumor locations.

Furthermore, a sensitivity analysis was performed to validate the results. A total of 1354 patients were included with specified tumor size. The results showed that the OGC patients still suffered the highest risk for OS (HR, 1.27; 95% CI, 1.14-1.42; ) and CSS (HR, 1.25; 95% CI, 1.11-1.38; ). And the constructed nomogram remained good consistence and predictive ability. The comprehensive statistical analyses adjusted for residual confounders, thus making the conclusion more reliable.

Limitation of the study includes the lack of detailed information of radiotherapy and chemotherapy, such as the dose of radiotherapy and the chemotherapy regimen. Also, this is a retrospective analysis, so selection bias is inevitably brought in. Further prospective research is necessary to confirm the conclusion. Despite these limitations, our substantial cases to investigate incidence by tumor location could provide novel insights on the epidemiology of GSRC.

5. Conclusion

In conclusion, we firstly identified anatomic subsites as a predictor of survival in those with GSRC. Patients with OGC suffered the highest mortality risk. The constructed nomograms presented a relatively good performance and could be considered a practical tool to predict personal prognosis in GSRC patients. Further studies should be conducted separately to learn more about the etiologies of GSRC based on the different anatomic subsites.

Abbreviations

GC:Gastric cancer
GSRC:Gastric signet ring cell carcinoma
OS:Overall survival
CSS:Cancer-specific survival
PSM:Propensity score matching
SEER:Surveillance, Epidemiology, and End Result
PGC:Proximal gastric cancer
DGC:Distal gastric cancer
OGC:Overlapping gastric cancer
AI:American Indian/Alaska Native
API:Asian or Pacific islander
TNM:Tumor-node-metastasis
AJCC:American Joint Committee on Cancer
KM:Kaplan-Meier
HR:Hazard ratio
ROC:Receiver operating characteristic curves
AUC:Area under the curve.

Data Availability

The datasets generated and analyzed during the current study are available in the SEER database (https://seer.cancer.gov/) and from the corresponding authors upon reasonable request.

Ethical Approval

The SEER database is publicly available and provides deidentified case data. So the study is exempted of ethics approval.

Informed consent is not applicable due to the lack of subject identifiers and interventions.

Disclosure

This article has been presented as preprint in Research Square according to the following link: https://www.researchsquare.com/article/rs-403278/v1.

Conflicts of Interest

The authors have no conflict of interests with any organizations or individuals.

Authors’ Contributions

(I) Yangyang Xie and Hai Huang were responsible for the study design. (II) Yangyang Xie, Xue Song, and Zhongkai Ni were responsible for data collection. (III) Yangyang Xie, Xue Song, Haimin Jin, and Xiaowen Li were responsible for statistical analysis. (IV) Yangyang Xie, Xue Song, and Haimin Jin were responsible for data interpretation. (V) Yangyang Xie, Xiaowen Li, and Wenge Dong were responsible for literature search. (VI) All authors were responsible for manuscript writing. (VII) All authors gave final approval of the manuscript.

Acknowledgments

The authors gratefully thank the efforts of the SEER Program. This study was supported by the Science and Technology Program of Traditional Chinese Medicine in Zhejiang Province (2021ZB208) and Zhejiang Medical and Health Science and Technology Plan Project (2021KY927).

Supplementary Materials

Figure S1: Schoenfeld residuals’ plot for overall survival. Figure S2: Schoenfeld residuals’ plot for cancer-specific survival. Figure S3: overall survival (A) and cancer-specific survival (B) curves of selected patients based on tumor localization. Figure S4: validation of the nomogram composed of prognostic risk model. The calibration curve of the nomogram between the predicted probabilities of survival and the 45-degree line for overall survival (A) and cancer-specific survival (B). The area under the ROC curve was used to show the discrimination of the nomogram for overall survival (C) and cancer-specific survival (D). Table S1: impact of tumor localization on the OS by univariate and multivariate survival analysis in selected patients. Table S2: impact of tumor localization on the CSS by univariate and multivariate survival analysis in selected patients. (Supplementary Materials)