Introduction

Since the first gastric cancer surgery by Billroth in the nineteenth century, gastric cancer treatment has evolved enormously and established itself in the current gastric cancer treatment guidelines1,2,3. The evidence-based gastric cancer treatment guidelines cited results from several pivotal randomized controlled studies, such as Dutch, MAGIC, ACTS-GC, CLASSIC, REGATTA, KLASS, CLASS, and so on4,5,6,7,8,9,10,11,12. Prospective, randomized controlled trials (RCT), such as these, are the most reliable way of comparing treatments and are considered the gold standard for studying causal relationships13.

In clinical practice, it is difficult for patients and surgeons to participate in prospective clinical trials, including multicenter surgical RCTs. Patients worry about treatment-related postoperative complications when receiving a new rather than existing treatment. Patients sometimes feel instinctive anxiety about the word “clinical trial.” Surgeons also feel pressured by new, unfamiliar surgical techniques, particularly when performing additional procedures for research purposes. In multicenter surgical trials, surgeons are concerned that their surgical results may be compared with those of other surgeons.

We hypothesized that participation in a prospective surgical clinical trial would not affect surgical outcomes. We aimed to evaluate the effects of participation in prospective surgical clinical trials on surgical outcomes after radical gastrectomy for gastric cancer.

Methods

Patients

We retrospectively reviewed a prospectively collected data from the gastric cancer database from January 2016 to December 2020 consisting of 1,915 patients who underwent gastrectomy. Patients who met one of the following criteria were excluded from the study: non-curative resection, completion of total gastrectomy, history of preoperative chemotherapy, emergency operation, or stage IV disease. Due to the retrospective nature of the study, this study was approved by the Institutional Review Board of Ajou University Hospital (AJOUIRB-DB-2023–027), and inform consent was waived.

Surgery

All surgical procedures were performed according to the Korean and Japanese Gastric Cancer treatment guidelines2,14. The extent of gastric resection was determined based on tumor location. Among the patients enrolled in the SENORITA trial, which compared sentinel node navigation surgery with standard gastrectomy, those assigned to the test group underwent wedge resection of the stomach15. D2 lymph node dissection was performed if advanced gastric cancer or lymph node metastasis was suspected. To restore gastrointestinal continuity after surgery, we used different reconstruction methods depending on the type of gastrectomy. We employed gastroduodenostomy, gastrojejunostomy, or Roux-en-Y gastrojejunostomy for distal subtotal gastrectomy. Proximal gastrectomy was performed via double-tract reconstruction, while total gastrectomy was performed via Roux-en-Y esophagojejunostomy. Pylorus-preserving gastrectomy, on the other hand, was performed via gastrogastrostomy. The tumor stage was defined according to the 8th edition of the American Joint Committee on Cancer staging system16. Postoperative complications were defined as any event requiring pharmacological, endoscopic, radiological, or surgical intervention until postoperative day 30. The severity of postoperative complications was graded using the Clavien–Dindo classification system17. Major complications were defined as Clavien–Dindo grade III or higher.

Surgical clinical trials

Our study included patients from several prospective surgical clinical trials, collectively called the SCT group. These trials included the VIGTORY trial (n = 3), which compared D2 versus D2 plus 14v lymph node dissection in clinical T3N + and T4N + advanced gastric cancer; the AaRon trial (n = 28), which evaluated robotic distal gastrectomy for non-inferiority of N2 lymph node dissection; the ADDICT trial (n = 78), which compared D1 + versus D2 lymph node dissection in clinical stage IB or II advanced gastric cancer; the KLASS-04 trial (n = 17), which compared distal gastrectomy versus pylorus-preserving gastrectomy for middle third early gastric cancer; the KLASS-05 trial (n = 24), which compared total versus proximal gastrectomy for upper third early gastric cancer; the KLASS-06 trial (n = 34), which compared open versus laparoscopic total gastrectomy in advanced gastric cancer; the KLASS-07 trial (n = 39), which compared laparoscopy-assisted versus totally laparoscopic distal gastrectomy; the SDR-01 trial (n = 43), which evaluated the safety of sentinel lymph node-guided lymph node dissection using indocyanine green in near-infrared laparoscopy; the SDR-02 trial (n = 36), which compared quality of life and endoscopic findings according to the reconstruction methods after laparoscopic distal gastrectomy for gastric cancer; and the SENORITA trial (n = 7), which compared sentinel node biopsy with stomach-preserving surgery versus gastrectomy in early gastric cancer.

Outcomes

The primary outcome was the development of postoperative complications according to participation in surgical clinical trials. Secondary outcomes were the development of major postoperative complications and differences in overall survival according to participation in surgical clinical trials. Overall survival was defined as the duration from surgery to death from any cause.

Statistical analysis

The propensity score weighting (PSW) technique using a generalized boosted method (GBM) was applied to reduce differences in baseline clinicopathological characteristics between groups according to participation in surgical clinical trials. GBM was used to estimate the propensity score, and the covariates for PSW were age, sex, body mass index, American Society of Anesthesiologists physical status (ASA) score, preoperative white blood cell count, preoperative hemoglobin count, preoperative albumin count, clinical T stage, clinical N stage, operation year, operator, operation method, extent of gastric resection, extent of lymph node dissection, combined resection, tumor size, and number of harvested lymph nodes. Standardized mean difference (SMD) was calculated to assess the balance of variables between the matched groups, with an SMD < 0.1 indicating good balance and an SMD < 0.2 indicating an adequate balance between the groups18. Categorical variables are shown as numbers (percentages), and continuous variables are summarized as mean ± standard deviation. The chi-square and Student’s t-test were used to analyze the relationship between the groups according to participation in surgical clinical trials. The Kaplan–Meier method was used to plot the time-to-event data, and the log-rank test was used to compare group differences in time-to-event data. Statistical analyses were conducted using R software (version 3.4.4; R Foundation for Statistical Computing, Vienna, Austria). P values < 0.05 were considered statistically significant.

Results

Patient characteristics

Of the 1915 patients who underwent gastrectomy for gastric cancer, 226 were excluded from the analysis because of non-curative resection (n = 157), completion of total gastrectomy (n = 22), preoperative chemotherapy (n = 10), emergency operation (n = 2), or stage IV disease (n = 35). A total of 1689 patients were included in the study (Fig. 1).

Figure 1
figure 1

Study design.

Of the 1689 patients, 309 (18.3%) were included in the SCT group, and the remaining 1380 (81.7%) were included in the non-SCT group. The clinicopathological characteristics of the patients are summarized in Table 1. Before PSW, the SCT group was significantly younger, had a lower ASA score, higher preoperative hemoglobin and albumin counts, and earlier clinical T stage than the non-SCT group (P < 0.001, P = 0.008, P = 0.021, P < 0.001, and P < 0.001, respectively). Regarding pathologic features, the SCT group had a relatively small tumor size compared to the non-SCT group (P = 0.034). After PSW, variables that were not statistically similar before PSW were well-balanced between the groups.

Table 1 Patient clinicopathological characteristics.

The surgical characteristics of the patients are presented in Table 2. Before PSW, there was a significant difference (P < 0.001) in the proportion of patients who had undergone recent gastrectomy between the SCT and non-SCT groups, with a higher number of patients included in the SCT group than in the non-SCT group. Two of the four surgeons actively participated in the surgical clinical trials (P < 0.001). There were statistically significant differences between the groups in the extent of gastrectomy and combined resection of other organs (P = 0.002 and P = 0.014, respectively). After PSW, all surgical variables were well balanced between the groups but showed small differences in the operator, the extent of gastrectomy, and combined resection of other organs.

Table 2 Patient surgical characteristics.

The perioperative outcomes of the patients are shown in Table 3. Overall, no significant differences were observed in terms of the operation time, intraoperative blood loss, complications (including details and major complications), or hospital stay between the groups (all P > 0.05). After PSW, the SCT group had a longer operation time and a higher incidence of postoperative pancreatitis. The other variables were not significantly different between the groups.

Table 3 Patient perioperative outcomes.

Survival outcomes

The median follow-up duration after gastrectomy was 48 months (range, 19–79 months) until the study cutoff date (October 2022). Of the 1689 unweighted patients, 175 (10.4%) died: 16 (9.1%) in the SCT group and 159 (90.9%) in the non-SCT group. The hazard ratio for death was 0.419 (95% confidence interval [CI], 0.247–0.712; P = 0.001) in the SCT group with reference to the non-SCT group. The 3-year overall survival rates of patients in the SCT and non-SCT groups were 95.1% and 89.6%, respectively (log-rank P = 0.003) (Fig. 2). However, the differences in 3-year overall survival were not significant when the patients were stratified according to pathologic stage (96.9% for the SCT group vs. 98.2% for the non-SCT group in stage I, P = 0.121; 87.2% for the SCT group vs. 97.7% for the non-SCT group in stage II, P = 0.123; 64.3% for the SCT group vs. 73.6% for the non-SCT group in stage III, P = 0.157).

Figure 2
figure 2

Overall survivals according to the participation in the surgical clinical trials before propensity score weighting. (a) All stages, (b) Stage I, (c) Stage II, (d) Stage III.

In the weighted populations, the hazard ratio for death in the SCT group with reference to the non-SCT group was 0.715 (95% confidence interval [CI], 0.417–1.227; P = 0.223). The 3-year overall survival rates for the SCT and non-SCT groups were 97.3% and 94.6%, respectively (P = 0.18) (Fig. 3). After stratification according to the pathologic stage, the 3-year overall survival rates of the SCT group were not significantly different from those of the non-SCT group (100% for the SCT group vs. 98.9% for the non-SCT group in stage I, P = 0.37; 100% for the SCT group vs. 95.2% for the non-SCT group in stage II, P = 0.294; 87.6% for the SCT group vs. 78.9% for the non-SCT group in stage III, P = 0.528).

Figure 3
figure 3

Overall survivals according to the participation in the surgical clinical trials after propensity score weighting. (a) All stages, (b) Stage I, (c) Stage II, (d) Stage III.

Discussion

Before being weighted with propensity scores, patients who participated in prospective surgical clinical trials had comparable postoperative morbidity and superior overall survival compared to those who did not. After weighting with propensity scores, surgical outcomes, including postoperative morbidity and overall survival, showed no differences between patients enrolled in surgical clinical trials and those who did not.

Even surgeons familiar with prospective surgical clinical trials sometimes feel nervous when performing surgeries on patients registered for clinical trials. In particular, when operating on patients requiring additional procedures for research purposes in addition to standard surgical procedures, surgeons may experience tension and be overwhelmed by unfamiliar procedures. Even when surgical clinical trials are actively conducted, patient safety is of utmost importance. Therefore, whether participation in a prospective surgical clinical trial per se impacts surgical outcomes is a topic that needs to be clarified.

In recent decades, several prospective surgical clinical trials have been conducted worldwide. The Republic of Korea is one of the leading countries in gastric cancer-related surgery and research. Our institution, a major gastric cancer center in Korea, is committed to enrolling patients in several prospective trials. In this study, the number of patients enrolled in surgical clinical trials among all patients undergoing gastric cancer surgery increased over time, reaching nearly 25% (82/250) by 2020. Ten surgical clinical trials were performed during the five years of the study period. Of these, seven were RCTs and the other three were prospective non-randomized surgical trials. The prospective trials required additional procedures besides standard surgery, which may have made the surgeon feel nervous or overwhelmed. As a result, these cases were included in the SCT group.

In the unweighted populations, the proportion of patients who underwent surgical clinical trials in the entire cohort was over 18%. The patients in the SCT group had few known risk factors for postoperative complications. They had a younger age, lower ASA score, higher preoperative hemoglobin and albumin levels, smaller tumor size, and earlier tumor stages19,20,21. This reflects the bias of researchers who recommend surgical clinical trial participation for a relatively healthy and limited number of patients with far advanced gastric cancer. The perioperative outcomes of the SCT group in the unweighted population were similar to those of the non-SCT group, despite having fewer risk factors for postoperative complications. This suggests a possible worsening of perioperative outcomes in the SCT group; however, the two groups had similar outcomes in the weighted population. The overall survival in the SCT group was superior to that in the non-SCT group. However, this superiority may be due to discrepancies in clinicopathological features between the groups, and the task of matching the discrepancies was performed using the PSW method.

RCTs are the best method for analyzing causal relationships, but only a retrospective analysis was possible in this study because it was designed to classify patients according to their participation in prospective studies. Enrolling patients for this study prospectively meant that all patients were allocated to the SCT group. A good alternative for correcting the clinicopathological background of the two groups in a retrospective study was the use of propensity scores. We tried to add as many matching variables as possible to adjust for the two groups evenly and introduced the PSW method to minimize the number of patients who were excluded from the control group.

After PSW, the perioperative outcomes, including each complication type, were comparable between the groups, except for pancreatitis and renal complications. As the incidence of these complications was low, further validation is needed in a study targeting a larger number of patients. The overall survival of the SCT group, which was superior before PSW, was comparable to that of the non-SCT group after PSW. Prospective studies assume that participation in a surgical clinical trial has no significant impact on surgical outcomes, including operation time, intraoperative blood loss, complications, hospital stay, and oncological outcomes. However, to the best of our knowledge, this has not been validated in well-designed studies. This study is expected to encourage patients and surgeons to participate more actively in prospective studies by providing relief to those concerned about adversely affecting the surgical results through participation in a surgical clinical trial.

This study had several limitations. First, since our study was conducted at a single institution, selection bias is possible despite our efforts to reduce it using propensity weighting. Second, long-term recurrence- or relapse-free survival could not be analyzed because of the lack of recurrence-related data. Additional research is needed on adjuvant chemotherapy and recurrence, depending on the participation in clinical trials.

In conclusion, enrolling relatively healthy patients with early tumor lesions was associated with participation in surgical clinical trials. Performing PSW to match the clinicopathological features between the groups, participation in surgical clinical trials did not affect surgical outcomes, including postoperative complications and long-term overall survival. Participation in well-designed surgical clinical trials for medical advancement should be actively encouraged by patients and surgeons without worrying about surgical outcomes.