Introduction

Obstructive sleep apnea (OSA) is a prevalent condition characterized by repeated episodes of partial or complete obstruction of the upper airway during sleep, leading to disrupted sleep and intermittent hypoxia1,2. It is increasingly recognized as a public health concern owing to its association with various comorbidities, including cardiovascular diseases, metabolic syndrome, and neurological disorders1,2.

While substantial research has revealed a link between OSA and several health issues, the relationship between OSA and cancer, particularly lung cancer, remains debatable3. Lung cancer, a leading cause of cancer-related mortality worldwide, has multifactorial etiology involving genetic, environmental, and lifestyle factors4. Hypoxia, a common feature in the tumor microenvironment, is known to influence tumor growth, angiogenesis, and resistance to therapy5. Thus, the potential role of OSA, with its hallmark intermittent hypoxic episodes, has been suggested in the context of lung carcinogenesis.

Current literature regarding the associations between OSA and the risk of lung cancer is often limited by a low incidence of lung cancer. Most studies have used the incidence of all cancers as the outcome measure and showed subgroup analyses evaluating the associations with specific types of cancer6,7,8,9. However, subgroup analyses may suffer from a lack of statistical power if there is an insufficient number of events10. In addition, most studies on OSA and lung cancer have been conducted in Western countries. Therefore, we aimed to evaluate associations between OSA and the incidence of lung cancer based on the Korean National Health Information Database (NHID).

Methods

Study design and population

We utilized the NHID, a population database documenting health care utilization, health screening, socio-demographic variables, and mortality for the entire population of South Korea, developed and maintained by the National Health Insurance Service11. Our study included adults aged ≥ 19 years who were newly diagnosed with OSA between 2011 and 2018. OSA was defined as at least one claim under the International Classification of Diseases, 10th revision (ICD-10) codes for OSA (G47.33). For the control group, we selected approximately tenfold the number of age- and sex-matched adults from the NHID who had not been diagnosed with OSA. Individuals with a history of any cancer prior to the index date (the date of diagnosis of OSA or the date of the first claim between 2011 and 2018 in the NHID for the control group) were excluded. Individuals who were deceased or were diagnosed with lung cancer within 1 year after the index date were also excluded to minimize the potential effects of reverse causation. Subsequently, we conducted up to 1:5 propensity score matching by age and sex between the OSA and the control groups.

This study was conducted in accordance with the principles set out in the Declaration of Helsinki for medical research involving human subjects. Furthermore, the study was approved by the Institutional Review Board of Seoul National University Hospital (E-2111-022-1269). Informed consent was waived due to the retrospective nature of the study by the Institutional Review Board of Seoul National University Hospital.

Study outcome and variables

The study outcome was the incidence of newly diagnosed lung cancer during the follow-up period. We determined the development of lung cancer by ICD-10 codes (C33 or C34) and registration for a special health insurance co-payment calculation in the event of cancer, introduced by the Korean government to enhance the financial protection of cancer patients by expanding National Health Insurance benefit coverage to alleviate out-of-pocket expenses for cancer patients for 5 years from the date of registration. As the differential co-payments cover 95% of medical expenses12, the failure to identify cancer patients from the NHID is very rare. Patients were followed up from the index date to the date of diagnosis of lung cancer, death, or the end of the follow-up (December 31, 2020), whichever came first.

We collected the following data from the NHID: age, sex, quintile of income, comorbidities including hypertension (I10 to I13, I15), type 2 diabetes (E11 to E14), ischemic heart disease (I20 to I25), heart failure (I50), atrial fibrillation or flutter (I48), transient cerebral ischemic attack or cerebral infarction (G45, I63), chronic obstructive lung disease (COPD; J44), and depression (F32, F33). These comorbidities are defined by their respective ICD-10 codes.

Statistical analysis

Categorical variables are presented as counts and percentages, and continuous variables are presented as means with standard deviation (SD). Baseline characteristics between the OSA and control groups were compared using the independent samples t-test for continuous variables and the χ2 test for categorical variables. The incidence of lung cancer was calculated by dividing the number of events by the time at risk in person-years. To evaluate the association between OSA and risk of lung cancer, we used a Cox proportional hazards model, adjusting for potential confounding factors such as income and the presence of comorbidities. The risks for lung cancer are presented as hazard ratios (HRs) and 95% confidence intervals (CIs). We conducted subgroup analyses for the primary outcome, stratified by age and sex. Statistical significance was set at p < 0.05. We used SAS version 9.4 (SAS Institute, Cary, NC, USA) software for propensity score matching. Other statistical analyses were performed with R version 4.0.3 (www.r-project.org).

Ethical approval

This study was conducted in accordance with the principles set out in the Declaration of Helsinki for medical research involving human subjects. Furthermore, the study was approved by the Institutional Review Board of Seoul National University Hospital (E-2111-022-1269). Informed consent was waived due to the retrospective nature of the study by the Institutional Review Board of Seoul National University Hospital.

Results

Baseline characteristics

A total of 195,437 adult patients newly diagnosed with OSA between 2011 and 2018 were identified in the NHID. Among these, 14,366 were excluded using the following exclusion criteria: incomplete data, a history of any cancer before the index date, or a lung cancer diagnosis or death within 1 year after the index date. Among 2,113,982 adult patients who had not been diagnosed with OSA in the same period, 71,579 were excluded using the same exclusion criteria. After up to 1:5 propensity score matching based on age and sex, 181,070 patients in the OSA group and 905,283 in the control group were analyzed (Fig. 1). The mean age of the patients was 44.6 years, and 78.1% were male. Patients with OSA had a higher prevalence of comorbidities. However, the proportion of patients in the lowest quintile of income was higher in the control group (Table 1).

Figure 1
figure 1

Flow diagram of the study population. OSA, obstructive sleep apnea.

Table 1 Baseline characteristics (N = 1,086,353).

Lung cancer incidence

During follow-up over (mean ± SD) 9.1 ± 2.0 years, 2614 incident cases of lung cancer were identified among 1,086,353 patients. The mean time to lung cancer diagnosis was 6.3 ± 2.5 years from the index date. The incidence rate was 39.51 per 100,000 person-years in the OSA group, and 24.93 per 100,000 person-years in the control group (Table 1).

Figure 2 shows the unadjusted Kaplan–Meier curves of the incidence of lung cancer according to the presence of OSA. In an unadjusted model, patients with OSA had an increased risk of incident lung cancer (HR 2.33, 95% CI 2.09–2.60; Table 2). After adjusting for income and the presence of comorbidities including hypertension, type 2 diabetes, ischemic heart disease, heart failure, atrial fibrillation or flutter, stroke, chronic obstructive lung disease, and depression, this association remained significant (HR 1.95, 95% CI 1.74–2.18, p-value < 0.001; Table 2).

Figure 2
figure 2

Kaplan–Meier curves of the incidence of lung cancer according to the presence of OSA. OSA, obstructive sleep apnea.

Table 2 Incidence of lung cancer according to OSA.

The incidence rate in the female patients with OSA was 41.47 per 100,000 person-years, and that in the male patients with OSA was 38.96 per 100,000 person-years (Table 2). Kaplan–Meier curves of the incidence of lung cancer in male and female patients are shown in Supplementary Fig. 1. The adjusted HR for incident lung cancer was 2.14 (95% CI 1.69–2.70) in female patients with OSA, and 1.90 (95% CI 1.67–2.16) in male patients with OSA (Table 2). The risk of incident lung cancer increased with age (Supplementary Fig. 2). The adjusted HR for incident lung cancer was 1.73 (95% CI 0.92–3.25) in patients with OSA aged 19–39 years, 1.86 (95% CI 1.62–2.14) in those aged 40–64 years, and 2.99 (95% CI 2.46–3.64) in those aged ≥ 65 years (Table 2).

Discussion

In this nationwide population-based cohort study utilizing Korean NHID over a mean follow-up of 9 years, we found a significant association between OSA and an elevated risk of developing lung cancer, with OSA patients demonstrating an incidence rate of 39.51 per 100,000 person-years compared with 24.93 per 100,000 person-years in the age- and sex-matched control group. OSA remained a significant risk factor for lung cancer after adjusting for various potential confounders such as income and comorbid conditions.

There is growing evidence for the putative association between OSA and cancer; however, epidemiologic studies still show inconsistent findings. In a study of a large health insurance database in the United States, the all-cancer incidence was similar between patients with OSA (defined by ICD codes) and matched controls, whereas specific cancers such as kidney, pancreas, and melanoma showed increased risks in patients with OSA13. Conversely, risks for some other cancers, including colorectal and prostate, were lower. Recent data from a large Australian sleep clinic cohort showed that the association between OSA (defined by apnea–hypopnea index [AHI]) and all-cancer incidence was lost after adjusting for confounders such as age, sex, body mass index, smoking status, socioeconomic status, and blood pressure. Similarly, nocturnal hypoxemia, defined as sleep time spent with an oxygen saturation < 90% (T90), was not an independent risk factor for all-cancer incidence14. However, according to data from a large multicenter French cohort, T90, but not AHI, was associated with all-cancer incidence after adjusting for confounders15. Nevertheless, the association with OSA can be heterogeneous across different cancer sites. Considering that there are specific risk factors and pathophysiology associated with each type of cancer16, it is crucial to reveal the association between OSA and specific cancer sites.

There is limited evidence on the associations between OSA and the risk of site-specific cancer16,17. Most studies have focused on all-cancer incidence as the main outcome while conducting subgroup analyses to investigate the associations with certain types of cancer6; however, the statistical power of subgroup analyses may be inadequate when the number of events is low10. A subgroup analysis of a health insurance database study in the United States showed that the incidence of lung cancer did not increase in individuals with OSA after adjustment for comorbidities13. This study was also limited by a relatively short mean follow-up period of less than 4 years. Similarly, another subgroup analysis of an Australian sleep clinic cohort study did not show any association between incident lung cancer and the presence of OSA or nocturnal hypoxemia14. However, in a subgroup analysis of a multicenter French cohort study, nocturnal hypoxemia was found to be associated with the risk of lung cancer15. Furthermore, in a recent meta-analysis of 4,885,518 individuals, patients with OSA had an approximately 25% higher risk of lung cancer compared with those without OSA3. To reduce heterogeneity, a sensitivity analysis of three studies, each with more than 5 years of follow-up, was performed. It showed that patients with OSA had a 32% higher incidence of lung cancer3.

Our nationwide population-based cohort study found 2614 incident cases of lung cancer among 1,086,353 patients over a mean follow-up of 9 years. To reduce the possibility of reverse causation, we excluded incident cases diagnosed in the first year of follow-up. As a result, OSA was an independent risk factor for lung cancer incidence after adjusting for demographics and comorbidities. Our study's adjusted HR for incident lung cancer was 1.95, which was higher than that found in the previous meta-analysis3. Differences in effect size could be explained by variations in race and follow-up duration. The previous meta-analysis included studies conducted in Western countries with relatively shorter follow-up durations. Furthermore, we found that this association was observed across both sexes and different age groups and was more prominent in women and older people. Further studies are needed to determine whether OSA may differentially affect the incidence of lung cancer in different sex and age groups.

While the exact mechanisms linking OSA and lung cancer remain to be fully elucidated, the evidence suggests that intermittent hypoxia—a hallmark feature of OSA—may play a crucial role in tumorigenesis10,18. Intermittent hypoxia can lead to oxidative stress, DNA damage, and systemic inflammation, creating an environment conducive to cancer development and progression via hypoxia-inducible factor5,19. Additionally, OSA-related sleep fragmentation may further exacerbate carcinogenesis by increasing sympathetic activation, systemic inflammation, and immune dysregulation3,20.

Our study has some limitations. First, we defined OSA by ICD codes without conducting sleep studies such as polysomnography, which may have introduced misclassification bias. Second, as we determined the development of lung cancer by ICD codes, and since ICD codes do not reflect pathologic subtypes of lung cancer such as adenocarcinoma, squamous cell carcinoma, and small cell lung cancer, we cannot evaluate the associations between OSA and the incidence of each subtype of lung cancer. Third, we did not measure intermittent hypoxemia or sleep fragmentation, making it difficult to evaluate any pathophysiologic link between OSA and lung cancer. Fourth, although we adjusted for demographics and comorbidities, residual confounding such as smoking, alcohol consumption, physical activity, obesity, and treatment for OSA may exist.

In conclusion, this nationwide study showed an independent association between OSA and an increased risk of lung cancer. This association between OSA and lung cancer incidence was consistent among both sexes and various age groups, with a stronger effect seen in women and older adults. These findings should be explored in further studies for a better understanding of the relationship between OSA and lung cancer, which will guide tailored prevention and management.