Identification of phenomic data in the pathogenesis of cancers of the gastrointestinal (GI) tract in the UK biobank

Tan, Shirin Hui; Guan, Catherina Anak; Bujang, Mohamad Adam; Lai, Wei Hong; Voon, Pei Jye; Sim, Edmund Ui Hang

doi:10.1038/s41598-024-52421-9

Download PDF

Article
Open access
Published: 23 January 2024

Identification of phenomic data in the pathogenesis of cancers of the gastrointestinal (GI) tract in the UK biobank

Shirin Hui Tan ORCID: orcid.org/0000-0002-4556-4980^1,2,
Catherina Anak Guan¹,
Mohamad Adam Bujang¹,
Wei Hong Lai¹,
Pei Jye Voon³ &
…
Edmund Ui Hang Sim²

Scientific Reports volume 14, Article number: 1997 (2024) Cite this article

540 Accesses
Metrics details

Subjects

Abstract

Gastrointestinal (GI) cancers account for a significant incidence and mortality rates of cancers globally. Utilization of a phenomic data approach allows researchers to reveal the mechanisms and molecular pathogenesis of these conditions. We aimed to investigate the association between the phenomic features and GI cancers in a large cohort study. We included 502,369 subjects aged 37–73 years in the UK Biobank recruited since 2006, followed until the date of the first cancer diagnosis, date of death, or the end of follow-up on December 31st, 2016, whichever occurred first. Socio-demographic factors, blood chemistry, anthropometric measurements and lifestyle factors of participants collected at baseline assessment were analysed. Unvariable and multivariable logistic regression were conducted to determine the significant risk factors for the outcomes of interest, based on the odds ratio (OR) and 95% confidence intervals (CI). The analysis included a total of 441,141 participants, of which 7952 (1.8%) were incident GI cancer cases and 433,189 were healthy controls. A marker, cystatin C was associated with total and each gastrointestinal cancer (adjusted OR 2.43; 95% CI 2.23–2.64). In this cohort, compared to Asians, the Whites appeared to have a higher risk of developing gastrointestinal cancers. Several other factors were associated with distinct GI cancers. Cystatin C and race appear to be important features in GI cancers, suggesting some overlap in the molecular pathogenesis of GI cancers. Given the small proportion of Asians within the UK Biobank, the association between race and GI cancers requires further confirmation.

Identifying proteomic risk factors for cancer using prospective and exome analyses of 1463 circulating proteins and risk of 19 cancers in the UK Biobank

Article Open access 15 May 2024

3D genomic mapping reveals multifocality of human pancreatic precancers

Article 01 May 2024

Identifying therapeutic targets for cancer among 2074 circulating proteins and risk of nine cancers

Article Open access 29 April 2024

Introduction

The prevalence of gastrointestinal (GI) tract cancers are a significant public health issue worldwide, given their substantial contribution to the overall incidence and mortality rates of cancer on a global scale. GI tract cancers accounted for 26.3% of the total 4.8 million cancer cases and 35.3% of the 3.4 million cancer-related deaths in the year 2018¹; encompassing the oesophagus, stomach, liver, pancreas, small intestine, colon, and rectum, are commonly referred to as GI tract cancers². In addition, Arnold et al.³ reported that the predominant GI malignancies include colorectal cancer (10.2%), stomach cancer (5.7%), liver cancer (4.7%), oesophageal cancer (3.2%), and pancreatic cancer (2.5%). GI tract cancers present with varying clinical characteristics and risk factors, intimately linked with lifestyle decisions and pre-cancerous ailments³; inclusive phenotypic information across multiple levels, encompassing clinical, molecular, and cellular aspects.

Phenotypic data, an integral component of phenomics, provides a comprehensive understanding of observable traits and characteristics, contributing to a holistic analysis of biological systems; whilst phenomics involves quantifying the phenome, a set of observable characteristics encompassing physical, chemical, and biological traits in individuals and populations^4,5. These traits arise from intricate interplays among genetic factors, environmental conditions, dietary influences, and symbiotic microorganisms⁶. Phenomic studies, in contrast to traditional biomedical research, exhibit unique features such as meticulous standardization in measurements, data management and analysis; relying on extensive big data, encompassing multi-dimensional and well-organized datasets⁷. In the domain of cancer, comprehending the entire spectrum of phenotypic irregularities associated with malignancies, including the intricate details revealed by biomarkers, is imperative for advancing our knowledge of carcinogenesis, especially the mechanisms underlying the relationships among phenome, genome and environmental impact.

Following the completion of the Human Genome Project, comprehensive explorations into the human phenome have become crucial, forming a foundational framework for deciphering the intricacies of human health codes, especially the complex relationships among the phenome, genome and environmental influences⁷. Previous research has documented the use of phenomic data to investigate the correlation between environmental factors, such as diet and lifestyle, and the development of various cancers^8,9,10,11,12. Furthermore, different toolkits associated with cancer phenomics were examined^13,14,15,16. In the realm of cancer research, there is currently a limited body of knowledge regarding the exploration of diverse cancer types using phenomics data sourced from extensive datasets that encompass comprehensive clinical information. This scarcity is particularly evident when considering GI cancer, where the understanding of the complex interplay between phenotypic characteristics and the underlying molecular mechanisms remains relatively under-explored. Given this research gap, the UK Biobank emerges as an unparalleled resource, presenting an extraordinary opportunity to address the gap in our understanding of the involvement of phenomic data in GI cancer research.

The UK Biobank is a prospective cohort study of considerable magnitude that has enlisted more than 500,000 individuals between the age of 40 and 69 years from various regions of the UK during the period of 2006 to 2010. The extensive sample size and comprehensive data collection of phenotypic and genotypic data enable the examination of intricate associations between socio-demographic factors, blood chemistry, anthropometric measurements, and lifestyle of participants, thereby facilitating the development of more efficacious prevention and treatment approaches^17,18. Within the national cancer registry, UK Biobank participants have contributed to a large accumulation of data comprising over 43,000 newly reported cancer cases up to the present. The UK Biobank is uniquely equipped to facilitate research into the factors that contribute to the onset of disease. It facilitates the identification of risk factors that increase or decrease the likelihood of developing specific diseases, as well as the precise quantification of these associations' magnitude. In addition, the substantial diversity observed in the intensity of these associations across various demographic, socioeconomic, and lifestyle characteristics provides an opportunity to assess the applicability of these associations to substantial subgroups of the population^19,20.

Growing evidence highlights the importance and pressing need for utilizing phenomics in the examination of diseases^21,22,23,24. Given the limited research on GI phenomics, this study was undertaken to explore the correlation between GI cancers and phenomic characteristics within the UK Biobank cohort. The UK Biobank offers a comprehensive range of socio-demographic, anthropometric, and biological markers, including blood and urine biomarkers, making it a valuable resource for this investigation. It is anticipated that the outcomes of this study will contribute to a more profound understanding of the multi-omics composition of patients, complemented by clinical data. This understanding, in turn, is expected to facilitate the identification of diagnostic, prognostic, and predictive biomarkers. Furthermore, the insights gained from this research endeavor hold the potential to unveil effective pathways for the personalized treatment of a diverse range of targeted diseases.

Methods

Study design and participants

The UK Biobank is a prospective cohort study with the aim of investigating how various diseases are caused by genetic, environmental, and lifestyle factors. Every participant in the UK Biobank provided informed consent upon enrolment, granting permission for the sharing of anonymized data with authorized researchers. Participants retained the right to withdraw their consent for data sharing at any point during their participation. All participants were registered with the National Health Service (NHS) in the United Kingdom. Participants completed a self-administered touchscreen questionnaire regarding their sociodemographics, lifestyle behaviours, medical history, and medication use during the initial recruitment session. They also underwent physical measurements such as weight, height, waist circumference, and hip circumference. Detailed information of the UK Biobank has been reported previously²⁵. Since its establishment in 2006, a total of 502,369 subjects aged 37–73 years were recruited between 2006 and 2010 and followed up since then¹⁹ until the date of the first cancer diagnosis, date of death, or the end of follow-up on December 31st, 2016, whichever occurred first. Access to the UK Biobank data was applied and approved (Application number 96759). This study was also approved by the Malaysia Medical Research and Ethics Committee (NMRR ID-23-00931-SPO).

Figure 1 illustrated the flow diagram for exclusion and inclusion in this study. After taking into consideration the exclusion criteria, we included 433,189 controls for our analyses. Controls were participants who did not have a record of ever being diagnosed with cancer according to the 10th Revision of the International Classification of Diseases (ICD-10). As for the cases, we included incident cancer cases who had GI cancers as coded using the ICD-10. GI cancers referred in this study included C15 oesophageal cancer, C16 gastric cancer, C17 small intestine cancer, C18 colon cancer, C19 rectosigmoid junction cancer, C20 cancer of rectum, C21 cancer of anus and anal canal, C22 liver cancer, C23 gallbladder cancer, C24 cancer of other and unspecified parts of biliary tract, C25 pancreatic cancer and C26 cancer of other digestive organs. We excluded participants with any GI cancers diagnosed within two years from recruitment (n = 55,340) to account for reverse causation, and those with missing date of cancer diagnosis (n = 7888). Finally, we included 7952 participants who had GI tract cancers as coded using the ICD-10.

Phenomic analysis

Sociodemographic characteristics (gender, age, and race) and lifestyle factors (smoking, alcohol drinking and physical activity) were collected during baseline assessment. Smoking status and alcohol drinking status were categorized as Never, Previous or Current smoker or alcohol drinker respectively, as recorded in the UK Biobank and reported previously^26,27.Townsend deprivation index score which reflected the socioeconomic status were calculated for each participant based on the postcodes of residence. Age was calculated based on age from date of birth and baseline assessment visit. Physical activity was collected based on number of days that the participants had moderate or vigorous activity for at least 10 min. As part of the research interest is to investigate the differences between race in cancer occurrence, we also explored race as categorized by White (British, Irish, White and any other White background) versus Asians (Chinese, Indian, Pakistani, Bangladeshi and any other Asian background).

Anthropometric measurements including height, body weight, waist circumference and hip circumference were taken by trained nurses during the baseline assessment visit²⁸. Body mass index (BMI) was calculated as weight/height². We then categorized the BMI based on the WHO BMI classification²⁹.

Biological markers were obtained from serum and urine samples. Efforts were put in by the UK Biobank to minimise systematic and random errors in the biomarker assays, including blood and urine samples analyses^30,31. We included biomarkers related to glucose control (glycated hemoglobin (HbA1c)), cardiovascular health (Apolipoprotein A, Apolipoprotein B, C-reactive protein (CRP), lipoprotein(a), high density lipoprotein (HDL) cholesterol, low density lipoprotein (LDL) cholesterol, triglyceride, total cholesterol), renal profile (creatinine, cystatin C, total protein, urate and urea), liver profile (alanine transferase (ALT), alkaline phosphatase (ALP), aspartate transferase (AST), gamma-glutamyl transferase (GGT), albumin, total bilirubin, direct bilirubin), hematological parameters (basophil count, eosinophil count, erythrocyte count, hemoglobin concentration, leukocyte count, lymphocyte count, monocyte count, neutrophil count, platelet count), hormones (insulin like growth factor 1 (IGF-1), oestradiol, testosterone, sex hormone binding globulin (SHBG)), and bone-related markers (ionized calcium, phosphate, 25-hydroxyvitamin D (25(OH)D) and rheumatoid factor).

Blood and/or urine samples were available for all the participants. Participants without any of the blood and urine biomarkers available were not included in this study. However, there were missing data for the variables included in this study. No imputation was performed for the missing data. For the analysis of each individual parameter, participants with missing data were excluded. Due to the large number of variables involved, information on the missing data is available upon request.

Statistical analyses

This study aimed to determine the risk factors for GI cancers. The risk factors explored consisted of 59 parameters as mentioned previously. The outcome was defined as the diagnosis of incident total GI cancers (based on ICD-10 C15-C26) and each individual diagnosis of GI cancers. Descriptive statistics were used to describe the characteristics of these variables based on each individual GI cancers, total GI cancers and healthy controls (Table 1).

Table 1 Baseline characteristics of the study population in the UK Biobank.

Full size table

We employed univariable and multivariable logistic regression analyses of phenomic features against outcomes of interest (first/initial diagnoses of disease) in this study, similar to studies conducted by Gausman et al.³² and Kang et al.³³. Initially, univariable analysis using logistic regression was applied to determine the significant risk factors for the outcomes of interest. The Benjamini–Hochberg correction was implemented to control the False Discovery Rate (FDR) and mitigate the risk of false positives. The Benjamini–Hochberg correction is a widely accepted method for controlling the FDR, offering a more balanced approach than the Bonferroni correction. Unlike the Bonferroni correction, which is known for its conservative nature and increased likelihood of false negatives, the Benjamini–Hochberg procedure allows for a more nuanced control of the error rate. By controlling the FDR instead of the Family-Wise Error Rate (FWER), the BH procedure provides a good balance between identifying true positives and limiting false positives.

Since the cohort data analysed was large, besides relying on the p-value, odds ratios (OR) of more than 2.0 and 1.5 for categorical and numerical variables respectively was fixed to screen the significant and important risk factors³⁴. Univariable logistic regression results for all variables can be found in Supplementary Table 1. All the significant variables in the univariable logistic regression for each GI cancers and total GI cancers were listed in Table 2. Next, multivariable logistic regression based on the variables identified in Table 2 were conducted. One important criterion for model selection in logistic regression is the assumption that the independent variables should not correlate with each other. Therefore, variable selection in multivariable logistic regression was performed in a way that the variables of the same category will be chosen based on the variable with the highest odds ratio in univariable logistic regression. The odds ratio with respective 95% confidence interval and p-values for each variable were reported in Table 3. The analyses was conducted without prior considerations of potential causal pathway, stratification based on socio-demographic and lifestyle factors to avoid analysis bias.

Table 2 Factors associated with GI cancers in the UKB cohort based on univariable logistic regression.

Full size table

Table 3 Factors associated with total GI cancers and top five GI cancers in the UK Biobank cohort (using multiple logistic regression).

Full size table

Data extraction and processing was conducted on the UK Biobank Research Analysis Platform (RAP) through Jupyterlab and Jamovi³⁵. All analyses were carried out using Jamovi³⁵.

Ethics approval

The UK Biobank received ethical approval from the National Information Governance Board for Health and Social Care and the National Health Service North West Centre for Research Ethics Committee. The study was conducted in accordance with the Declaration of Helsinki. This study was also approved by the Malaysia Medical Research and Ethics Committee (NMRR ID-23-00931-SPO).

Consent to participate

All participants provided written informed consent prior to recruitment.

Results

The analysis included a total of 441,141 participants, of which 7952 (1.8%) were incident GI cancer cases and 433,189 were healthy controls. Among the 7952 participants with GI cancers, there were 11,563 total GI cancers recorded. A subset of participants with GI cancers and cancer(s) other than GI cancers (1468 participants) were not included in the subsequent logistic regression analysis.

Table 1 shows the characteristics of healthy controls, participants who developed GI cancers by total GI cancers and top 5 GI cancers, in the order of colorectal cancer (n = 5436), pancreatic cancer (n = 1286), oesophageal cancer (n = 1146), gastric cancer (n = 878) and liver cancer (n = 755). Comparing with the control group, the GI cancer group was older when they were recruited, consisted of more males, had higher BMI, and a higher proportion of participants being current or previous smokers. Participants with GI cancer were followed up for an average of 7.5 years before they were diagnosed with GI cancer, while healthy controls were followed up for a mean duration of 8.5 years.

Table 1 illustrates the significant variables that are associated with each GI cancer and total GI cancers, based on univariable logistic regression. Race and cystatin C were significantly associated with total GI cancers. Cystatin C was also significantly associated across each type of GI cancer whereas race is associated with all GI cancers except gastric cancer. There seemed to be gender differences in oesophageal, gastric and liver cancers, with men having higher risk in developing these cancers. Lifestyle factors (alcohol drinking and smoking status) were also associated with some GI cancers (oesophageal cancer for both factors and liver cancer for smoking status). Anthropometric measurement (body mass index classification) was also associated with oesophageal, gastric and liver cancers.

In terms of biochemical markers, several cardiovascular health markers (apolipoproteins A and B, HDL cholesterol, LDL cholesterol) on top of ionized calcium and phosphate were associated with some GI cancers. There were also hematological markers, including basophil, eosinophil, erythrocyte and monocyte that are found to be associated with some GI cancers.

These variables that were significantly correlated to GI cancers based on univariable logistic regression were further analysed in multivariable logistic regression (Table 3). Against total GI cancers, an increase of 1 mg/L cystatin C multiplied the odds of getting total GI cancers by 2.43 times. Analysis also found that compared to Asians, participants of White race had a 2.22 times higher risk of getting GI cancers.

When we looked at each individual GI cancer, cystatin C remained significantly associated with the cancers with an adjusted odds ratio of at least 1.97. Cystatin C and White participants had a higher risk of getting colorectal cancers (adjusted OR of 2.11 and 2.54 respectively), which was the top one GI cancer in the UK Biobank cohort. Participants with a diagnosis of pancreatic cancer are associated with higher cystatin C (adjusted OR 2.15), eosinophil count (adjusted OR 1.41) and White ancestry (adjusted OR 2.51 compared to Asians). For oesophageal cancer, besides cystatin C and of White race, lower Apolipoprotein A1, higher monocyte count and lower ionized serum calcium were associated with higher risk of getting oesophageal cancer. Compared to normal weight participants, those who were underweight (adjusted OR 2.75), overweight (adjusted OR 1.44) and obese (adjusted OR 1.84) were associated with oesophageal cancer. Previous and current smokers were also found have higher risk (adjusted OR 1.82 and 2.77 respectively) of getting oesophageal cancer. On the other hand, gastric cancer was associated with male subjects, those with BMI other than normal weight, lower HDL cholesterol and higher monocyte count. Lastly, in addition to cystatin C, apolipoprotein B, phosphate, monocyte count, males and those of smoking history were associated with liver cancer.

Discussion

In this large UK Biobank cohort study of a total of 7952 incident GI cancer cases, we aimed to investigate the associations between the phenomic features and GI cancers to better understand the molecular pathogenesis of GI tract cancers. The analysis included a total of 441,141 participants in the study, of whom 7952 (1.8%) were incident cases of GI cancer and 433,189 were healthy controls. The results demonstrated significant associations between certain variables and different types of GI cancers, providing valuable insights into the risk factors and potential biomarkers associated with these cancers. The characteristics of the GI cancer group were substantially different from those of the control group, with the GI cancer group being older, predominantly male, having a higher BMI, and containing a greater proportion of current or former smokers.

The distribution of the top five GI cancers observed in this UK Biobank cohort was found to be consistent with global trends^3,36. Colorectal cancer (47.01% of the total GI cancer cases) emerged as the most common GI cancer, followed by pancreatic cancer (11.12%), oesophageal cancer (9.91%), gastric cancer (7.59%), and liver cancer (6.50%). This pattern aligns with previous studies and reflects the epidemiology of GI cancers on a global scale, indicating the generalizability of the findings from this cohort to broader populations. As the top five GI cancers in the UK Biobank cohort represented 82% of the entire GI cases, subset analysis focuses on the five out of nine major GI cancer categories.

One of the notable findings in this study was the consistent association of cystatin C and race with each type of GI cancer. Cystatin C, a biomarker related to kidney function^37,38, was found to be consistently raised and associated with all GI cancers in this cohort. Participants with higher cystatin C levels exhibited an increased risk of developing GI cancers, suggesting its potential as a prognostic biomarker. This finding is corroborated in previous literature^39,40,41,42. Cystatin C exerts a series of complex effects that may result in either an inhibition or a promotion of tumour cell growth and dissemination, as demonstrated by previous research^39,40. A recent study discovered a novel mechanism of mast cells inducing endoplasmic reticulum stress in which Cystatin C mediates tumor inhibition during colorectal cancer development⁴¹ This function of Cystatin C in cancer cells has never been reported and may lead researchers one step closer to understanding the molecular pathogenesis of GI cancers in relation to cystatin C.

Additionally, race was found to be significantly associated with total GI cancers, with Whites having a higher risk compared to Asians. The influence of race is also evident in subsets colorectal, pancreatic and oesophageal cancers in this study. Epidemiological studies have examined the association between race, specifically White and Asian populations, and gastrointestinal malignancies, including colorectal, pancreatic, esophageal, gastric, and liver cancer^{3,43,44,45,46}. Similarly, results showed that gender played a role in the difference in GI cancer incidence, particularly gastric and liver cancers, males having 2.8, 2.4 and 1.7 times more likely to get the cancers respectively. This finding is in line with current literature^45,47,48. While acknowledging the limited representation of Asians in the UK Biobank cohort, the study emphasizes that phenotypic feature identification is its main goal in relation to GI malignancies. Importantly, the study emphasized that the relatively small number of Asians in the cohort should not undermine the robustness of the scientific inferences drawn regarding associations between exposures and health conditions.

In addition to sociodemographic characteristics, lifestyle factor particularly smoking status was proven to be associated with certain GI cancers, including liver and oesophageal cancers. Smoking status still remained a significant factor in the multivariable logistic regression analysis for liver cancer. Interestingly, exposure to smoking (including those who had stopped smoking) consistently increased the risk of developing GI cancers. This is supported and demonstrated in other studies as well^45,47,49. Cancer incidence and mortality rate variations are influenced by several factors, including genetic, environmental, lifestyle, and socioeconomic variables^43,44,45,50.

Anthropometric measurement (body mass index classification) showed associations with oesophageal and gastric cancers. In line with the work of other researchers, we demonstrated U-shape relationship between BMI and the three cancers^51,52,53. This abundant evidence of excess body weight over the past few decades indicates an emphasis on lipid metabolism and mechanisms involved in malignancies^{26,51,52,53,54}. As demonstrated in this study, apolipoprotein A1, apolipoprotein B and HDL cholesterol were associated with oesophageal, gastric and liver cancers. Studies have suggested that apolipoproteins play critical roles in malignancies including GI cancers. Low apolipoprotein A1 level is linked to a high cancer risk, systemic inflammatory response and poorer survival in some cancers, including oesophageal squamous cell carcinoma^55,56,57,58. This is in accordance with our study findings. Apolipoprotein A1 is a protein component of HDL cholesterol. Similar to apolipoprotein A1, HDL cholesterol is inversely associated with cancers, as demonstrated in the subset gastric cancer in this study. One of the proposed mechanisms of the opposing role in tumorigenesis of HDL cholesterol is its modulation of cell cycle entry and apoptosis through the mitogen-activated protein kinase-dependent (MAPK) pathway⁵⁹. A Korean cross-sectional study also reported the association between reduced HDL/apolipoprotein A1 levels and an increased risk of colorectal cancer⁶⁰. Emerging evidence suggests that the apolipoprotein A1/HDL axis, involved in lipid metabolism, is dysregulated in cancer. mRNA levels of apolipoprotein A1 were lower in hepatocellular carcinoma compared to normal liver tissue, the primary source of apolipoprotein A1, as determined by Oncomine database microarray data⁶¹. In hepatocellular carcinoma, the mechanisms underlying the transcriptional repression of apolipoprotein A1 remain obscure.

However, this result is consistent with previous reports of decreased apolipoprotein A1 protein levels in malignant liver tissue and hepatocellular carcinoma patient serum^62,63. The decrease in apolipoprotein A1 transcription, intracellular and secreted apolipoprotein A1, and circulating HDL levels in hepatocellular carcinoma suggests that this pathway may have a tumor-suppressing function⁶¹. Several studies have discovered associations between serum apolipoprotein A1/HDL levels and various aspects of the natural progression of various cancer types^56,59,64,65. Consistent with the study findings, high apolipoprotein B level was suggested as a risk factor for liver cancer; it is associated with poorer survival post surgery and a larger tumour size⁶⁶. More in-depth exploration of the genetic information of apolipoproteins may indicate liver malignancy and thus should be further researched on. Mutations of apolipoprotein B is reported to account for almost 10% of all genetic mutations⁶⁶. Specifically, a non-oncogenetic mutation of apolipoprotein B is observed, which can result in apolipoprotein B inactivation and is associated with the overexpression of oncogenic regulators and the downregulation of tumour suppressors, resulting in poorer survival outcomes. It is hypothesised that mutations that render apolipoprotein B inactive are preferred in tumorigenesis in order to provide more energy for cancer metabolism^55,65.

Multivariable logistic regression demonstrated that ionized serum calcium level was inversely associated with the risk of oesophageal cancer (adjusted OR = 0.37, 95% CI 0.18–0.74; p-value = 0.005). This is in line with studies that established the significance of calcium intake, in particular, as a potential effect modifier of the association between calcium and diseases including GI tract neoplasia^67,68,69. Increasing dietary calcium intake was associated with lower risk of oesophageal cancer^67,68,69. There seems to be inconsistent findings on the relationship between serum calcium and risk of cancer in current literature. The Swedish AMORIS study exploring GI cancers specifically oesophageal, stomach and CRC cancers, showed positive association between albumin-adjusted serum calcium and risk of these GI cancers⁷⁰. Nevertheless, a study exploring two large European prospective cohorts (including the UK Biobank) corroborated our study findings on ionized serum calcium level and risk of liver and colorectal cancer⁷¹.

The different direction of the association between the UK Biobank and EPIC cohorts, and the AMORIS study was attributed to differences in study design and the degree of adjustment for confounding variables⁷¹. It is worthwhile to discuss on this study’s focus on serum calcium measurement rather than dietary calcium intake. Serum calcium indicates extracellular calcium homeostasis and is mainly regulated by vitamin D and parathyroid hormone. Consequently, abnormalities in serum calcium level may reflect an error in its regulation pathways instead of dietary calcium deficiency. This may result in distinct associations between calcium in the diet and serum and carcinogenesis^70,71,72. Besides calcium, phosphate is also found to be inversely associated with liver cancer (adjusted OR = 0.36; 95% CI 0.22–0.58; p-value = 0.001). There is little research on phosphate and cancers, with inconsistent trends among the studies and/or cancers^54,73,74. It is accepted that altered levels of phosphate have been linked to the onset of cancer, but with uncertainties on the pathophysiology behind it. More in-depth studies are warranted to better understand the positive and inverse correlation observed between calcium and phosphate levels, and the risk of cancers. This will shed light on the involvement of calcium and phosphate metabolism, and potentially related important hormonal factors and cancer.

Additionally, hematological markers including monocyte and eosinophils were related to some GI cancers. Monocytes and eosinophils are a type of white blood cell. Interestingly, there are scarce research on the association of eosinophils and monocytes in GI cancer. Despite that, the value of immune-related markers in cancers are acknowledged. Previous studies focused mainly on pre-operative values of these circulating cells, however, changes in the immune profile may occur months or years prior to cancer diagnosis due to its role in the etiopathegenesis of tumours⁷⁵. White blood cells were previously found to be associated with increased risk of colorectal, lung and breast cancer⁷⁶. Preclinical data showed that eosinophils have both pro-tumorigenic and anti-tumorigenic properties, via direct and indirect mechanisms. This varying outcomes in different studies imply that the role of eosinophils and their mediators may differ depending on the cancer type^77,78,79.

These findings provide valuable insights into the associations between various factors and GI cancers within the UK Biobank cohort. The identification of significant associations contribute to our understanding of the underlying mechanisms and risk factors involved in the development of GI cancers. The consistent association of cystatin C with different types of GI cancers suggest its potential as a promising biomarker for early detection and risk stratification. The findings from this study will guide our subsequent way forward to explore the whole exome sequencing data in GI cancers within the UK Biobank. This will promote a multi-omic methodology to help characterize GI cancers and associated phenomic features. Specifically, variants within the exome region of the genome, which is responsible for encoding proteins, can serve as valuable indicators for the identification of genetic variants that are highly relevant to drug discovery⁸⁰.

Notable strengths of this study include its prospective study design involving a large sample size, a lengthy follow-up period and evaluation of a comprehensive list of covariates. In addition, all biochemistry markers were measured using well-established and validated methods, ensuring accuracy and reliability throughout the study. This study, is however, not without its limitations. Despite UK Biobank not being suitable for determining universally applicable rates of disease prevalence and incidence, its substantial size and diverse exposure measures allow for valid scientific inferences on associations between exposures and health conditions. Such assessments can be widely generalizable and do not necessitate participants to be representative of the population at large^19,81,82. In addition, this study focusses on the phenomic data involved on the pathogenesis of GI cancers, with aim to identify the potential phenomic feature(s) associated with the pathogenesis of GI cancers, and not to associate with incidence rate. Although the number is small, this is a cross sectional analyses of UK Biobank data, which still represent the largest database at present, and present findings are in accordance with previous studies looking into different health outcomes and their associations with race. In addition, the study relied on self-reported lifestyle data, which introduces the possibility of recall bias. To validate and expand upon these findings, additional research with diverse populations and rigorous data acquisition techniques is required. Besides, in terms of study data, no information was available regarding potential confounding variables such as vitamin D and/or calcium supplementation. Furthermore, we were unable to explain the effect of dietary calcium on gastrointestinal carcinogenesis, as suggested by biological studies (Supplementary Table 1).

In conclusion, this study identified several significant associations between various factors and GI cancers using the UK Biobank cohort. A marker Cystatin C emerged as a consistent biomarker associated with different types of GI cancers. Given the small proportion of Asians within the UK Biobank, the association between race and GI cancers requires further confirmation. The findings provide valuable insights into the potential diagnostic and therapeutic targets for GI cancers, emphasizing the importance of personalized approaches in cancer prevention, early detection, and treatment strategies. In order to provide more in-depth understanding of how these factors were associated with GI cancers and shed light on the molecular pathogenesis of GI cancers, future research should employ a multi-modal approach exploring the genomics and proteomics of the UK Biobank cohort. This will allow validation of the study findings and enhance understanding on the underlying mechanisms linking these factors to GI cancer development.

Data availability

The datasets generated during and/or analysed during the current study are available to bona fide researchers and can apply for access to the UK Biobank data at https://www.ukbiobank.ac.uk/enable-your-research/apply-for-access.

References

Bray, F. et al. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 68, 394–424 (2018).
Article PubMed Google Scholar
Ferlay, J. et al. Estimating the global cancer incidence and mortality in 2018: GLOBOCAN sources and methods. Int. J. Cancer 144, 1941–1953 (2019).
Article CAS PubMed Google Scholar
Arnold, M. et al. Global burden of 5 major types of gastrointestinal cancer. Gastroenterology 159, 335–349 (2020).
Article PubMed Google Scholar
Bilder, R. M. et al. Cognitive ontologies for neuropsychiatric phenomics research. Cogn. Neuropsychiatry 14, 419–450 (2009).
Article PubMed PubMed Central Google Scholar
Zbuk, K. M. & Eng, C. Cancer phenomics: RET and PTEN as illustrative models. Nat. Rev. Cancer 7, 35–45 (2007).
Article CAS PubMed Google Scholar
Jin, L. Welcome to the phenomics journal. Phenomics 1, 1–2 (2021).
Article PubMed PubMed Central Google Scholar
Ying, W. Phenomic studies on diseases: Potential and challenges. Phenomics 3, 285–299 (2023).
Article PubMed PubMed Central Google Scholar
Campbell, F. C. et al. Mechanistic insights into colorectal cancer phenomics from fundamental and organotypic model studies. Am. J. Pathol. 188, 1936–1948 (2018).
Article PubMed Google Scholar
Cho, J. et al. Bridging genomics and phenomics of gastric carcinoma. Int. J. Cancer 145, 2407–2417 (2019).
Article CAS PubMed Google Scholar
Frank-Raue, K., Rondot, S. & Raue, F. Molecular genetics and phenomics of RET mutations: Impact on prognosis of MTC. Mol. Cell. Endocrinol. 322, 2–7 (2010).
Article CAS PubMed Google Scholar
Karagulle, M., Fidan, E., Kavgaci, H. & Ozdemir, F. The effects of environmental and dietary factors on the development of gastric cancer. J. BUON 19, 1076–1082 (2014).
PubMed Google Scholar
Stebbing, J. et al. Comparison of phenomics and cfDNA in a large breast screening population: The Breast Screening and Monitoring Study (BSMS). Oncogene 42, 825–832 (2023).
Article CAS PubMed PubMed Central Google Scholar
Davatzikos, C. et al. Cancer imaging phenomics toolkit: Quantitative imaging analytics for precision diagnostics and predictive modeling of clinical outcome. J. Med. Imaging 5, 011018 (2018).
Article Google Scholar
D’Orazio, M. et al. Machine learning phenomics (MLP) combining deep learning with time-lapse-microscopy for monitoring colorectal adenocarcinoma cells gene expression and drug-response. Sci. Rep. https://doi.org/10.1038/s41598-022-12364-5 (2022).
Article PubMed PubMed Central Google Scholar
Fathi Kazerooni, A. et al. Cancer imaging phenomics via CaPTk: Multi-institutional prediction of progression-free survival and pattern of recurrence in glioblastoma. JCO Clin. Cancer Inform. 4, 234–244 (2020).
Article PubMed Google Scholar
Rathore, S. et al. Brain Cancer imaging phenomics toolkit (brain-CaPTk): An interactive platform for quantitative analysis of glioblastoma. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 10670 LNCS (2018).
Collins, R. What makes UK Biobank special?. Lancet 379, 1173–1174 (2012).
Article PubMed Google Scholar
Sudlow, C. et al. UK biobank: An open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 12, e1001779 (2015).
Article PubMed PubMed Central Google Scholar
Fry, A. et al. Comparison of sociodemographic and health-related characteristics of UK biobank participants with those of the general population. Am. J. Epidemiol. 186, 1026–1034 (2017).
Article PubMed PubMed Central Google Scholar
Richiardi, L., Pizzi, C. & Pearce, N. Commentary: Representativeness is usually not necessary and often should be avoided. Int. J. Epidemiol. 42, 1018–1022 (2013).
Article PubMed Google Scholar
Delude, C. M. Deep phenotyping: The details of disease. Nature 527, S14–S15 (2015).
Article ADS CAS PubMed Google Scholar
Houle, D., Govindaraju, D. R. & Omholt, S. Phenomics: The next challenge. Nat. Rev. Genet. 11, 855–866 (2010).
Article CAS PubMed Google Scholar
Nicholson, J. K. Molecular phenomic approaches to deconvolving the systemic effects of SARS-CoV-2 infection and post-acute COVID-19 syndrome. Phenomics 1, 143–150 (2021).
Article PubMed PubMed Central Google Scholar
Zhang, H., Hua, X. & Song, J. Phenotypes of cardiovascular diseases: Current status and future perspectives. Phenomics 1, 229–241 (2021).
Article PubMed PubMed Central Google Scholar
Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Fang, Z., He, M. & Song, M. Serum lipid profiles and risk of colorectal cancer: A prospective cohort study in the UK Biobank. Br. J. Cancer 124, 663–670 (2021).
Article CAS PubMed Google Scholar
McMenamin, Ú. C. et al. Circulating sex hormones are associated with gastric and colorectal cancers but not esophageal adenocarcinoma in the UK Biobank. Am. J. Gastroenterol. 116, 522–529 (2021).
Article CAS PubMed PubMed Central Google Scholar
UK Biobank. UK Biobank: Anthropometry. http://www.ukbiobank.ac.uk/ (2014).
WHO Consultation on Obesity (1999) and World Health Organization (2000) Obesity: preventing and managing the global epidemic: report of a WHO consultation. World Health Organization. https://apps.who.int/iris/handle/10665/42330 (Accessed 01 June 2023).
Fry, D., Almond, R., Moffat, S., Gordon, M. & Singh, P. UK biobank biomarker project companion document to accompany serum biomarker data. http://www.ukbiobank.ac.uk/uk-biobank-biomarker-panel/ (2019).
UK Biobank. UK Biobank Biomarker assay quality procedures: Approaches used to minimise systematic and random errors (and the wider epidemiological implications). http://www.ukbiobank.ac.uk/ (2019).
Gausman, V., Liang, P. S., O’Connell, K., Kantor, E. D. & Du, M. Evaluation of Early-life factors and early-onset colorectal cancer among men and women in the UK biobank. Gastroenterology https://doi.org/10.1053/j.gastro.2021.11.023 (2022).
Article PubMed Google Scholar
Kang, Y. J., Stewart, M., Patel, M., Furniss, D. & Wiberg, A. Modifiable risk factors for prevention in Dupuytren’s disease: A UK biobank case-control study. Plast. Reconstr. Surg. Adv. https://doi.org/10.1097/PRS.0000000000010774 (2023).
Article Google Scholar
Bujang, M. A. et al. The all-cause mortality and a screening tool to determine high-risk patients among prevalent type 2 diabetes mellitus patients. J. Diabetes Res. 2018, 1–8 (2018).
Google Scholar
The jamovi project. jamovi (Version 2.3). [Computer Software] (2022).
Ferlay, J. et al. Global cancer observatory: Cancer Today. Lyon, France: International Agency for Research on Cancer. https://gco.iarc.fr/today (Accessed 01 June 2023) (2020).
Kim, S. W. et al. A new equation to estimate muscle mass from creatinine and cystatin C. PLoS One 11, e0148495 (2016).
Article PubMed PubMed Central Google Scholar
Murty, M. S. N., Sharma, U. K., Pandey, V. B. & Kankare, S. B. Serum cystatin C as a marker of renal function in detection of early acute kidney injury. Indian J. Nephrol. 23, 80–183 (2013).
Article Google Scholar
Breznik, B., Mitrović, A., Lah, T. & Kos, J. Cystatins in cancer progression: More than just cathepsin inhibitors. Biochimie 166, 233–250 (2019).
Article CAS PubMed Google Scholar
Leto, G., Crescimanno, M. & Flandina, C. On the role of cystatin C in cancer progression. Life Sci. 202, 152–160 (2018).
Article CAS PubMed Google Scholar
Song, F. et al. Mast cells inhibit colorectal cancer development by inducing ER stress through secreting Cystatin C. Oncogene 42, 209–223 (2023).
Article CAS PubMed Google Scholar
Wu, J. et al. Association of plasma cystatin C with all-cause and cause-specific mortality among middle-aged and elderly individuals: A prospective community-based cohort study. Sci. Rep. 12, 22265. https://doi.org/10.1038/s41598-022-24722-4 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Ashktorab, H., Kupfer, S. S., Brim, H. & Carethers, J. M. Racial disparity in gastrointestinal cancer risk. Gastroenterology 153, 910–923 (2017).
Article PubMed Google Scholar
Liu, Z. et al. The disparities in gastrointestinal cancer incidence among Chinese populations in Shanghai compared to Chinese immigrants and indigenous non-Hispanic white populations in Los Angeles, USA. Int. J. Cancer 146, 329–340 (2020).
Article CAS PubMed Google Scholar
Pardamean, C. I. et al. Changing colorectal cancer trends in Asians: Epidemiology and risk factors. Oncol. Rev. 17, 10576 (2023).
Article PubMed PubMed Central Google Scholar
Wang, S. et al. Global and national trends in the age-specific sex ratio of esophageal cancer and gastric cancer by subtype. Int. J. Cancer 151, 1447–1461 (2022).
Article CAS PubMed PubMed Central Google Scholar
Cruz, A. et al. Racial and gender disparities in the incidence of anal cancer: Analysis of the nationwide inpatient sample (NIS). J. Gastrointest. Oncol. 10, 37–41 (2019).
Article PubMed PubMed Central Google Scholar
Scherübl, H. Tobacco smoking and gastrointestinal cancer risk. Visc. Med. 38, 217–222 (2022).
Article PubMed PubMed Central Google Scholar
Shin, W. S. et al. Updated epidemiology of gastric cancer in Asia: Decreased incidence but still a big challenge. Cancers 15, 2639 (2023).
Article CAS PubMed PubMed Central Google Scholar
Suzuki, A. et al. Defined lifestyle and germline factors predispose Asian populations to gastric cancer. Sci. Adv. 6, eaav9778 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Jang, J. et al. Association between body mass index and risk of gastric cancer by anatomic and histologic subtypes in over 500,000 East and Southeast Asian cohort participants. Cancer Epidemiol. Biomark. Prev. 31, 1727–1734 (2022).
Article Google Scholar
Sohn, W. et al. Obesity and the risk of primary liver cancer: A systematic review and meta-analysis. Clin. Mol. Hepatol. 27, 157–174 (2021).
Article PubMed Google Scholar
Tian, J. et al. Cumulative evidence for the relationship between body mass index and the risk of esophageal cancer: An updated meta-analysis with evidence from 25 observational studies. J. Gastroenterol. Hepatol. 35, 730–743 (2020).
Article CAS PubMed Google Scholar
Brown, R. B. Obesity and cancer: Potential mediation by dysregulated dietary phosphate. Obesities 2, 64–75 (2022).
Article Google Scholar
He, Y., Chen, J., Ma, Y. & Chen, H. Apolipoproteins: New players in cancers. Front. Pharmacol. 13, 1051280 (2022).
Article CAS PubMed PubMed Central Google Scholar
Shi, F. et al. Identification of serum proteins AHSG, FGA and APOA-I as diagnostic biomarkers for gastric cancer. Clin. Proteom. 15, 18 (2018).
Article Google Scholar
Sirniö, P. et al. Decreased serum apolipoprotein A1 levels are associated with poor survival and systemic inflammatory response in colorectal cancer. Sci. Rep. 7, 5374 (2017).
Article ADS PubMed PubMed Central Google Scholar
Wang, X. P. et al. High level of serum apolipoprotein A-I is a favorable prognostic factor for overall survival in esophageal squamous cell carcinoma. BMC Cancer 16, 516 (2016).
Article PubMed PubMed Central Google Scholar
Ahn, J. et al. Prediagnostic total and high-density lipoprotein cholesterol and risk of cancer. Cancer Epidemiol. Biomark. Prev. 18, 2814–2821 (2009).
Article CAS Google Scholar
Jung, Y. S. et al. Associations between parameters of glucose and lipid metabolism and risk of colorectal neoplasm. Dig. Dis. Sci. 60, 2996–3004 (2015).
Article CAS PubMed Google Scholar
Georgila, K., Vyrla, D. & Drakos, E. Apolipoprotein A-I (ApoA-I), immunity, inflammation and cancer. Cancers 11, 1097 (2019).
Article CAS PubMed PubMed Central Google Scholar
Ai, J. et al. Proteome analysis of hepatocellular carcinoma by laser capture microdissection. Proteomics 6, 538–546 (2006).
Article CAS PubMed Google Scholar
Mustafa, M. G. et al. Biomarker discovery for early detection of hepatocellular carcinoma in hepatitis C-Infected patients. Mol. Cell. Proteom. 12, 3640–3652 (2013).
Article CAS Google Scholar
Pedersen, K. M., Çolak, Y., Bojesen, S. E. & Nordestgaard, B. G. Low high-density lipoprotein and increased risk of several cancers: 2 population-based cohort studies including 116,728 individuals. J. Hematol. Oncol. 13, 129 (2020).
Article CAS PubMed PubMed Central Google Scholar
Ren, L. et al. Apolipoproteins and cancer. Cancer Med. 8, 7032–7043 (2019).
Article PubMed PubMed Central Google Scholar
Nault, J. C. et al. Clinical impact of genomic diversity from early to advanced hepatocellular carcinoma. Hepatology 71, 164–182 (2020).
Article CAS PubMed Google Scholar
Hashemian, M. et al. Dietary intake of minerals and risk of esophageal squamous cell carcinoma: Results from the Golestan Cohort Study. Am. J. Clin. Nutr. 102, 102–108 (2015).
Article CAS PubMed PubMed Central Google Scholar
Shah, S. C. et al. Associations between calcium and magnesium intake and the risk of incident oesophageal cancer: An analysis of the NIH-AARP Diet and Health Study prospective cohort. Br. J. Cancer 122, 1857–1864 (2020).
Article CAS PubMed PubMed Central Google Scholar
Shah, S. C. et al. Associations between calcium and magnesium intake and the risk of incident gastric cancer: A prospective cohort analysis of the National Institutes of Health-American Association of Retired Persons (NIH-AARP) Diet and Health Study. Int. J. Cancer 146, 2999–3010 (2020).
Article CAS PubMed Google Scholar
Wulaningsih, W. et al. Serum calcium and risk of gastrointestinal cancer in the Swedish AMORIS study. BMC Public Health 13, 663 (2013).
Article CAS PubMed PubMed Central Google Scholar
Karavasiloglou, N. et al. Prediagnostic serum calcium concentrations and risk of colorectal cancer development in 2 large European prospective cohorts. Am. J. Clin. Nutr. 117, 33–45 (2023).
Article PubMed Google Scholar
Peacock, M. Calcium metabolism in health and disease. Clin. J. Am. Soc. Nephrol. 5, S23–S30 (2010).
Article CAS PubMed Google Scholar
Wulaningsih, W. et al. Inorganic phosphate and the risk of cancer in the Swedish AMORIS study. BMC Cancer 13, 257 (2013).
Article CAS PubMed PubMed Central Google Scholar
Yan, H., Jin, X., Yin, L., Zhu, C. & Feng, G. Investigating causal associations of circulating micronutrients concentrations with the risk of lung cancer: A Mendelian randomization study. Nutrients 14, 4659 (2022).
Article Google Scholar
Zhou, Y. et al. Identifying opportunities for timely diagnosis of bladder and renal cancer via abnormal blood tests: A longitudinal linked data study. Br. J. Gen. Pract. 72, e19–e25 (2022).
Article PubMed Google Scholar
Allin, K. H., Bojesen, S. E. & Nordestgaard, B. G. Inflammatory biomarkers and risk of cancer in 84,000 individuals from the general population. Int. J. Cancer 139, 1493–1500 (2016).
Article CAS PubMed Google Scholar
Reichman, H., Karo-Atar, D. & Munitz, A. Emerging roles for eosinophils in the tumor microenvironment. Trends Cancer 2, 664–675 (2016).
Article PubMed Google Scholar
Sibille, A. et al. Eosinophils and lung cancer: From bench to bedside. Int. J. Mol. Sci. 23, 5066 (2022).
Article CAS PubMed PubMed Central Google Scholar
Varricchi, G. et al. Eosinophils: The unsung heroes in cancer?. OncoImmunology 7, e1393134 (2018).
Article PubMed Google Scholar
Conroy, M. C. et al. UK biobank: A globally important resource for cancer research. Br. J. Cancer 128, 519–527 (2022).
Article PubMed PubMed Central Google Scholar
Batty, G. D., Gale, C. R., Kivimäki, M., Deary, I. J. & Bell, S. Comparison of risk factor associations in UK Biobank against representative, general population based studies with conventional response rates: Prospective cohort study and individual participant meta-analysis. BMJ 368, m131 (2020).
Article PubMed PubMed Central Google Scholar
Manolio, T. A. & Collins, R. Enhancing the feasibility of large cohort studies. JAMA 304, 2290–2291 (2010).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This research was conducted using the UK Biobank Resource under application number 96759. We would like to thank the Director General of Health Malaysia for his permission to publish this article.

Funding

The authors declare that no funds, grants, or other support were received during the preparation of this manuscript.

Author information

Authors and Affiliations

Clinical Research Centre, Sarawak General Hospital, Ministry of Health Malaysia, Jalan Hospital, 93586, Kuching, Sarawak, Malaysia
Shirin Hui Tan, Catherina Anak Guan, Mohamad Adam Bujang & Wei Hong Lai
Faculty of Resource Science and Technology, Universiti Malaysia Sarawak, 94300, Kota Samarahan, Malaysia
Shirin Hui Tan & Edmund Ui Hang Sim
Department of Radiotherapy, Oncology and Palliative Care, Sarawak General Hospital, Ministry of Health Malaysia, Jalan Hospital, 93586, Kuching, Sarawak, Malaysia
Pei Jye Voon

Authors

Shirin Hui Tan
View author publications
You can also search for this author in PubMed Google Scholar
Catherina Anak Guan
View author publications
You can also search for this author in PubMed Google Scholar
Mohamad Adam Bujang
View author publications
You can also search for this author in PubMed Google Scholar
Wei Hong Lai
View author publications
You can also search for this author in PubMed Google Scholar
Pei Jye Voon
View author publications
You can also search for this author in PubMed Google Scholar
Edmund Ui Hang Sim
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.H.T., C.A.G., W.H.L., E.U.H.S. and P.J.V. designed the study. S.H.T., C.A.G., M.A.B., W.H.L., E.U.H.S. and P.J.V. analysed, interpreted the data and drafted the paper. S.H.T. had primary responsibility for statistical analysis and final content. All authors reviewed and approved the final paper.

Corresponding author

Correspondence to Shirin Hui Tan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Table 1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Tan, S.H., Guan, C.A., Bujang, M.A. et al. Identification of phenomic data in the pathogenesis of cancers of the gastrointestinal (GI) tract in the UK biobank. Sci Rep 14, 1997 (2024). https://doi.org/10.1038/s41598-024-52421-9

Download citation

Received: 13 September 2023
Accepted: 18 January 2024
Published: 23 January 2024
DOI: https://doi.org/10.1038/s41598-024-52421-9

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.