Abstract
The TNM staging system is currently used to detect cancer stages. Regardless, a small proportion of cancer patients recur even after therapy, suggesting more specific molecular tools are required to justify the stage-specific detection and prompt cancer diagnosis. Thus, we aimed to explore the blood-based DNA methylation signature of metastatic nasopharyngeal carcinoma (NPC) to establish a holistic methylation biomarker panel. For the identification of methylation signature, the EPIC BeadChip-based array was performed. Comparative analysis for identifying unique probes, validation, and functional studies was investigated by analyzing GEO and TCGA datasets. We observed 4093 differentially methylated probes (DMPs), 1232 hydroxymethylated probes, and 25 CpG islands. Gene expression study revealed both upregulated and downregulated genes. Correlation analysis suggested a positive (with a positive r, p ≤ 0.05) and negative (with a negative r, p ≤ 0.05) association with different cancers. TFBS analysis exhibited the binding site for many TFs. Furthermore, gene enrichment analysis indicated the involvement of those identified genes in biological pathways. However, blood-based DNA methylation data uncovered a distinct DNA methylation pattern, which might have an additive role in NPC progression by altering the TFs binding. Moreover, based on tissue-specificity, a variation of correlation between methylation and gene expression was noted in different cancers.
Similar content being viewed by others
Introduction
Nasopharyngeal carcinoma (NPC) is one of the predominant malignancies in South Asian continents, primarily in southern China, Indonesia, and India1. According to the global cancer repository report published by GLOBOCAN in 2020 (https://gco.iarc.fr/), NPC ranked 22nd based on occurrence. Among the in-vogue treatment regimens, radiotherapy and parallel chemotherapy increase NPC patients’ survival. To date, identifying NPC at an initial stage is challenging because the symptoms of this cancer resemble other, more common conditions, which creates confusion for clinicians to diagnose. It is also hard to examine the nasopharynx. Sometimes, the submucosal lesions spread silently with a standard appearance during clinical tests. Thus, many NPC patients appear in clinics with advanced stages, resulting in a poor prognosis2.
So far, the TNM (tumor–node–metastasis) staging technique is a decisive parameter in providing analytical information and directing treatment choices for cancers, including NPC. To date, this model is not enough to determine the best personalized treatment. About 20–30% of NPC patients with the same stages receiving similar treatment show local relapse or distal metastasis3,4,5, pointing out that molecular subclassification may be clinically appropriate, but more accurate molecular tools are a prerequisite that can stratify patients concerning prognosis and response to therapy. Since NPC is diversely categorized into four subtypes based on histological character and ethnicity-specific occurrence, there is very little knowledge available for NPC’s prognostic biomarkers to predict and improve patient outcomes.
The development of malignancy or its distribution is a complex phenomenon comprising many factors, which may differ uniquely from one cancer to another. Cumulative evidence suggests that the tumors’ molecular characteristics add predictive strength in developing molecular biomarkers. DNA methylation at the 5ʹ cytosine (5mc) was revealed with capabilities to define a better prognosis of various malignancies6,7,8,9. DNA methylation is one of the most extensive genomic aberrations occurring during carcinogenesis10. These abnormalities can broadly be categorized as focal hypermethylation and global hypomethylation11. Hypermethylation can promote the silencing the regulatory region of tumor suppressor genes, contributing to cell growth deregulation12,13.
On the other side, hypomethylation mainly predominates in the repetitive regions and sometimes unwantedly activates transposable elements within the genome, resulting in further genetic damage14. Hypomethylation also drives genomic instability, causing missegregation of chromosomes during cell division15. Thus, studying DNA methylation in cancer is a fundamental requirement for comprehending that cancer’s outcome and prognostic behaviour and developing a better treatment regime.
Genome-wide DNA methylation between tissue and whole blood has been investigated in several tumors and displayed a dissimilar portrait of methylation signature, which indicates that blood DNA methylation has an additional role in tumor progression and metastasis over tissue DNA methylation. Moreover, the Study of DNA methylation from blood samples has some advantages over other sampling methods as it is commonly available in epidemiological studies, marginally aggressive, and more suitable to the screening population. Although DNA methylation can be detected in cell-free DNA (cfDNA) from serum or plasma, its low yield and variation in the processing method of isolation indicated significant problems16.
However, the underlying relationship between DNA methylation and NPC’s development is poorly understood. Besides two genome-wide DNA methylation studies of primary NPC, only a few biomarkers with diagnostic and prognostic values have been reported5,17,18. The only existing report of global blood DNA methylation of head and neck cancer (HNSCC) suggested that environmental risk factors could modify the DNA methylation and are associated with the prognosis of HNSCC. Although this study did not imply the advancement of HNSCC, it led us to hypothesize whether the blood DNA methylation signature differs from tissue DNA methylation in advanced NPC.
Therefore, we aimed to determine the DNA methylation signature from the whole blood of advanced NPC patients and examined the differences in DNA methylation patterns of blood and tissue of NPC patients to identify unique methylation biomarkers. The predictive role of biomarkers in NPC metastasis was investigated further.
Results
Blood DNA methyl array data revealed a differential methylation pattern in metastatic NPC
The methylation analysis results are displayed in Figs. 1, 2, Tables S2 and S3. After normalization, probes with Δβ between ≤ 0.2 and ≥ 0.2 were removed, and 4093 probes were identified as DMPs, which covered approximately 2875 genes throughout the genome (Fig. 1 and Table S2). The distribution of DMPs in different chromosomes showed that Chr.1 and 2 were highly methylated (Fig. 2a). Analysis of methylation related to the gene’s location indicated that the highest number of methylations occurred at the gene body (47.50%), followed by the intragenic region or IGR (29.76%) (Fig. 2c left). The open sea region showed comparatively higher DMPs (8.0.92%) than any other locations related to CpG (Fig. 2c right). The majority of the probes were hypermethylated (97.39%) in NPC patients (Fig. 2e). The hypermethylation frequency was highest in the gene body (46.43%), followed by IGR (29.05%). The open sea region was more hypermethylated (80.07%) than other CpG locations.
Methylation in the untranslated region (UTR) is essential in controlling gene expression. In our study, a total of 12.05% hypermethylation was observed in the untranslated region, including 5ʹ-UTR (9.92%) and 3ʹ-UTR (2.13%). Promoter methylation of about 9.28% was noted, in which TSS200 and TSS1500 had 1.93% and 7.35%, respectively (Fig. 2f). The distribution of hypomethylation concerning genes and CpGs followed a similar trend (Fig. 2g).
Unlike 5mc, hydroxymethylation (5hmc) is mainly distributed to the active transcriptional region, accompanied by open chromatin, allowing histone modification. The expression of 5hmc varies from tissue to tissue, and its expression is correspondingly low in cancer stage 4 with metastasis. Patients were shown to live longer with a higher 5hmc level than one with a lower level19. Thus, using 5hmc as the ideal prognostic biomarker could be one of the most excellent possibilities for understanding cancer susceptibility. We further conducted the analysis and distribution of 5hmc to understand the ratio of 5hmc to 5mc in advance NPC and found a total of 1232 5hmc probes, which comprised almost one-third of the 5mc (Fig. 2b and Table S3). The highest amount of 5hmc was noticed in the open sea region (82.06%), followed by the shore (18.91%), shelf, and island. The distribution of 5hmc related to gene location exposed the gene body as the highest 5hmc covering region compared to other sites. The UTR contained 14.78% of the 5hmc, in which 5ʹ-UTR and 3ʹ-UTR had 10.23% and 4.55%, respectively (Fig. 2d). A total of 14.01% of 5hmc was noted in the promoter region, including TSS200 (10.63%) and TSS1500 (3.41%).
Comparative analysis exhibited uniquely methylated probes
The comparative analysis of the present blood DNA methyl array data with the previously existing tissue DNA methylation dataset represented that despite few, most of the genes and probes were uniquely methylated (Fig. 3a,b). The distribution of differential methylation in different chromosomes indicated a discriminative picture. The highest methylation was observed in chromosome 1, above 10%, followed by chromosomes 2 and 19, showing the lowest methylation (< 5%) compared to other datasets (Fig. 3c). The distinct methylation pattern indicated that blood DNA methylation might have an additional role in NPC progression.
Early evidence pointed out that CpG methylation in untranslated regions, promoters, and gene bodies could alter gene expression in many cancers. Recent studies have demonstrated the role of DNA Methylation in the untranslated regions. Methylation in the 5ʹ-UTR regions often upregulates the gene expression. DNA methylation in 3ʹ-UTR is also functionally involved in gene regulation in the pan-cancer network20,21,22. A correlation between CpG methylation in the gene body and cancer prognosis is now established23. For further continuation of our study, probes in the CpG island containing hypermethylation and hypomethylation were selected. With this criterion, we found a total of 25 probes that correspond to 22 genes; of these, 21 probes were hypermethylated. The most significant number of hypermethylation and hypomethylation was noted in PLCB3 (Δβ: 0.30, adj. p 0.015) and HLA-DRB5 (Δβ: -0.52, adj. p 0.043) genes, respectively. Among 22 genes, PLCB3, C18orf1, ZNF516, PRKCZ, KDM4B, HLX, MGRN1, UHRF1, SPI1, PLEC1, MPO, FLNB, MLLT1, and HLA-DRB5 resided in the gene body; promoter containing six genes (FGR, COL11A2, SMTN, KCNT1, and APEH); 5ʹ UTR, 3ʹ-UTR and 1st exon were having MBP, ADRBK1 and FUT4 genes respectively. The overall result is depicted in Table 1. The identified probes were cross-validated through TCGA dataset analysis. Data showed that the distribution of DMPs varied according to cancer types, among which cancer BRCA, HNSC, KIRC, KIRP, and UCEC were enriched with statistically significant methylation (p < 0.05).
Selected gene characteristics such as coding sequence length, transcript, 5ʹ-UTR, 3ʹ-UTR, genome span, GC content, distribution of gene types, and number of exons were compared with genomes using chi-squared and Student’s t-tests. Data indicated that the identified genes had a statistically significant (p < 0.05) amount of 5ʹ-UTR, an increased percentage of GC content, and chromosome distribution towards the coding region (Fig. S3).
Functional analysis identified a cancer-specific correlation between DNA methylation and gene expression
All 22 genes were validated for gene expression by nalysing the RNA sequencing data from the GEO dataset. Fold change was determined by dividing the mean of control FPKM with that of RPKM of cancer. The Data explained that genes SMTN, KCNT1, APEH, ZNF516, PRKCZ, KDM4B, MGRN1, MPO, MLLT1, and FLNB were upregulated. FGR, COL11A2, MBP, PLCB3, HLX, UHRF1, SPI1, FUT4, and HLA-DRB5 were downregulated (Fig. 4a).
We further investigated the correlation between DNA methylation and gene expression by determining the correlation coefficient (r), where r with a positive, negative, and zero value symbolized the positive, negative, and no correlation, respectively. P value ≤ 0.05 was considered significant. Results demonstrated that the methylation of the genes FGR, COLIIA2, APEH, HLX, SPI1, and FUT4 were negatively correlated with gene expression. Similarly, a reverse correlation was observed from the probe of SMTN. These two data were consistent with the RNA-seq result. Conversely, the probe of MGRN1 and MPO did not follow the trend of the RNA-seq data. Additionally, methylation in PLCB3, MBP, ZNF156, PRKCZ, KDM4B, UHRF1, MLLT1, FLNB, and HLA-DRB5 showed both positive and negative associations with gene expression in different tissue-specific carcinomas, suggesting that the methylation and gene expression could be tissue specific. Probes that did not show significant methylation were not included in the correlation analysis. The result of the correlation analysis is depicted in Fig. 4b and Table S4.
Validation of genome-wide methylation and gene expression data
To check the accuracy of the methyl array and RNA sequence data, MSP-RT and qRT-PCR for gene expression were employed. MSP has been widely used in clinical settings for the diagnosis and prognosis of diseases such as cancer due to its high specificity, sensitivity, cost, and labor-saving method24. Using SYBR-Green technology in real-time PCR enables the objective evaluation of MSP data, providing an easy way to express the methylation status semi-quantitatively without having to perform laborious gel electrophoresis, as shown by existing research25,26,27. Methylation status is quantified by evaluating the difference between the CT values (ΔCT = uCT − mCT) as described by Yoshioka et.al28, where u and m indicate unmethylation and methylation, respectively. Results of the MSP RT-PCR study revealed that while ZNF516, PLCB3, MBP, and FGR were found to be hypermethylated in NPC (ΔCTZNF516 = 0.5, ΔCTPLCB3 = 0.639, ΔCTMBP = 1.138 and ΔCTFGR = 0.733), HLA-DRB5 was hypomethylated. The result of the gene-wise methylation difference and melting curve of each gene is depicted in Figs. 5a and S5, respectively. Gene expression study showed a significantly lower expression of RNA in PLCB3, MBP, and FGR, HLA-DRB5 (p ≤ 0.05) and a higher expression in ZNF516 (Fig. 5b).
DNA Methylation in CpG harbors several transcription factor binding sites
Transcription factor takes the overall control of gene regulation, thus maintaining normal cellular functions. They are commonly dysregulated in many human cancers and are associated with approximately 20% of oncogenesis29. DNA methylation, mainly in the CpG region, potentially alters TFs binding to DNA and promotes gene regulation modification30. We predicted TFBSs present in the CpG island of designated probes. TFs were selected according to the PWM relative score ≥ 0.8 and a p-value < 0.05. The shading of the boxes indicated the p-value of the profile’s match to that position (scaled between 0 and 1000 scores, where 0 corresponds to a p-value of 1 and 1000 to a p-value ≤ 10–10); thus, the darker the shade, the lower (better) the p-value. The predicted score was adjusted to 600, and TFs with a score more than 600 were obtained. Data indicated that the identified genomic region of FUT4 and APEH were enriched with several TF binding sites that scored ranging from 963 to 611. Moreover, binding sites for Zinc finger proteins (ZBTB18, ZNF680, ZNF460, ZNF135, ZNF384, ZNF354A, Zic3, Zic1) and FOX family TFs (FOXD3 and FOXO1) were mostly observed. TFs in the ZNFs and FOX family modulates major regulatory pathways and involve in the onset of cancers. The result of TFBS prediction is depicted in Fig. 6 and Table S7.
Methylation in CpG could modulate immunological functions in various cellular pathways
Pathway enrichment study is a statistical method of exploring pathways enriched in the input gene list relative to what is expected by chance31. The enrichment analysis results are displayed in Fig. 7 and Table S8. After the analysis, pathways were filtered based on a user-specified FDR cut-off, followed by shorting the significant pathway by FDR, Fold Enrichment, or other metrics. Results indicated that genes were connected with different immune-modulatory functions, such as T-cell differentiation, cell–cell adhesion, neutrophil, and leucocyte-mediated immunity etc. (Fig. 7a). A hierarchical clustering tree was created to comprehend the correlation among significant pathways listed in the enrichment tab, in which bigger dots indicate more significant P-values. Data showed that T-cell differentiation, cell–cell adhesion, and leucocyte activation were potentially augmented (Fig. 7b). We further generated the interactive plot to establish the intricate relationship between genes and enriched pathways (Fig. 8a). The Gene interaction network was constructed to check the protein–protein interaction among the selected genes; a bigger node indicates a significant interaction (Fig. 8b).
Discussion
So far, the study of DNA methylation in NPC has mainly focused on tumor suppressor genes’ methylation patterns17. Existing genome-wide DNA methylation reports deciphered high methylation among a Chinese population. The report suggested that this could affect the current therapy in NPC patients5. Another methylome study explained increased methylation at chromosome 6p in the same ethnic population18. Recently, peripheral blood methylation signatures in various solid tumors exhibited a distinct epigenetic signature, indicating cancer’s unique prognostic and diagnostic biomarkers and their resistance to therapy16,32,33,34. Blood DNA methylation in head and neck cancer (HNSCC) was first reported in 2006. It explored an independent association between hypomethylation and HNSCC prognosis and displayed a complex relationship with the known risk factors associated with this cancer35. This growing evidence indicated the additional role of blood DNA methylation on cancer progression, metastasis, and recurrence.
Our current study exhibited a different pattern of methylation in NPC. Chromosomes 1 and 2 have the highest methylation, indicating a distinctive signature from NPC tissue DNA methylation. According to previous reports, the gene body’s methylation represses intragenic transcription and allows efficient transcriptional elongation23,36. Our data suggested that the distribution of DMPs in the location relative to the gene and CpG showed increased methylation in the gene body and the open sea region. A similar pattern was also found in other cancers. A higher amount of hypermethylation in the NPC patients’ samples was observed in this study. This finding was consistent with the previous reports published in China, signifying that hypermethylation is comparatively higher in NPC than in other cancers18.
CpG methylation is a primary concern of cancer risk. The Discovery of hypermethylation in the CpG island of certain tumor suppressor genes interrupts many cellular pathways like DNA repair, apoptosis, cell cycle control, cell–cell adhesion, etc.37. Unlike hypermethylation, hypomethylation in CpG comes later in cancers and contributes to metastatic tumor heterogeneity38. We found twenty-two unique genes (corresponding to twenty-five probes) that were differentially methylated in their CpG islands. The functional study analyzing RNA sequence revealed the differential expression of those genes in NPC, and TCGA analysis further reinforced the study. The results of the genome-wise methyl array and RNA sequence were further cross-validated by performing MSP-RT PCR and quantitative RT-PCR for gene expression; both exhibited consistency with the previous data, suggesting the accuracy and robustness of the methylation analysis.
Moreover, a correlation was detected between methylation of the specific probe and gene expression in cancers other than NPC. This study also implied that the methylation within the particular region of CpG and gene expression could vary according to cancer. Further, the analysis of TFBS of CpG methylated promoter and gene bodies illustrated binding sites for several essential TFs such as ZNFs and FOXs. Early reports demonstrated that ZNF and FOX family TFs are crucial in cell migration and proliferation39,40. A differential expression of other TFs investigated in many cancers is associated with tumors’ development, spread, invasiveness, and lack of proper immune responses41,42,43,44,45. It can be proposed that CpG methylation could alter the binding of TFs, which is thought to modify the gene expression of the corresponding proteins, resulting in NPC advancement. However, further research is needed to imply the hypothesis.
Additionally, the gene enrichment analysis of the identified genes was connected to various cellular pathways and immune activations. Previous research projected the contribution of these genes in different cancers like breast, pancreatic, and ovarian. Genes PRKCZ (Protein Kinase C Zeta), FGR (Src family protein), KDM4B, MPO, COL11A1, FUT4 (Fucosyltransferase IV), and APEH (acylpeptide hydrolase) are directly associated with tumor aggressiveness, growth, and migration, and invasiveness, resulting in poor clinical outcome in cancer patients41,46,47,48,49,50. Similarly, HLX (H2.0-like homeobox) controls early hematopoiesis and promotes acute myeloid leukemia51, and HLA-DRB5 (HLA major histocompatibility complex, class II, DR beta 5) acts as a receptor for T cell activation. Its expression is shown to be upregulated in breast cancer52.
The present study elucidated a lucid picture of genome-wide blood DNA methylation of metastatic NPC, which could set a goal and a future direction of developing more precise molecular tools based on methylation biomarkers for detecting and diagnosing primary and advanced NPC. Though the study was statistically robust and comprehensive, a few limitations exist. Limited sample size could affect statistical interpretation, and a lack of downstream experiments might create a lag for further validation.
Methods
Study selection
Blood samples from four NPC patients were collected from the Eden Medical Centre, Dimapur, Nagaland, India, between 2018 and 2019. For the clinical examination, detailed medical history, physical check-up, Nasopharyngioscopy, serum biochemistry, blood count, chest X-ray, CT or MRI examination of the nasopharynges, skull base, and any suspect metastatic sites (paranasal sinuses) were tested. All the patients suffered from non-keratinizing undifferentiated carcinoma (NKUC). NPC types were determined following the guidelines of WHO53. TNM classification was confirmed according to the recommendation of the American Joint Committee on Cancer (AJCC)54,55. All the samples were obtained before the start of the diagnosis. Four control samples were obtained separately from matched disease-free healthy individuals’ age, sex, and ethnicity. Both cases and controls tested negative for Epstein Bar Virus (EBV). Written informed consent was obtained from all the patients. Ethical approval was obtained from the ethics committee of Visva-Bharati University, West Bengal and Eden Medical centre, Dimapur, Nagaland, as the participating institute. All methods were performed in accordance with the relevant guidelines and regulations. Demographic and Clinical characteristics of NPC cases and controls are shown in Table S1.
Methylation profiling and data processing
Genomic DNA was extracted from the whole blood of eight samples, followed by the bisulfite conversion using the EZ DNA methylation Gold Kit (Zymo Research, USA). The whole-genome methylation array was carried out using the Infinium Methylation EPIC BeadChip Kit (Illumina Inc, USA). The array data (IDAT files) were nalysed using the ChAMP Bioconductor package in R studio (Version 1.2.5042)56. Based on the methylation intensities, the methylation level was measured by the beta-value (β) method, ranging from 1 to 0 (where the value of 1 indicates a fully methylated probe and 0 indicates an unmethylated probe)57. Β value is calculated as the ratio of the methylated probe intensity to the overall intensity (sum of methylated and unmethylated probe intensities)11.
From the methylation EPIC BeadChip, 865,918 methylated probes were detected initially. Probes having bad samples (p > 0.01), a bead count of < 3, having non-CpGs, containing SNPs, probes aligned to multiple locations, and those located at X and Y chromosomes were removed. Eventually, 731,042 methylated probes were recorded. The result of the quality control (QC), beta value distribution plot, and Frequency polygon is shown in Fig. S1. The beta-mixture quantile normalization (BMIQ) method was used to adjust the probe type or color bias, subtract background signals, and eliminate systematic errors (Fig. S2). It is a well-established normalization method that decomposes the β profiles of Type I and Type II probes into two mixtures of 3 methylation states. Then quantile normalized the three distributions of Type II profile corresponding to those of Type I profile58. Wilcoxon signed-rank test was performed for each locus of the sample group (Cancer vs. Control) to identify a significantly differential methylated locus. Multiple testing correction was performed over p values using the Benjamini-Hochberg (BH) method. The cut-off for differential methylation was restricted to BH-adjusted p-values < 0.05 and absolute beta-value (mean beta-value difference between cancer and control) or Δβ ≥ 0.2. CpG was considered hypermethylated or hypomethylated if the Δβ was ≥ 0.2 or ≤ 0.2, respectively59,60. The outline of the overall process is depicted in Fig. 9.
Comparative analysis of methyl array data with global NPC methylation dataset
The present methyl array data were compared with available global NPC methyl array data (GSE52068 and GSE62336) deposited in the NCBI-GEO database (https://www.ncbi.nlm.nih.gov/gds). Data from GEO datasets were retrieved and re-analyzed by ‘GEOquery’ and ‘Limma’ R Bioconductor packages. Genes and CpG probes from all the datasets were compared separately by a Venn diagram to view common and unique genes or probes.
RNA-Seq and TCGA data analysis
RNA-sequencing data of two NPC and one para-cancerous tissue was retrieved from the GEO database (GSE134886) and re-analyzed to inspect the gene expression. Comprehensive methylation analysis was performed between the differentially methylated probes (DMPs) of NPC and other carcinomas existed in the Cancer Genome Atlas Program (TCGA): bladder urothelial carcinoma (BLCA, n = 424), breast invasive carcinoma (BRCA, n = 853), cholangiocarcinoma (CHOL, n = 45), colon adenocarcinoma (COAD, n = 288), esophageal carcinoma (ESCA, n = 190), head and neck squamous cell carcinoma (HNSC, n = 535), kidney renal clear cell carcinoma (KIRC, n = 333), kidney renal papillary cell carcinoma (KIRP, n = 292), liver hepatocellular carcinoma (LIHC, n = 409), lung squamous cell carcinoma (LUSC, n = 370), thyroid carcinoma (THCA, n = 558), and uterine corpus endometrial carcinoma (UCEC, n = 187). Further, the Pearson method was set to calculate the corresponding correlation coefficient to conduct a pair-wise correlation analysis between methylation and gene expression.
Bisulfite conversion, methylation-specific real-time PCR and quantitative RT-PCR
Bisulfite conversion was performed by EpiJET Bisulfite Conversion Kit (Thermo Scientific, USA) following the manufacturer’s instruction. During the conversion process, methylated cytosine remains unchanged, while unmethylated cytosine is converted into uracil. Two types of primers and the SYBR Green qPCR Master Mix were utilized for MSP-RT. CpG island was identified for each gene of interest (Fig. S4), followed by designing primers specifically for the site of the methylation-favoured regions of individual genes. One primer set was designed for fully methylated sequences that recognize unconverted cytosine during bisulfite treatment. In contrast, other primer sets recognize fully unmethylated sequences and bind to uracil instead of cytosine (Table S5). RT-PCR was performed under the following conditions: An initial denaturation at 95 °C for 5 min, then 35 cycles of 95 °C for 30 s, 50–55 °C (varied for different genes primer) for 30 s, 72 °C for 1 min, and a final step at 72° for 5 min.
RNA was extracted from NPC and control samples using QIAamp RNA Blood Mini Kit (Qiagen, USA) following the manufacturer’s protocol to validate the RNA sequence data for gene expression. After cDNA synthesis, RT-PCR was carried out at 95 °C for 3 min, followed by 40 cycles of denaturation at 95 °C for 30 s, annealing at 55–60 °C (varied for different genes primer) for 30 s, and amplification at 72 °C for 1 min. Comparing CT values of NPC and control, fold change was calculated. Primers for the qRT-PCR are listed in Table S6.
Transcription factor binding site prediction
For the prediction of transcription factor binding sites (TFBS) of the methylation-specific probes, the JASPAR 2022 module of the UCSC Genome Browser was employed61. JASPAR CORE TF-binding profiles read each taxon independently using position weight matrix (PWM) Scanning and generate a score for each TF.
Gene ontology and interaction network analysis
Gene ontology analysis (GO) was performed with an FDR cut-off < 0.05 using the open-source server ShinyGO 0.7662. It accesses the Ensembl and pathway database from many other sources and uses many R packages to visualize and analyze a gene’s relative functions. FDR was calculated based on the nominal P-value from the hypergeometric test. Depending on the FDR-specified cut-off, Pathways were filtered. Significant pathways were sorted by FDR, Fold Enrichment, or other metrics. The preferred GO ontology functions were selected, viz., biological process, molecular function, and cellular component.
Using the integrated CHAMP-FEM package, we identified the connected subnetwork of the protein interaction of a differentially methylated promoter region with a large average edge-weight density. Weight edges were constructed from the statistical association of DNA methylation with the phenotype of interest (case vs. control), where uniquely methylated genes were considered as seeds for the network construction. Queries were uploaded onto NetworkAnalyst (version 3.0), a comprehensive statistical network visual analytics platform63. IMEx Interactome database was selected, which uses literature-curated complete data from InnateDB64.
Ethics declarations
Obtained from Visva-Bharati and the participating institute.
Consent to participate
Informed consent was taken from each patient.
Data availability
The data generated in this study are available within the article and its supplementary data files S1. Expression profile data analysed in this study were obtained from Gene Expression Omnibus (GEO) at GSE52068, GSE62336 and GSE134886.
References
Mahdavifar, N., Ghoncheh, M., Mohammadian-Hafshejani, A., Khosravi, B. & Salehiniya, H. Epidemiology and inequality in the incidence and mortality of nasopharynx cancer in Asia. Osong. Public Health Res. Perspect 7, 360–372. https://doi.org/10.1016/j.phrp.2016.11.002 (2016).
Abdullah, B., Alias, A. & Hassan, S. Challenges in the management of nasopharyngeal carcinoma: A review. Malays J. Med. Sci. 16, 50–54 (2009).
Chua, D. T. T. et al. Long-term survival after cisplatin-based induction chemotherapy and radiotherapy for nasopharyngeal carcinoma: A pooled data analysis of two phase III trials. J. Clin. Oncol. 23, 1118–1124. https://doi.org/10.1200/JCO.2005.12.081 (2005).
Chen, L. et al. Concurrent chemoradiotherapy plus adjuvant chemotherapy versus concurrent chemoradiotherapy alone in patients with locoregionally advanced nasopharyngeal carcinoma: A phase 3 multicentre randomised controlled trial. Lancet Oncol. 13, 163–171. https://doi.org/10.1016/s1470-2045(11)70320-5 (2012).
Jiang, W. et al. Genome-wide identification of a methylation gene panel as a prognostic biomarker in nasopharyngeal carcinoma. Mol. Cancer Ther. 14, 2864–2873. https://doi.org/10.1158/1535-7163.Mct-15-0260 (2015).
Brock, M. V. et al. DNA methylation markers and early recurrence in stage I lung cancer. N. Engl. J. Med. 358, 1118–1128. https://doi.org/10.1056/NEJMoa0706550 (2008).
de Maat, M. F. et al. Quantitative analysis of methylation of genomic loci in early-stage rectal cancer predicts distant recurrence. J. Clin. Oncol. 26, 2327–2335. https://doi.org/10.1200/jco.2007.14.0723 (2008).
Kuang, Y. et al. Genome-wide analysis of methylation-Driven genes and identification of an eight-gene panel for prognosis prediction in breast cancer. Front. Genet. 11, 301. https://doi.org/10.3389/fgene.2020.00301 (2020).
Mori, T. et al. Predictive utility of circulating methylated DNA in serum of melanoma patients receiving biochemotherapy. J. Clin. Oncol. 23, 9351–9358. https://doi.org/10.1200/jco.2005.02.9876 (2005).
Locke, W. J. et al. DNA methylation cancer biomarkers: Translation to the clinic. Front Genet 10, 1150. https://doi.org/10.3389/fgene.2019.01150 (2019).
Ross, J. P., Rand, K. N. & Molloy, P. L. Hypomethylation of repeated DNA sequences in cancer. Epigenomics 2, 245–269. https://doi.org/10.2217/epi.10.2 (2010).
Ehrlich, M. DNA methylation in cancer: too much, but also too little. Oncogene 21, 5400–5413. https://doi.org/10.1038/sj.onc.1205651 (2002).
Stone, A. et al. DNA methylation of oestrogen-regulated enhancers defines endocrine sensitivity in breast cancer. Nat. Commun. 6, 7758. https://doi.org/10.1038/ncomms8758 (2015).
Daskalos, A. et al. Hypomethylation of retrotransposable elements correlates with genomic instability in non-small cell lung cancer. Int. J. Cancer 124, 81–87. https://doi.org/10.1002/ijc.23849 (2009).
Prada, D. et al. Satellite 2 demethylation induced by 5-azacytidine is associated with missegregation of chromosomes 1 and 16 in human somatic cells. Mutat. Res. 729, 100–105. https://doi.org/10.1016/j.mrfmmm.2011.10.007 (2012).
Raut, J. R., Guan, Z., Schrotz-King, P. & Brenner, H. Whole-blood DNA methylation markers for risk stratification in colorectal cancer screening: A systematic review. Cancers (Basel) 11. https://doi.org/10.3390/cancers11070912 (2019).
Jiang, W., Cai, R. & Chen, Q. Q. DNA methylation biomarkers for nasopharyngeal carcinoma: Diagnostic and prognostic tools. Asian Pac. J. Cancer Prev. 16, 8059–8065. https://doi.org/10.7314/apjcp.2015.16.18.8059 (2015).
Dai, W. et al. Comparative methylome analysis in solid tumors reveals aberrant methylation at chromosome 6p in nasopharyngeal carcinoma. Cancer Med. 4, 1079–1090. https://doi.org/10.1002/cam4.451 (2015).
Xu, T. & Gao, H. Hydroxymethylation and tumors: Can 5-hydroxymethylation be used as a marker for tumor diagnosis and treatment?. Hum Genom. 14, 15. https://doi.org/10.1186/s40246-020-00265-5 (2020).
McGuire, M. H. et al. Pan-cancer genomic analysis links 3’UTR DNA methylation with increased gene expression in T cells. EBioMedicine 43, 127–137. https://doi.org/10.1016/j.ebiom.2019.04.045 (2019).
Rauluseviciute, I., Drabløs, F. & Rye, M. B. DNA hypermethylation associated with upregulated gene expression in prostate cancer demonstrates the diversity of epigenetic regulation. BMC Med. Genom. 13, 6. https://doi.org/10.1186/s12920-020-0657-6 (2020).
Evans, D. G. R. et al. A dominantly inherited 5’ UTR variant causing methylation-associated silencing of BRCA1 as a cause of breast and ovarian cancer. Am. J. Hum. Genet. 103, 213–220. https://doi.org/10.1016/j.ajhg.2018.07.002 (2018).
Arechederra, M. et al. Hypermethylation of gene body CpG islands predicts high dosage of functional oncogenes in liver cancer. Nat. Commun. 9, 3164. https://doi.org/10.1038/s41467-018-05550-5 (2018).
Nassar, F. J., Msheik, Z. S., Nasr, R. R. & Temraz, S. N. Methylated circulating tumor DNA as a biomarker for colorectal cancer diagnosis, prognosis, and prediction. Clin. Epigenet. 13, 111. https://doi.org/10.1186/s13148-021-01095-5 (2021).
Vlassenbroeck, I. et al. Validation of real-time methylation-specific PCR to determine O6-methylguanine-DNA methyltransferase gene promoter methylation in glioma. J. Mol. Diagn. 10, 332–337. https://doi.org/10.2353/jmoldx.2008.070169 (2008).
Ogino, S. et al. Precision and performance characteristics of bisulfite conversion and real-time PCR (MethyLight) for quantitative DNA methylation analysis. J. Mol. Diagn. 8, 209–217. https://doi.org/10.2353/jmoldx.2006.050135 (2006).
Hattermann, K., Mehdorn, H. M., Mentlein, R., Schultka, S. & Held-Feindt, J. A methylation-specific and SYBR-green-based quantitative polymerase chain reaction technique for O6-methylguanine DNA methyltransferase promoter methylation analysis. Anal. Biochem. 377, 62–71. https://doi.org/10.1016/j.ab.2008.03.014 (2008).
Yoshioka, M. et al. Real-time methylation-specific PCR for the evaluation of methylation status of MGMT gene in glioblastoma. Oncotarget 9, 27728–27735. https://doi.org/10.18632/oncotarget.25543 (2018).
Vishnoi, K., Viswakarma, N., Rana, A. & Rana, B. Transcription factors in cancer development and therapy. Cancers (Basel) 12, 1. https://doi.org/10.3390/cancers12082296 (2020).
Héberlé, É. & Bardet, A. F. Sensitivity of transcription factors to DNA methylation. Essays Biochem. 63, 727–741. https://doi.org/10.1042/ebc20190033 (2019).
Reimand, J. et al. Pathway enrichment analysis and visualization of omics data using g:Profiler, GSEA. Cytosc. Enrichment Map. Nat. Protoc 14, 482–517. https://doi.org/10.1038/s41596-018-0103-9 (2019).
Xu, Z., Sandler, D. P. & Taylor, J. A. Blood DNA methylation and breast cancer: A prospective case-cohort analysis in the sister study. J. Natl. Cancer Inst. 112, 87–94. https://doi.org/10.1093/jnci/djz065 (2020).
Kao, W. Y. et al. Genome-wide identification of blood DNA methylation patterns associated with early-onset hepatocellular carcinoma development in hepatitis B carriers. Mol. Carcinog. 56, 425–435. https://doi.org/10.1002/mc.22505 (2017).
Dong, L. & Ren, H. Blood-based DNA methylation biomarkers for early detection of colorectal cancer. J. Proteom. Bioinform. 11, 120–126. https://doi.org/10.4172/jpb.1000477 (2018).
Hsiung, D. T. et al. Global DNA methylation level in whole blood as a biomarker in head and neck squamous cell carcinoma. Cancer Epidemiol. Biomarkers Prev. 16, 108–114. https://doi.org/10.1158/1055-9965.Epi-06-0636 (2007).
Jjingo, D., Conley, A. B., Yi, S. V., Lunyak, V. V. & Jordan, I. K. On the presence and role of human gene-body DNA methylation. Oncotarget 3, 462–474. https://doi.org/10.18632/oncotarget.497 (2012).
Esteller, M. CpG island hypermethylation and tumor suppressor genes: A booming present, a brighter future. Oncogene 21, 5427–5440. https://doi.org/10.1038/sj.onc.1205600 (2002).
Yegnasubramanian, S. et al. DNA hypomethylation arises later in prostate cancer progression than CpG island hypermethylation and contributes to metastatic tumor heterogeneity. Cancer Res. 68, 8954–8967. https://doi.org/10.1158/0008-5472.Can-07-6088 (2008).
Jen, J. & Wang, Y. C. Zinc finger proteins in cancer progression. J. Biomed. Sci. 23, 53. https://doi.org/10.1186/s12929-016-0269-9 (2016).
Katoh, M., Igarashi, M., Fukuda, H., Nakagama, H. & Katoh, M. Cancer genetics and genomics of human FOX family genes. Cancer Lett. 328, 198–206. https://doi.org/10.1016/j.canlet.2012.09.017 (2013).
Seto, K. K. Y. & Andrulis, I. L. Atypical protein kinase C zeta: Potential player in cell survival and cell migration of ovarian cancer. PLOS ONE 10, e0123528. https://doi.org/10.1371/journal.pone.0123528 (2015).
Xiao, Z. J. et al. NFATc2 enhances tumor-initiating phenotypes through the NFATc2/SOX2/ALDH axis in lung adenocarcinoma. Elife 6. https://doi.org/10.7554/eLife.26733 (2017).
Tian, M. et al. IRF3 prevents colorectal tumorigenesis via inhibiting the nuclear translocation of β-catenin. Nat. Commun. 11, 5762. https://doi.org/10.1038/s41467-020-19627-7 (2020).
Lee, C. J. et al. Stat2 stability regulation: An intersection between immunity and carcinogenesis. Exp. Mol. Med. 52, 1526–1536. https://doi.org/10.1038/s12276-020-00506-6 (2020).
Li, F., Wang, T. & Tang, S. SOX14 promotes proliferation and invasion of cervical cancer cells through Wnt/β-catenin pathway. Int. J. Clin. Exp. Pathol. 8, 1698–1704 (2015).
Kim, H. S. et al. Functional roles of Src and Fgr in ovarian carcinoma. Clin. Cancer Res. 17, 1713–1721. https://doi.org/10.1158/1078-0432.Ccr-10-2081 (2011).
Wilson, C. & Krieg, A. J. KDM4B: A nail for every hammer? Genes (Basel) 10. https://doi.org/10.3390/genes10020134 (2019).
Nallanthighal, S., Heiserman, J. P. & Cheon, D. J. Collagen type XI alpha 1 (COL11A1): A novel biomarker and a key player in cancer. Cancers (Basel) 13. https://doi.org/10.3390/cancers13050935 (2021).
Yan, X., Lin, Y., Liu, S., aziz, F. & Yan, Q. Fucosyltransferase IV (FUT4) as an effective biomarker for the diagnosis of breast cancer. Biomed. Pharmacother. 70, 299–304. https://doi.org/10.1016/j.biopha.2014.12.048 (2015).
Scaloni, A. et al. Deficiency of acylpeptide hydrolase in small-cell lung carcinoma cell lines. J. Lab. Clin. Med. 120, 546–552 (1992).
Kawahara, M. et al. H2.0-like homeobox regulates early hematopoiesis and promotes acute myeloid leukemia. Cancer Cell 22, 194–208. https://doi.org/10.1016/j.ccr.2012.06.027 (2012).
Saraiva, D. P. et al. Expression of HLA-DR in cytotoxic T lymphocytes: A validated predictive biomarker and a potential therapeutic strategy in breast cancer. Cancers (Basel) 13. https://doi.org/10.3390/cancers13153841 (2021).
Thompson, L. World Health Organization classification of tumours: Pathology and genetics of head and neck tumours. Ear Nose Throat J. 85, 74 (2006).
Chan, A. T. et al. Nasopharyngeal cancer: EHNS-ESMO-ESTRO Clinical Practice Guidelines for diagnosis, treatment and follow-up. Ann. Oncol. 23(Suppl 7), vii83–vii85. https://doi.org/10.1093/annonc/mds266 (2012).
Huang, S. H. & O'Sullivan, B. Overview of the 8th Edition TNM Classification for Head and Neck Cancer. Curr. Treat Opt. Oncol. 18, 40. https://doi.org/10.1007/s11864-017-0484-y (2017).
Tian, Y. et al. ChAMP: Updated methylation analysis pipeline for Illumina BeadChips. Bioinformatics 33, 3982–3984. https://doi.org/10.1093/bioinformatics/btx513 (2017).
Pidsley, R. et al. Critical evaluation of the Illumina MethylationEPIC BeadChip microarray for whole-genome DNA methylation profiling. Genome Biol. 17, 208. https://doi.org/10.1186/s13059-016-1066-1 (2016).
Wang, T. et al. A systematic study of normalization methods for Infinium 450K methylation data using whole-genome bisulfite sequencing data. Epigenetics 10, 662–669. https://doi.org/10.1080/15592294.2015.1057384 (2015).
Basu, B. et al. Genome-wide DNA methylation profile identified a unique set of differentially methylated immune genes in oral squamous cell carcinoma patients in India. Clin. Epigenet. 9, 13. https://doi.org/10.1186/s13148-017-0314-x (2017).
Khongsti, S. et al. Whole genome DNA methylation profiling of oral cancer in ethnic population of Meghalaya, North East India reveals novel genes. Genomics 110, 112–123. https://doi.org/10.1016/j.ygeno.2017.09.002 (2018).
Ernst, J., Plasterer, H. L., Simon, I. & Bar-Joseph, Z. Integrating multiple evidence sources to predict transcription factor binding in the human genome. Genome Res. 20, 526–536. https://doi.org/10.1101/gr.096305.109 (2010).
Ge, S. X., Jung, D. & Yao, R. ShinyGO: A graphical gene-set enrichment tool for animals and plants. Bioinformatics 36, 2628–2629. https://doi.org/10.1093/bioinformatics/btz931 (2020).
Zhou, G. et al. NetworkAnalyst 3.0: A visual analytics platform for comprehensive gene expression profiling and meta-analysis. Nucleic Acids Res. 47, W234–W241. https://doi.org/10.1093/nar/gkz240 (2019).
Breuer, K. et al. InnateDB: Systems biology of innate immunity and beyond—recent updates and continuing curation. Nucleic Acids Res. 41, D1228-1233. https://doi.org/10.1093/nar/gks1147 (2013).
Acknowledgements
This work was supported by Visva-Bharati University, West Bengal, India. We express sincere gratitude to the doctors and workers of Eden Medical Centre, Dimapur, Nagaland.
Funding
The study was supported by the extramural grand of the Department of Biotechnology, India. The funding agency had no role in study design, data collection, and analysis, decision to publish, or manuscript preparation.
Author information
Authors and Affiliations
Contributions
K.C.: conceptualization, methodology, formal analysis and investigation, writing—original draft preparation; S.M.: formal analysis and investigation, review and editing; M.G.: Formal analysis and manuscript preparation; S.D.R. and M.A.: resources (provided the NPC samples, patient’s information and helped on data collection; N.R.C., K.C., S.M.: original draft preparation; T.C.: supervision, funding and conceptualization, review and manuscript preparation and final editing.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Chatterjee, K., Mal, S., Ghosh, M. et al. Blood-based DNA methylation in advanced Nasopharyngeal Carcinoma exhibited distinct CpG methylation signature. Sci Rep 13, 22086 (2023). https://doi.org/10.1038/s41598-023-45001-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-45001-w
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.