Disulfidptosis-associated lncRNAs predict breast cancer subtypes

Xia, Qing; Yan, Qibin; Wang, Zehua; Huang, Qinyuan; Zheng, Xinying; Shen, Jinze; Du, Lihua; Li, Hanbing; Duan, Shiwei

doi:10.1038/s41598-023-43414-1

Download PDF

Article
Open access
Published: 27 September 2023

Disulfidptosis-associated lncRNAs predict breast cancer subtypes

Qing Xia^1,2,
Qibin Yan^1,2,
Zehua Wang¹,
Qinyuan Huang¹,
Xinying Zheng^1,2,
Jinze Shen¹,
Lihua Du²,
Hanbing Li² &
…
Shiwei Duan¹

Scientific Reports volume 13, Article number: 16268 (2023) Cite this article

1094 Accesses
3 Citations
Metrics details

Subjects

Abstract

Disulfidptosis is a newly discovered mode of cell death. However, its relationship with breast cancer subtypes remains unclear. In this study, we aimed to construct a disulfidptosis-associated breast cancer subtype prediction model. We obtained 19 disulfidptosis-related genes from published articles and performed correlation analysis with lncRNAs differentially expressed in breast cancer. We then used the random forest algorithm to select important lncRNAs and establish a breast cancer subtype prediction model. We identified 132 lncRNAs significantly associated with disulfidptosis (FDR < 0.01, |R|> 0.15) and selected the first four important lncRNAs to build a prediction model (training set AUC = 0.992). The model accurately predicted breast cancer subtypes (test set AUC = 0.842). Among the key lncRNAs, LINC02188 had the highest expression in the Basal subtype, while LINC01488 and GATA3-AS1 had the lowest expression in Basal. In the Her2 subtype, LINC00511 had the highest expression level compared to other key lncRNAs. GATA3-AS1 had the highest expression in LumA and LumB subtypes, while LINC00511 had the lowest expression in these subtypes. In the Normal subtype, GATA3-AS1 had the highest expression level compared to other key lncRNAs. Our study also found that key lncRNAs were closely related to RNA methylation modification and angiogenesis (FDR < 0.05, |R|> 0.1), as well as immune infiltrating cells (P.adj < 0.01, |R|> 0.1). Our random forest model based on disulfidptosis-related lncRNAs can accurately predict breast cancer subtypes and provide a new direction for research on clinical therapeutic targets for breast cancer.

Prediction of DNA methylation-based tumor types from histopathology in central nervous system tumors with deep learning

Article 17 May 2024

Single-cell and spatial transcriptomics analysis of non-small cell lung cancer

Article Open access 23 May 2024

Tumor biomarkers for diagnosis, prognosis and targeted therapy

Article Open access 20 May 2024

Introduction

Breast cancer is one of the most common malignancies in women and is responsible for the highest mortality rate among women¹. It is a genetically and clinically heterogeneous disease with multiple subtypes that have distinct molecular features². PAM50 technology can detect the expression levels of 55 genes and divide breast cancer into 5 subtypes: Luminal A (LumA), Luminal B (LumB), HER2-enriched (Her2), Basal-like (Basal), and Normal-like (Normal)^3,4. However, PAM50 assay is expensive and difficult to perform⁵, necessitating the need for new, less expensive alternatives to predict breast cancer subtypes.

Disulfidptosis is a newly discovered form of cell death⁶. Under glucose-deficient conditions, cells with high expression of SLC7A11 consume large quantities of NADPH, leading to an abnormal accumulation of disulfides such as cystine. This results in disulfide stress and rapid cell death⁷. It has been found that levels of methionine and cysteine are increased in colorectal cancer tissues⁸. Hydrogen sulfide (H2S) and related reactive sulfur species (RSS) help cancer cells adapt to the immune microenvironment⁹.

Long noncoding RNAs (lncRNAs) are transcripts longer than 200 nucleotides that do not encode proteins¹⁰. lncRNAs can act as decoys, scaffolds, and enhancers, and are involved in chromatin remodeling and transcriptional and post-transcriptional regulation¹¹. There is accumulating evidence that lncRNAs often play oncogenic or tumor suppressor roles in human cancers^12,13.

Machine learning (ML) is a powerful data analysis technique¹⁴ that leverages algorithms capable of processing complex functions to construct highly accurate predictive models. ML finds applications across various domains of clinical research, enabling breakthroughs such as the detection of COVID-19¹⁵, the diagnosis of coronary artery disease¹⁶, the identification of prostate cancer¹⁷, and the classification of leukemia subtypes¹⁸. One noteworthy ML algorithm is Random Forest (RF), which belongs to the ensemble learning category. RF harnesses the collective power of numerous individual decision trees for tasks like classification and feature selection¹⁹. In this collaborative process, each tree within the random forest makes predictions and casts votes, with the class garnering the most votes ultimately becoming the prediction of the overall model²⁰. The distinguishing advantage of the RF model lies in its teamwork approach, akin to having a team of classifiers. These individual “team members” work synergistically to derive the final prediction result, delivering remarkable efficiency and exceptional accuracy²¹. Remarkably, despite the extensive exploration of machine learning techniques in the context of breast cancer prediction²², no prior studies have delved into the utilization of Random Forest models for predicting breast cancer subtypes. By integrating the strengths of the RF algorithm into the realm of breast cancer subtype prediction, we aim to unlock new avenues of insight and potentially enhance the accuracy of this critical healthcare application.

Materials and methods

Datasets

Data collection and processing were carried out as follows. Given the TCGA database’s comprehensive amalgamation of genetic, clinical, and image data spanning diverse tumor types, it stands as an indispensable asset in the field of cancer research. Therefore, our initial step involved procuring RNA-seq data for breast cancer and adjacent normal tissues directly from the TCGA database (https://portal.gdc.cancer.gov/). Subtype classification of TCGA-BRCA patients was then obtained (Table S1)²³. After excluding patients with unknown subtypes, a total of 1104 tumor samples and 113 adjacent normal samples remained. Genes with zero expression in more than half of the patients were removed, resulting in the extraction of 3305 lncRNAs. The 1104 patients were randomly divided into a training group (776 patients) and a test group (328 patients) using the caret package in R statistical software.

Identification of differential lncRNAs associated with disulfidptosis

The voom algorithm of the limma package (version 3.52.4)²⁴ in R software (version 4.3.0) was used to identify lncRNAs that were differentially expressed between breast cancer tissues and adjacent normal tissues. lncRNAs were considered significantly differentially expressed if they had a false discovery rate (FDR) of less than 0.05 and an absolute log2 fold change (|logFC|) of greater than or equal to 1. The relationship between differentially expressed lncRNAs and disulfidptosis-related genes was then assessed using the Pearson correlation score. A significant correlation was defined as having an absolute correlation coefficient (|R|) greater than 0.15 and an FDR less than 0.05.

Establishment and evaluation of breast cancer subtype prediction model

The Random Forest (RF) algorithm in the Random Forest R software package (version 4.7–1.1)²⁵ was used for gene selection and model building by reducing the feature dimension based on variable importance (VIMP) and minimum depth. The article explains several key performance metrics for classifier assessment. AUC, denoting the area under the ROC curve, serves as a crucial gauge for classifier performance. Specificity (Sp) quantifies the proportion of accurately classified negative samples, while Sensitivity (Sn) measures the proportion of correct classifications among actual positive samples. Precision, on the other hand, estimates the ratio of correctly classified positive samples within the overall positives. The F1 score offers valuable insights into classification balance. To assess the breast cancer subtype prediction model effectively, employ a comprehensive evaluation comprising AUC, Sp, Sn, Precision, and F1 score. The SHAP package²⁶ in Python was used to provide the degree of influence of each feature in the model and its positive or negative impact on each predicted outcome for explaining machine learning models.

Immune infiltration analysis based on disulfidptosis-associated lncRNAs

Immune infiltration was analyzed using the CIBERSORT deconvolution algorithm²⁷ on R software to calculate the composition of tumor immune cells from expression profiles. Single-Sample Gene Set Enrichment Analysis (ssGSEA) in the GSVA R software package (version :1.44.5)²⁸ was used to calculate the degree of infiltration of 28 immune cell types based on published immune cell gene signatures²⁹. The correlation between disulfidptosis-related lncRNAs and immune infiltration was calculated to explore their relationship in different breast cancer subtypes.

Interaction of RNA methylation with disulfidptosis-associated lncRNAs

RNA methylation modifications are key regulators that affect cellular biological functions such as cell proliferation and metastasis, stem cell differentiation, and homeostasis in cancer³⁰. Three RNA methylation modification-related genes were obtained from the literature, including 23 m6A modification genes³¹, 12 m5C modification genes³², and 10 m1A modification genes³³. Correlations between disulfidptosis-associated lncRNAs and RNA methylation genes were calculated to explore their relationship in different breast cancer subtypes.

Interaction of angiogenesis with disulfidptosis-associated lncRNAs

Angiogenesis, the process of forming new blood vessels from pre-existing vessels, is an important event in tumor growth and hematogenous metastasis³⁴. A set of 36 angiogenesis-related mRNAs was obtained from the Hallmark gene set³⁵. The association between disulfidptosis-associated lncRNAs and angiogenesis-associated genes (AAGs) was assessed to explore their relationship in different breast cancer subtypes.

Statistical analysis

Statistical analyses were performed using R version 4.1.0. Gene expression in tumor tissue was compared to that in adjacent non-tumor tissue using a t-test. Correlations between genes were calculated using Pearson analysis, and differences in proportions between groups were compared using Wilcoxon tests. The corrected P value (P.adj) was calculated using the Bonferroni method. The performance of the models was evaluated using receiver operating characteristic (ROC) curves and the area under the curve (AUC). Two-sided tests were used to report p-values, with values less than 0.05 considered statistically significant.

Ethics approval and consent to participate

TCGA belong to public databases. The patients involved in the database have obtained ethical approval. Users can download relevant data for free for research and publish relevant articles. Our study is based on open-source data, so there are no ethical issues and other conflicts of interest.

Results

Enrichment analysis of disulfidptosis-related genes

We conducted an enrichment analysis of disulfidptosis-related genes by identifying 19 disulfidptosis-related mRNAs through a literature search (Fig. 1). Under glucose-deficient conditions, high expression of SLC7A11 mediates cystine uptake into cells, consuming large amounts of NADPH and reducing cystine to cysteine, resulting in NADPH depletion. This promotes the oxidation of the sulfhydryl group (–SH) of cysteine on actin cytoskeletal proteins to form intermolecular or intramolecular disulfide bonds (–S–S–), leading to the collapse of the cytoskeleton and separation of the plasma membrane, eventually inducing cell death^6,7,36. INF2 and PDLIM1 are involved in actin synthesis and have the unique ability to accelerate actin polymerization and depolymerization³⁷. CD2AP can recruit capping proteins to specific subcellular locations and modulate their actin capping activity through allosteric effects to affect actin assembly³⁸. MYH9 and MYH10 interact with actin to become part of the cytoskeleton. Under glucose-deficient conditions, disulfide bonds can form between MYH9 and MYH10 proteins, leading to abnormal protein function^39,40. ACTN4, FLNA, FLNB, IQGAP1, and TLN1 are intracellular actin-binding proteins that maintain cytoskeleton stability^41,42,43. MYL6 is involved in muscle contraction and cell motility by interacting with actin and myosin heavy chains⁴⁴. Aberrant expression and aggregation of ACTB can affect cytoskeletal changes⁴⁵. DSTN promotes depolymerization and reorganization of actin and is involved in cytoskeletal remodeling and regulation of actin filament turnover⁴⁶. CAPZB plays an important role in regulating the dynamics of actin filaments and stabilizing their length⁴⁷. RPN1 and NCKAP1 are cytoskeletal proteins involved in upregulating Arp2/3 complex-mediated actin nucleation⁴⁸.

Feature selection

A total of 345 differentially expressed lncRNAs were found in breast cancer (FDR < 0.05, |logFC|> 1.5), including 129 that were up-regulated and 216 that were down-regulated (Fig. 2A). Using Pearson correlation analysis, 132 lncRNAs significantly associated with disulfidptosis were identified (p.adj < 0.05, |R|> 0.15). Preprocessing, which includes feature selection, is crucial in machine learning, as is well known. It can somewhat increase the model's prediction accuracy in addition to reducing the complexity of the training model⁴⁹. As a result, we prioritize the significance of characteristics using the random forest algorithm, screen out features that are closely related to the model, and improve the accuracy of the breast cancer prediction model. Four lncRNAs with the highest relative importance values and relative importance greater than 40 were selected as key factors for constructing a model to predict breast cancer subtypes (Fig. 2B,C).

We identified biomarkers by evaluating the predictive models (Figs. 2D–F). The expression levels of the four key lncRNAs used to build the model were significantly different among the five subtypes. In the Basal subtype, LINC00511 and LINC02188 had the highest expression (0.75–1 quantile, Q4), while LINC01488 had the lowest (0.25–0.5 quantile, Q2). LINC00511 was significantly higher than adjacent tissues (P < 0.001). In the Her2 subtype, GATA3-AS1 had the highest expression (Q4), while LINC01488 had the lowest (Q2). LINC00511 and LINC01488 were significantly higher than adjacent tissues (P < 0.001). In LumA and LumB subtypes, GATA3-AS1 had the highest expression (Q4), while LINC00511 had the lowest (0.5–0.75 quantile, Q3). LINC00511 and LINC01488 were significantly higher than adjacent tissues (P < 0.001), while LINC02188 was significantly lower (P < 0.001). In the Normal subtype, GATA3-AS1 had the highest expression (Q4), and LINC00511 and LINC01488 were significantly higher than adjacent tissues (P < 0.001).

Evaluation of a model of disulfidptosis-associated lncRNAs for predicting breast cancer subtypes

The Support Vector Machine model (SVM) is a supervised learning algorithm utilized for data analysis via classification and regression⁵⁰. The K-Nearest Neighbor (KNN) algorithm, on the other hand, is a straightforward instance-based learning technique⁵¹. Meanwhile, the Naive Bayesian (NB) classifier stands as a well-established supervised algorithm within the field of machine learning⁵². Subsequently, we harnessed Random Forest (RF) to construct a breast cancer subtype prediction model, incorporating these four pivotal algorithms for comparison: RF, KNN, SVM, and NB.

In Fig. 3A–D, we present the AUC (Area Under the Curve) results for these four machine learning models. RF achieves an AUC of 0.842, while NB records an AUC of 0.583, SVM achieves 0.826, and KNN attains 0.865. Figure 3E exhibits additional metrics, including Specificity (Sp), Sensitivity (Sn), Precision, and F1 score for the three machine learning models. For RF, Sp is 0.8, Sn is 0.6, Precision is 0.57, and F1 score is 0.58. KNN yields Sp of 0.77, Sn of 0.62, Precision of 0.52, and F1 score of 0.55. Meanwhile, NB displays Sp of 0.8, Sn of 0.36, Precision of 0.42, and F1 score of 0.35. SVM produces Sp of 0.8, Sn of 0.35, Precision of 0.37, and F1 score of 0.3.

While KNN demonstrates a higher AUC value than RF, it’s essential to consider that RF outperforms KNN in terms of Sp, Precision, and F1 score. Furthermore, the overall evaluation indices of RF surpass those of SVM and NB models, underscoring the robustness of the breast cancer prediction model constructed using Random Forest.

We then used the SHAP model to evaluate the role of key lncRNAs in our model. In the model, GATA3-AS1 had the most significant impact, while LINC00511 had the least. The results indicated that each key gene had a different contribution to the model (Fig. 3F).

Immune infiltration and the disulfidptosis-associated lncRNAs

We conducted an immune infiltration assay to explore the relationship between disulfidptosis and immune infiltration in different breast cancer subtypes. CIBERSORT and ssGSEA were used to evaluate the immune infiltration of patients, and the correlation between key lncRNAs and immune cells was calculated. Key lncRNAs were found to have varying degrees of correlation with the level of immune cell infiltration (Fig. 4A).

RNA methylation genes and the disulfidptosis-associated lncRNAs

We explored the relationship between disulfidptosis-related lncRNAs and RNA methylation in different breast cancer subtypes by assessing the correlation of key lncRNAs with RNA methylation genes. We observed diversity in the association between four lncRNAs and RNA m6A modifying genes (Fig. 4B). GATA3-AS1 and LINC01488 showed positive correlations with RNA m1A modifier gene expression, while LINC00511 showed a negative correlation. The correlation between LINC02188 and RNA m1A modifier genes varied (Fig. 4C). LINC00511, LINC01488, and LINC02188 were positively correlated with RNA m5C modification genes (Fig. 4D), while the association between GATA3-AS1 and m5C modifier genes was diverse (Fig. 4D).

Angiogenic genes and the disulfidptosis-associated lncRNAs

Angiogenesis, the formation of new blood vessels, has been shown to be integral to cancer development⁵³. Our study assessed the correlation of key lncRNAs with angiogenic genes to explore the relationship between disulfidptosis and angiogenesis in different breast cancer subtypes. Our results indicate that the relationship between the four lncRNAs and angiogenesis is intricate. Each of the four lncRNAs exhibited positive or negative correlations with multiple angiogenic genes (Fig. 4E).

Discussion

Disulfidptosis is a recently discovered type of cell death that differs from apoptosis, autophagy, and ferroptosis. In this study, we established a prediction model for breast cancer subtypes based on 4 lncRNAs related to disulfidptosis. The model includes 3 lncRNAs that are highly expressed in breast cancer (GATA3-AS1, LINC00511, and LINC01488) and 1 lncRNA that is lowly expressed (LINC02188). LINC01488 and LINC00511 showed higher expression in the Basal subtype, while GATA3-AS1 showed higher expression in the Her2 and Normal subtypes. GATA3-AS1 and LINC02188 showed higher expression in the LumA and LumB subtypes.

In this study, LINC02188 was found to be associated with a reduced risk of Her2, LumA, and LumB breast cancer subtypes for the first time. Located on chromosome 16 and 658 bp in length, LINC02188 has been shown to be associated with the activation of various immune cells, RNA methylation modifier genes, and the expression of multiple angiogenic factors. The COPS3 protein is a subunit of the COP9 signalosome (CSN) that exerts deubiquitination and protein kinase activity in various processes⁵⁴. PD-L1 is a ligand for programmed cell death protein 1 (PD-1) that inhibits T cell signaling by interacting with PD-1⁵⁵. GATA3 is a transcription factor that plays an important role in the differentiation of mammary epithelium, urothelium, and T lymphocyte subsets⁵⁶. GATA3-AS1 induces PD-L5 deubiquitination through the miR-1-676p/COPS3 axis while destabilizing GATA3 protein by promoting its ubiquitination, thus promoting TNBC progression and immune escape⁵⁷. MMP13 is a matrix metalloproteinase that remodels the extracellular matrix and promotes cancer cell invasiveness⁵⁸. LINC00511 promotes breast cancer proliferation, migration, and invasion through the miR-150/MMP13 axis. In HCC, LINC01488 inhibits metastasis and tumorigenesis via the miR-124-3p|miR-138-5p/vimentin axis⁵⁹. Our research shows that LINC01488 is lowly expressed in the LumB subtype of breast cancer but highly expressed in the Basal, Normal, Her2, and LumA subtypes. We speculate that LINC01488 expression may be tissue-specific.

Plasmacytoid dendritic cells (pDCs) can recognize viruses and tumor cells and enhance the function of natural killer cells (NK cells), T cells, B cells, and other dendritic cells to promote cellular innate and adaptive immune responses⁶⁰. T cells are lymphocytes that can kill tumor cells by recognizing tumor-specific or tumor-associated antigens, exerting anti-tumor immune effects and playing a key role in tumor monitoring^61,62. Macrophages are important cells in the tumor microenvironment that can polarize into M1 or M2 phenotypes in response to different stimuli and signals⁶³. M1 macrophages mainly induce the production of pro-inflammatory cytokines such as TNF-α, IL-1β, IL-6, and IL-12, which are conducive to anti-tumor effects⁶⁴. In the Basal subtype, high expression of LINC02188 and LINC00511 may increase immune infiltration of tumor tissue by activating various immune cells, thereby inhibiting tumor development. Mast cells can promote tumor cell proliferation and invasion⁶⁵. M2 macrophages secrete anti-inflammatory cytokines such as IL-10, CCL18, and CCL22, which are beneficial to cancer cell growth⁶⁶. In the LumA and LumB subtypes, high expression of LINC01488 and GATA3-AS1 may activate Mast cells and M2 macrophages, promoting tumor immune escape and development.

RNA methylation plays a critical role in cancer development⁶⁷. m6A is the most prevalent internal mRNA modification in eukaryotic cells and regulates multiple RNA processing steps⁶⁸. The relationship between the four lncRNAs and m6A modifying genes is complex and has both positive and negative correlations. RNA m1A modification disrupts base pairing and can affect local RNA structure or protein-RNA interaction⁶⁹. GATA3-AS1 and LINC01488 were positively correlated with RNA m1A modifier gene expression, while LINC00511 was negatively correlated. RNA m5C modification can promote mRNA nucleoplasmic transport, DNA damage repair, enhance mRNA stability and regulate mRNA splicing⁷⁰. LINC00511, LINC01488, LINC02188 were all positively correlated with RNA m5C modification genes. Since DNMT3A, DNMT3B, and DNMT1 are also responsible for DNA m5C methylation modification, this suggests that LINC00511 and LINC02188 may promote DNA methylation while GATA3-AS1 reduces it.

This study has some limitations. Firstly, we only used the TCGA internal dataset for analysis and lack external validation. In the future, we need to verify the accuracy of the model using more clinical samples. Secondly, as disulfidptosis research is a novel and fast-growing field, more regulators may be discovered in the future and the model can be further optimized after a deeper understanding of the biological process of disulfidptosis. Additionally, this study was based on RNA profiling and cannot explain the direct molecular mechanism of disulfidptosis-related lncRNAs in breast cancer development at the protein level.

Prior research endeavors have explored the application of computational methodologies, notably machine learning models, in the realms of breast cancer diagnosis and prognosis. For instance, one study leveraged support vector machine (SVM) techniques, employing FTIR spectra from plasma, to detect breast cancer⁷¹. Another notable contribution by Mariia V. Guryleva et al. combined the Boruta algorithm with a Random Forest (RF) model to delineate genes associated with Polyunsaturated Fatty Acid (PUFA) metabolism changes in breast cancer, thereby facilitating breast cancer subtype prediction⁷². Hang et al. introduced an MRI-based multiparameter radiomics model that adeptly forecasts molecular subtypes and androgen receptor expression in breast cancer⁷³. Additionally, Zheng et al. harnessed deep learning radiomics for the prediction of axillary lymph node status in early breast cancer⁷⁴.

In contrast to prior studies, our research represents a pioneering effort in incorporating disulfidptosis-associated long non-coding RNAs (lncRNAs) into the signature screening process. This novel approach enabled the identification of four distinctive biomarkers showcasing differential expression patterns across various breast cancer subtypes. Remarkably, our findings also suggest an intriguing link between breast cancer subtypes and disulfidptosis phenomena.

Furthermore, it's noteworthy that previous investigations have predominantly focused on utilizing machine learning models trained on breast cancer images for tumor classification into neoplastic and benign categories. While these studies exhibit higher accuracy and specificity, it's imperative to acknowledge the resource-intensive nature of training such models using expensive and time-consuming breast cancer images.

Conclusions

Here, we constructed a breast cancer subtype prediction model containing 4 key lncRNAs using a random forest model. We found strong correlations between key lncRNAs in the model and immune milieu, RNA methylation, and angiogenesis. Our findings reveal the potential function of disulfidptosis-asscociated lncRNAs in breast cancer subtypes and provide a new direction for improving individualized treatment of breast cancer.

Data availability

All relevant data are within the paper and its Supporting Information files. The data that support the findings of this study are openly available in TCGA database at https://portal.gdc.cancer.gov/.

Abbreviations

LumA:: Luminal A
LumB:: Luminal B
Her2:: HER2-enriched
Basal:: Basal-like
Normal:: Normal-like
H2S:: Hydrogen sulfide
RSS:: Reactive sulfur species
lncRNAs:: Long noncoding RNAs
FDR:: False discovery rate
RF:: Random forest
VIMP:: Variable importance
ssGSEA:: Single-sample gene set enrichment analysis
AAGs:: Angiogenesis-associated genes
P.adj:: Corrected P value
AUC:: Area under the curve
ROC:: Receiver operating characteristic
Sp:: Specificity
Se:: Sensitivity
NB:: Naive Bayesian mode
SVM:: Support vector machine
KNN:: K-nearest neighbor
CSN:: COP9 signalosome
PD-1:: Programmed cell death protein 1
pDCs:: Plasmacytoid dendritic cells
NK cells:: Natural killer cells

References

Kashyap, D. et al. Global increase in breast cancer incidence: risk factors and preventive measures. Biomed. Res. Int. 2022, 9605439. https://doi.org/10.1155/2022/9605439 (2022).
Article CAS PubMed PubMed Central Google Scholar
Orrantia-Borunda, E., Anchondo-Nuñez, P., Acuña-Aguilar, L. E., Gómez-Valles, F. O. & Ramírez-Valdespino, C. A. Subtypes of breast cancer. In Breast Cancer (ed. Mayrovitz, H. N.). Brisbane (AU). https://doi.org/10.36255/exon-publications-breast-cancer-subtypes (2022).
Perou, C. M. et al. Molecular portraits of human breast tumours. Nature 406(6797), 747–752. https://doi.org/10.1038/35021093 (2000).
Article ADS CAS PubMed Google Scholar
Iwamoto, T., Kajiwara, Y., Zhu, Y. & Iha, S. Biomarkers of neoadjuvant/adjuvant chemotherapy for breast cancer. Chin. Clin. Oncol. 9(3), 27. https://doi.org/10.21037/cco.2020.01.06 (2020).
Article PubMed Google Scholar
Falck, A. K., Ferno, M., Bendahl, P. O. & Ryden, L. St Gallen molecular subtypes in primary breast cancer and matched lymph node metastases–aspects on distribution and prognosis for patients with luminal A tumours: Results from a prospective randomised trial. BMC Cancer 13, 558. https://doi.org/10.1186/1471-2407-13-558 (2013).
Article PubMed PubMed Central Google Scholar
Zheng, P., Zhou, C., Ding, Y. & Duan, S. Disulfidptosis: A new target for metabolic cancer therapy. J. Exp. Clin. Cancer Res. 42(1), 103. https://doi.org/10.1186/s13046-023-02675-4 (2023).
Article PubMed PubMed Central Google Scholar
Liu, X. et al. Actin cytoskeleton vulnerability to disulfide stress mediates disulfidptosis. Nat. Cell Biol. 25(3), 404–414. https://doi.org/10.1038/s41556-023-01091-2 (2023).
Article CAS PubMed PubMed Central Google Scholar
Fukuoka, H. et al. Sulphur metabolism in colon cancer tissues: A case report and literature review. J. Int. Med. Res. 49(11), 3000605211059936. https://doi.org/10.1177/03000605211059936 (2021).
Article PubMed Google Scholar
Zuhra, K., Tome, C. S., Forte, E., Vicente, J. B. & Giuffre, A. The multifaceted roles of sulfane sulfur species in cancer-associated processes. Biochim. Biophys. Acta Bioenerg. 1862(2), 148338. https://doi.org/10.1016/j.bbabio.2020.148338 (2021).
Article CAS PubMed Google Scholar
Esteller, M. Non-coding RNAs in human disease. Nat. Rev. Genet. 12(12), 861–874. https://doi.org/10.1038/nrg3074 (2011).
Article CAS PubMed Google Scholar
Fang, Y. & Fullwood, M. J. Roles, functions, and mechanisms of long non-coding RNAs in cancer. Genomics Proteomics Bioinform. 14(1), 42–54. https://doi.org/10.1016/j.gpb.2015.09.006 (2016).
Article CAS Google Scholar
Youness, R. A. & Gad, M. Z. Long non-coding RNAs: Functional regulatory players in breast cancer. Noncoding RNA Res. 4(1), 36–44. https://doi.org/10.1016/j.ncrna.2019.01.003 (2019).
Article CAS PubMed PubMed Central Google Scholar
Taheri, M., Omrani, M. D. & Ghafouri-Fard, S. Long non-coding RNA expression in bladder cancer. Biophys. Rev. 10(4), 1205–1213. https://doi.org/10.1007/s12551-017-0379-y (2018).
Article CAS PubMed Google Scholar
Deo, R. C. Machine learning in medicine. Circulation 132(20), 1920–1930. https://doi.org/10.1161/CIRCULATIONAHA.115.001593 (2015).
Article PubMed PubMed Central Google Scholar
Ghaderzadeh, M. & Aria, M. Management of Covid-19 detection using artificial intelligence in 2020 pandemic. Proceedings of the 5th International Conference on Medical and Health Informatics; Kyoto, Japan: Association for Computing Machinery. pp 32–38. https://doi.org/10.1145/3472813.3472820 (2021).
Garavand, A. et al. Efficient model for coronary artery disease diagnosis: A comparative study of several machine learning algorithms. J. Healthc. Eng. 2022, 5359540. https://doi.org/10.1155/2022/5359540 (2022).
Article PubMed PubMed Central Google Scholar
Ghaderzadeh, M. Clinical decision support system for early detection of prostate cancer from benign hyperplasia of prostate. Stud. Health Technol. Inform. 192, 928 (2013).
PubMed Google Scholar
Ghaderzadeh, M., Asadi, F., Hosseini, A., Bashash, D. & Roshanpour, A. J. S. P. Machine learning in detection and classification of leukemia using smear blood images: A systematic review. Sci. Program. 2021(5), 1–14 (2021).
Google Scholar
Rigatti, S. J. Random forest. J. Insur. Med. 47(1), 31–39. https://doi.org/10.17849/insm-47-01-31-39.1 (2017).
Article PubMed Google Scholar
Macaulay, B. O., Aribisala, B. S., Akande, S. A., Akinnuwesi, B. A. & Olabanjo, O. A. Breast cancer risk prediction in African women using random forest classifier. Cancer Treat. Res. Commun. 28, 100396. https://doi.org/10.1016/j.ctarc.2021.100396 (2021).
Article PubMed Google Scholar
Diaz-Uriarte, R. & Alvarez de Andres, S. Gene selection and classification of microarray data using random forest. BMC Bioinform. 7, 3. https://doi.org/10.1186/1471-2105-7-3 (2006).
Article CAS Google Scholar
Reig, B., Heacock, L., Geras, K. J. & Moy, L. Machine learning in breast MRI. J. Magn. Reson. Imaging 52(4), 998–1018. https://doi.org/10.1002/jmri.26852 (2020).
Article PubMed Google Scholar
Schettini, F. et al. Clinical, pathological, and PAM50 gene expression features of HER2-low breast cancer. NPJ Breast Cancer 7(1), 1. https://doi.org/10.1038/s41523-020-00208-2 (2021).
Article CAS PubMed PubMed Central Google Scholar
Costa-Silva, J., Domingues, D. & Lopes, F. M. RNA-Seq differential expression analysis: An extended review and a software tool. PLoS ONE 12(12), e0190152. https://doi.org/10.1371/journal.pone.0190152 (2017).
Article CAS PubMed PubMed Central Google Scholar
Jones, F. C. et al. Random forests as cumulative effects models: A case study of lakes and rivers in Muskoka, Canada. J. Environ. Manag. 201, 407–424. https://doi.org/10.1016/j.jenvman.2017.06.011 (2017).
Article Google Scholar
Scavuzzo, C. M. et al. Feature importance: Opening a soil-transmitted helminth machine learning model via SHAP. Infect. Dis. Model. 7(1), 262–276. https://doi.org/10.1016/j.idm.2022.01.004 (2022).
Article PubMed PubMed Central Google Scholar
Newman, A. M. et al. Robust enumeration of cell subsets from tissue expression profiles. Nat. Methods 12(5), 453–457. https://doi.org/10.1038/nmeth.3337 (2015).
Article CAS PubMed PubMed Central Google Scholar
Hanzelmann, S., Castelo, R. & Guinney, J. GSVA: Gene set variation analysis for microarray and RNA-seq data. BMC Bioinform. 14, 7. https://doi.org/10.1186/1471-2105-14-7 (2013).
Article Google Scholar
Bindea, G. et al. Spatiotemporal dynamics of intratumoral immune cells reveal the immune landscape in human cancer. Immunity 39(4), 782–795. https://doi.org/10.1016/j.immuni.2013.10.003 (2013).
Article CAS PubMed Google Scholar
Song, P., Tayier, S., Cai, Z. & Jia, G. RNA methylation in mammalian development and cancer. Cell Biol. Toxicol. 37(6), 811–831. https://doi.org/10.1007/s10565-021-09627-8 (2021).
Article CAS PubMed PubMed Central Google Scholar
Zhao, Q. et al. m(6)A RNA modification modulates PI3K/Akt/mTOR signal pathway in Gastrointestinal Cancer. Theranostics 10(21), 9528–9543. https://doi.org/10.7150/thno.42971 (2020).
Article CAS PubMed PubMed Central Google Scholar
Chen, B. et al. m5C regulator-mediated modification patterns and tumor microenvironment infiltration characterization in colorectal cancer: One step closer to precision medicine. Front Immunol. 13, 1049435. https://doi.org/10.3389/fimmu.2022.1049435 (2022).
Article CAS PubMed PubMed Central Google Scholar
Zhao, M., Shen, S. & Xue, C. A novel m1A-score model correlated with the immune microenvironment predicts prognosis in hepatocellular carcinoma. Front Immunol. 13, 805967. https://doi.org/10.3389/fimmu.2022.805967 (2022).
Article CAS PubMed PubMed Central Google Scholar
Li, S. et al. Angiogenesis in pancreatic cancer: Current research status and clinical implications. Angiogenesis 22(1), 15–36. https://doi.org/10.1007/s10456-018-9645-2 (2019).
Article PubMed Google Scholar
Hanahan, D. & Weinberg, R. A. Hallmarks of cancer: The next generation. Cell 144(5), 646–674. https://doi.org/10.1016/j.cell.2011.02.013 (2011).
Article CAS PubMed Google Scholar
Gurel, P. S. et al. INF2-mediated severing through actin filament encirclement and disruption. Curr. Biol. 24(2), 156–164. https://doi.org/10.1016/j.cub.2013.12.018 (2014).
Article CAS PubMed PubMed Central Google Scholar
Zhou, J. K., Fan, X., Cheng, J., Liu, W. & Peng, Y. PDLIM1: Structure, function and implication in cancer. Cell Stress. 5(8), 119–127. https://doi.org/10.15698/cst2021.08.254 (2021).
Article CAS PubMed PubMed Central Google Scholar
Edwards, M. et al. Capping protein regulators fine-tune actin assembly dynamics. Nat. Rev. Mol. Cell Biol. 15(10), 677–689. https://doi.org/10.1038/nrm3869 (2014).
Article CAS PubMed PubMed Central Google Scholar
Ye, G. et al. Nuclear MYH9-induced CTNNB1 transcription, targeted by staurosporin, promotes gastric cancer cell anoikis resistance and metastasis. Theranostics 10(17), 7545–7560. https://doi.org/10.7150/thno.46001 (2020).
Article CAS PubMed PubMed Central Google Scholar
Vicente-Manzanares, M., Ma, X., Adelstein, R. S. & Horwitz, A. R. Non-muscle myosin II takes centre stage in cell adhesion and migration. Nat. Rev. Mol. Cell Biol. 10(11), 778–790. https://doi.org/10.1038/nrm2786 (2009).
Article CAS PubMed PubMed Central Google Scholar
Tentler, D., Lomert, E., Novitskaya, K. & Barlev, N. A. Role of ACTN4 in tumorigenesis, metastasis, and EMT. Cells https://doi.org/10.3390/cells8111427 (2019).
Article PubMed PubMed Central Google Scholar
Griffiths, P. & Bull, A. Facial papules and lung cysts: A case of Birt-Hogg-Dube syndrome. BMJ Case Rep. https://doi.org/10.1136/bcr-2019-232083 (2019).
Article PubMed PubMed Central Google Scholar
Wei, T. & Lambert, P. F. Role of IQGAP1 in carcinogenesis. Cancers (Basel) https://doi.org/10.3390/cancers13163940 (2021).
Article PubMed PubMed Central Google Scholar
Vierthaler, M. et al. ADCK2 knockdown affects the migration of melanoma cells via MYL6. Cancers (Basel) https://doi.org/10.3390/cancers14041071 (2022).
Article PubMed Google Scholar
Guo, C., Liu, S., Wang, J., Sun, M. Z. & Greenaway, F. T. ACTB in cancer. Clin. Chim. Acta 417, 39–44. https://doi.org/10.1016/j.cca.2012.12.012 (2013).
Article CAS PubMed Google Scholar
Zhang, H. J. et al. Destrin contributes to lung adenocarcinoma progression by activating Wnt/beta-catenin signaling pathway. Mol. Cancer Res. 18(12), 1789–1802. https://doi.org/10.1158/1541-7786.MCR-20-0187 (2020).
Article CAS PubMed Google Scholar
Kumar, D. et al. Genetic instability in lymphocytes is associated with blood plasma antioxidant levels in health care workers occupationally exposed to ionizing radiation. Int. J. Toxicol. 35(3), 327–335. https://doi.org/10.1177/1091581815625593 (2016).
Article CAS PubMed Google Scholar
Repulles, M., Lopez-Marquez, V., Templado, J., Taviani, M. & Machordom, A. Genetic structure of the endangered coral Cladocora caespitosa matches the main bioregions of the mediterranean sea. Front Genet. 13, 889672. https://doi.org/10.3389/fgene.2022.889672 (2022).
Article PubMed PubMed Central Google Scholar
Cueto-Lopez, N. et al. A comparative study on feature selection for a risk prediction model for colorectal cancer. Comput. Methods Programs Biomed. 177, 219–229. https://doi.org/10.1016/j.cmpb.2019.06.001 (2019).
Article PubMed Google Scholar
Liu, H. X. et al. Diagnosing breast cancer based on support vector machines. J. Chem. Inf. Comput. Sci. 43(3), 900–907. https://doi.org/10.1021/ci0256438 (2003).
Article CAS PubMed Google Scholar
Goin, J. E. Classification bias of the k-nearest neighbor algorithm. IEEE Trans. Pattern Anal. Mach. Intell. 6(3), 379–381. https://doi.org/10.1109/tpami.1984.4767533 (1984).
Article CAS PubMed MATH Google Scholar
Langarizadeh, M. & Moghbeli, F. Applying Naive Bayesian networks to disease prediction: A systematic review. Acta Inform. Med. 24(5), 364–369. https://doi.org/10.5455/aim.2016.24.364-369 (2016).
Article PubMed PubMed Central Google Scholar
Viallard, C. & Larrivee, B. Tumor angiogenesis and vascular normalization: Alternative therapeutic targets. Angiogenesis. 20(4), 409–426. https://doi.org/10.1007/s10456-017-9562-9 (2017).
Article CAS PubMed Google Scholar
Wei, N. & Deng, X. W. The COP9 signalosome. Annu. Rev. Cell Dev. Biol. 19, 261–286. https://doi.org/10.1146/annurev.cellbio.19.111301.112449 (2003).
Article CAS PubMed Google Scholar
Keir, M. E., Butte, M. J., Freeman, G. J. & Sharpe, A. H. PD-1 and its ligands in tolerance and immunity. Annu. Rev. Immunol. 26, 677–704. https://doi.org/10.1146/annurev.immunol.26.021607.090331 (2008).
Article CAS PubMed PubMed Central Google Scholar
Miettinen, M. et al. GATA3: A multispecific but potentially useful marker in surgical pathology: A systematic analysis of 2500 epithelial and nonepithelial tumors. Am. J. Surg. Pathol. 38(1), 13–22. https://doi.org/10.1097/PAS.0b013e3182a0218f (2014).
Article PubMed PubMed Central Google Scholar
Zhang, M. et al. LncRNA GATA3-AS1 facilitates tumour progression and immune escape in triple-negative breast cancer through destabilization of GATA3 but stabilization of PD-L1. Cell Prolif. 53(9), e12855. https://doi.org/10.1111/cpr.12855 (2020).
Article CAS PubMed PubMed Central Google Scholar
Sanchez, K. & Maguire-Zeiss, K. MMP13 expression is increased following mutant alpha-synuclein exposure and promotes inflammatory responses in microglia. Front Neurosci. 14, 585544. https://doi.org/10.3389/fnins.2020.585544 (2020).
Article PubMed PubMed Central Google Scholar
Lin, S. L. et al. A novel long non-coding RNA-01488 suppressed metastasis and tumorigenesis by inducing miRNAs that reduce vimentin expression and ubiquitination of cyclin E. Cells https://doi.org/10.3390/cells9061504 (2020).
Article PubMed PubMed Central Google Scholar
Zhang, H. et al. A distinct subset of plasmacytoid dendritic cells induces activation and differentiation of B and T lymphocytes. Proc. Natl. Acad. Sci. U. S. A. 114(8), 1988–1993. https://doi.org/10.1073/pnas.1610630114 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Lanitis, E., Dangaj, D., Irving, M. & Coukos, G. Mechanisms regulating T-cell infiltration and activity in solid tumors. Ann. Oncol. 28(suppl_12), xii18–xii32. https://doi.org/10.1093/annonc/mdx238 (2017).
Article CAS PubMed Google Scholar
Zeng, Z., Chew, H. Y., Cruz, J. G., Leggatt, G. R. & Wells, J. W. Investigating T cell immunity in cancer: Achievements and prospects. Int. J. Mol. Sci. https://doi.org/10.3390/ijms22062907 (2021).
Article PubMed PubMed Central Google Scholar
Anderson, N. R., Minutolo, N. G., Gill, S. & Klichinsky, M. Macrophage-based approaches for cancer immunotherapy. Cancer Res. 81(5), 1201–1208. https://doi.org/10.1158/0008-5472.CAN-20-2990 (2021).
Article CAS PubMed Google Scholar
Beyer, M. et al. High-resolution transcriptome of human macrophages. PLoS ONE 7(9), e45466. https://doi.org/10.1371/journal.pone.0045466 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Komi, D. E. A. & Redegeld, F. A. Role of mast cells in shaping the tumor microenvironment. Clin. Rev. Allergy Immunol. 58(3), 313–325. https://doi.org/10.1007/s12016-019-08753-w (2020).
Article CAS PubMed Google Scholar
Genin, M., Clement, F., Fattaccioli, A., Raes, M. & Michiels, C. M1 and M2 macrophages derived from THP-1 cells differentially modulate the response of cancer cells to etoposide. BMC Cancer 15, 577. https://doi.org/10.1186/s12885-015-1546-9 (2015).
Article CAS PubMed PubMed Central Google Scholar
Tomson, C. R., Veale, D. & Gould, K. Antibiotic policy and infective exacerbation of obstructive airways disease. Lancet 2(8549), 45. https://doi.org/10.1016/s0140-6736(87)93081-9 (1987).
Article CAS PubMed Google Scholar
Wang, S. et al. Roles of RNA methylation by means of N(6)-methyladenosine (m(6)A) in human cancers. Cancer Lett. 408, 112–120. https://doi.org/10.1016/j.canlet.2017.08.030 (2017).
Article CAS PubMed Google Scholar
Li, X. et al. Transcriptome-wide mapping reveals reversible and dynamic N(1)-methyladenosine methylome. Nat. Chem. Biol. 12(5), 311–316. https://doi.org/10.1038/nchembio.2040 (2016).
Article CAS PubMed Google Scholar
Guo, G. et al. Advances in mRNA 5-methylcytosine modifications: Detection, effectors, biological functions, and clinical relevance. Mol. Ther. Nucleic Acids 26, 575–593. https://doi.org/10.1016/j.omtn.2021.08.020 (2021).
Article CAS PubMed PubMed Central Google Scholar
Kepesidis, K. V. et al. Breast-cancer detection using blood-based infrared molecular fingerprints. BMC Cancer 21(1), 1287. https://doi.org/10.1186/s12885-021-09017-7 (2021).
Article PubMed PubMed Central Google Scholar
Guryleva, M. V. et al. Investigation of the role of PUFA metabolism in breast cancer using a rank-based random forest algorithm. Cancers (Basel) https://doi.org/10.3390/cancers14194663 (2022).
Article PubMed Google Scholar
Huang, Y. et al. Multi-parametric MRI-based radiomics models for predicting molecular subtype and androgen receptor expression in breast cancer. Front Oncol. 11, 706733. https://doi.org/10.3389/fonc.2021.706733 (2021).
Article CAS PubMed PubMed Central Google Scholar
Zheng, X. et al. Deep learning radiomics can predict axillary lymph node status in early-stage breast cancer. Nat Commun. 11(1), 1236. https://doi.org/10.1038/s41467-020-15027-z (2020).
Article ADS CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The present study was supported by the Qiantang Scholars Fund in Hangzhou City University.

Funding

This study was supported by the Qiantang Scholars Fund in Hangzhou City University.

Author information

Authors and Affiliations

Key Laboratory of Novel Targets and Drug Study for Neural Repair of Zhejiang Province, School of Medicine, Hangzhou City University, Hangzhou, 310015, Zhejiang, China
Qing Xia, Qibin Yan, Zehua Wang, Qinyuan Huang, Xinying Zheng, Jinze Shen & Shiwei Duan
College of Pharmacy, Zhejiang University of Technology, Hangzhou, 310014, Zhejiang, China
Qing Xia, Qibin Yan, Xinying Zheng, Lihua Du & Hanbing Li

Authors

Qing Xia
View author publications
You can also search for this author in PubMed Google Scholar
Qibin Yan
View author publications
You can also search for this author in PubMed Google Scholar
Zehua Wang
View author publications
You can also search for this author in PubMed Google Scholar
Qinyuan Huang
View author publications
You can also search for this author in PubMed Google Scholar
Xinying Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Jinze Shen
View author publications
You can also search for this author in PubMed Google Scholar
Lihua Du
View author publications
You can also search for this author in PubMed Google Scholar
Hanbing Li
View author publications
You can also search for this author in PubMed Google Scholar
Shiwei Duan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Q.X., Q.Y., Z.W., Q.H., X.Z., J.S. and S.D. collected and analyzed the literature, drafted the figures, and wrote the manuscript. S.D., H.L., and L.D. conceived the idea and gave the final approval of the submitted version. All authors have read and agreed to the published version of the manuscript.

Corresponding authors

Correspondence to Hanbing Li or Shiwei Duan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Table S1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Xia, Q., Yan, Q., Wang, Z. et al. Disulfidptosis-associated lncRNAs predict breast cancer subtypes. Sci Rep 13, 16268 (2023). https://doi.org/10.1038/s41598-023-43414-1

Download citation

Received: 21 July 2023
Accepted: 23 September 2023
Published: 27 September 2023
DOI: https://doi.org/10.1038/s41598-023-43414-1

This article is cited by

Epigenetic regulation of diverse cell death modalities in cancer: a focus on pyroptosis, ferroptosis, cuproptosis, and disulfidptosis
- Shimeng Zhou
- Junlan Liu
- Xiaowei Qi
Journal of Hematology & Oncology (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.