Introduction

Cardiac glycosides have been traditionally used for treatment of heart failure and cardiac arrhythmias. The mechanisms of cardiac glycoside action involve inhibition of Na+/K+-ATPases and secondary activation of Na+/Ca2+ membrane exchangers. Although some preclinical investigations have suggested that cardiac glycosides may have anticancer properties1,2,3,4,5, others have reported they exacerbate cancer risk in clinical settings6. The basic structure of cardiac glycosides is similar to that of estradiol. One cardiac glycoside, digoxin, has the ability to bind oestrogen receptors and increases the risk of oestrogen-sensitive breast and uterine cancers7,8,9,10. Thus, the effect of cardiac glycosides in cancer remains controversial.

Recently, drug repositioning approach has been remarkably employed as conventional drug development requires high cost and long duration. Various methods utilising big data have been developed to identify unexpected associations between existing drugs and the outcomes of interests11,12,13,14,15. Big data contains real-world data such as the US Food and Drug Administration Adverse Event Reporting System (FAERS) and claims database or bioinformatics database such as Omics data and are widely available to researchers. Disproportionality analysis (DPA) and sequence symmetry analysis (SSA) using real-world data have been used in pharmacovigilance. Integrated analysis using DPA and SSA can enhance signal detection because SSA detects any additional true-positive signals that are not detected using DPA alone16. These analytical methods can detect a risk signal, indicating a possible causal relationship between drug exposure and an adverse event. Conversely, inverse signals detected using DPA and SSA generally have been thought to have no implication. However, we previously noted that inverse signals between a target drug and an adverse drug reaction suggest potential alternative therapeutic opportunities17, and several reports have evaluated these inverse associations for drug repositioning approaches18,19. Recently, drug repositioning studies using clinical real-world data have been conducted20,21. Moreover, bioinformatics databases are publicly available and are considered as a valuable source of data regarding phenotypes. Specifically, bioinformatics databases containing microarray gene expression profiles have been used for seeking novel molecular mechanisms18,22. In turn, the value of an integrative approach using both real-world data and bioinformatics databases was recently reported23. In the present study, functional relationships between digoxin and cancer were investigated by integrative analysis of multiple, large clinical and bioinformatics databases.

Results

Association between digoxin and cancer based on FAERS and Japan Medical Data Center (JMDC) claims databases

A total of 300,541 drug-reaction pairs for digoxin were found in the FAERS database. The association between digoxin use and cancers based on FAERS are shown in Table 1. In the analyses of individual cancers, significant inverse signals in both reporting odds ratios (RORs) and information components (ICs) were found for digoxin with gastric, colorectal, pancreatic, breast, ovarian, prostate and bladder cancers as well as melanoma and haematological malignancy. There were no positive signals in this DPA using the FAERS database.

Table 1 The association between digoxin use and cancers based on FAERS.

The characteristics of the study population in the JMDC claims database are summarised in Table S1. The number of claims pertaining to digoxin during the study period was 52,828. Among 3,035 digoxin users, 1,297 incident users who received their first digoxin prescription were identified. Table 2 shows the associations between digoxin use and cancers. Digoxin use was inversely associated with diagnoses of oesophageal, gastric, colorectal, lung and prostate cancers as well as haematological malignancy. Analyses of pancreatic, breast, uterine, ovarian and bladder cancer diagnoses showed no significant signal associations with digoxin use. Consequently, analyses of both FAERS and JMDC claims databases revealed significant inverse signals in gastric cancer, colorectal cancer, prostate cancer and haematological malignancy related to digoxin use.

Table 2 Event sequence symmetry analysis: the associations between digoxin and cancers.

Pathway enrichment analysis revealed biogroups associated with canonical pathways inversely correlated between digoxin and cancers

Substantial similarities and some differences in biogroups associated with canonical pathways were identified when differentially expressed genes (DEGs) from three cancer cell lines (HL60, MCF7 and PC3) treated with digoxin or four cancers (gastric cancer, colon cancer, prostate cancer and haematological malignancy) compared with normal tissue were analysed using the BaseSpace Correlation Engine [BSCE, Illumina Inc., CA, USA] (Fig. 1). Pathway enrichment analysis showed that 197 upregulated and 71 downregulated pathways among the three cancer cell lines were commonly observed after digoxin treatment. On the other hand, 17 upregulated and 8 downregulated pathways were commonly observed among the four cancers. Of the inversely correlated canonical pathways between digoxin and cancers, the “Caspase cascade in apoptosis,” “Genes involved in Cell Cycle Checkpoints,” and “Mechanism of Gene Regulation by Peroxisome Proliferators via PPARα” were observed.

Figure 1
figure 1

Pathway enrichment analysis using the BaseSpace Correlation Engine database. Human cancer cell lines: HL60, MCF7 and PC3. PPARα, peroxisome proliferator-activated receptor α.

Computational identification of drugs/compounds associated with digoxin

After digoxin treatment of the three cancer cell lines (HL60, MCF7 and PC3), 313 upregulated and 313 downregulated genes were found in common (Fig. 2). Connectivity Map analysis of DEGs showed that 80 drugs/compounds were found to be positively correlated (connectivity score: >0) with gene expression signatures for digoxin and eight were inversely correlated (connectivity score: <0; Table 3). In high connectivity score compounds/drugs, cardiac glycosides, such as digoxin, proscillaridin and lanatoside C, were detected. In addition, several anticancer agents (etoposide, vorinostat, carmustine and lomustine) were identified in the 80 drugs/compounds.

Figure 2
figure 2

Computational identification of drugs/compounds associated with digoxin using Connectivity Map (CMap). Human cancer cell lines: HL60, MCF7 and PC3. Red arrows, upregulated; blue arrows, downregulated.

Table 3 CMap analysis for compounds with gene expression signature for digoxin.

According to the results of pathway enrichment analysis with digoxin and the four cancers (gastric cancer, colon cancer, prostate cancer and haematological malignancy), we searched for compounds that influenced the “caspase cascade” and “cell cycle checkpoints” in the 80 drugs/compounds and focused on gossypol, which has been reported to induce apoptosis via the caspase cascade24. Pathway enrichment analyses in the BSCE database using gossypol identified biogroups associated with canonical pathways that were commonly regulated in the three cancer cell lines. Biogroups ranked in the top 10 in order of high score are shown in Fig. 3. In the same way, biogroups associated with digoxin were identified. By comparing the biogroups derived from digoxin analysis with the top 10 for gossypol, eight out of 10 biogroups were matched. As a negative control, we selected kaempferol, which had the lowest connectivity score of the eight matched drugs/compounds. The results of pathway enrichment analysis revealed the top 10 biogroups associated with canonical pathways of kaempferol were not matched with those of digoxin.

Figure 3
figure 3

Validation of biogroups associated with canonical pathways of digoxin for anticancer properties. Human cancer cell lines: HL60, MCF7 and PC3. Red arrows, upregulated; blue arrows, downregulated.

Discussion

In our analyses of both the FAERS and JMDC claims databases, significant inverse associations were found between digoxin and four cancers (gastric, colorectal, prostate cancer and haematological malignancy). Consistent findings from independent analyses involving different databases, methodologies and algorithms suggest that digoxin use is inversely associated with the risks of these cancers. Of note, there were no significant positive associations between digoxin and any of the investigated cancers. Analysis of the BSCE database suggested the molecular mechanisms by which digoxin suppresses the proliferation of cancer cells include the caspase cascade in apoptosis and peroxisome proliferator-activated receptor α (PPARα) pathways. Thus, these data mining analyses suggest that digoxin may have potential anticancer effects against several cancers.

If digoxin had an influence on cancers in a clinical setting, signals could have been observed by analysis of large-scale clinical data. Analyses of the FAERS and JMDC claims databases revealed significant inverse signals between digoxin and the four cancers. Previous basic research studies have supported the potential value of cardiac glycosides, such as digoxin, as chemotherapeutic candidates25,26,27,28. At the mechanistic level, several different pathways have been suggested to be responsible for mediating cytotoxic effects, including calcium-dependent activation of caspases and other hydrolytic enzymes5,29, generation of reactive oxygen species30, topoisomerase inhibition31 and interference of signal transduction pathways, such as Src, epidermal growth factor receptor, p21 and hypoxia inducible factor-1α2,4. In the present study, data mining was used to elucidate the molecular mechanisms of digoxin. In reference to the signature reversion strategy32, biogroups associated with canonical pathways can be screened on the basis of the chance overlap between the gene expression signature of digoxin and those of cancers. We detected inversely regulated biogroups associated with canonical pathways between digoxin and the four cancers. The matching between digoxin and cancer profiles revealed three canonical pathways, which included the “Caspase cascade in apoptosis.” Previous reports have shown that digoxin has apoptosis-related function5,29. Furthermore, the associated PPARα pathway was also noted. PPARα is known to be involved in fatty acid metabolism and fatty acid transport and metabolism are associated with metastatic progression and poor prognosis of cancers33. Several studies using cancer cell lines have shown PPARα as a biomarker34,35. Thus, digoxin may have a novel ability to suppress cancer invasiveness and progression via pathways associated with PPARα.

In the present study, no significant associations were observed between digoxin and several cancers such as breast or uterine cancers. When three, significant, inverse signals (ROR, IC and adjusted SR) were observed in our analysis, we considered that the criteria of significant inverse association were satisfied. Therefore, the reliability of inverse association derived from integrative analysis using DPA and SSA seemed to be high. Digoxin has been reported to be a risk factor of breast cancer as it binds to the oestrogen receptor8,36. Breast and endometrial cancers are oestrogen-dependent cancers. Therefore, the oncogenic property rather than the anticancer property of digoxin is concerning. As a result, our analysis did not detect significant inverse association between digoxin and oestrogen-dependent cancers. These findings support the usefulness of our integrative approach.

If digoxin has antiproliferative activity, the identified digoxin-induced DEGs should match with those induced by anticancer agents. The result of Connectivity Map analysis suggests several anticancer agents (etoposides, vorinostat, carmustine and lomustine) that are generally used in clinical settings. These drugs have been known to cause cell cycle arrest and to induce apoptosis. Gossypol also has DEG signatures similar to that of digoxin. Eight out of the top 10 biogroups associated with the canonical pathways of gossypol, which were matched to that of digoxin, were mainly concerned with the cell cycle. Thus, digoxin may have the ability to cause cell cycle arrest followed by apoptosis.

There are several inherent limitations associated with FAERS analysis such as reporting bias or unmeasured confounders37. Under-reporting or selective reporting are included. Certainly, not all adverse events observed in clinical settings are included in the database. As FAERS database contains missing data, misspelled drug names or duplicate data, we had deleted or corrected such data before conducting analysis in this study. Several variables were also limited in our FAERS analysis. Age, sex, race, treatment duration, drug dosage or co-administered drugs were not considered. It is generally recognised that oncogenesis occurs over a long period. Thus, the duration of digoxin treatment may be an important factor for suppressing oncogenesis. However, the duration of digoxin treatment was not available in the FAERS database. Although these aforementioned factors may affect the results of DPA, DPA using spontaneous reporting systems is useful for detecting the potential signals of drugs. We confirmed the robustness of the signals by analysing the common signals using two different algorithms: ROR and IC. However, signal detection using DPA should not be interpreted as assuming a causal relation between drugs and clinical events. Basically, DPA cannot be used for estimating the causative factors or the comparative risk of any drugs owing to the lack of denominators in the database. The hypotheses generated by DPA needs to be validated using other methods. In the next step, a different method, event SSA (ESSA), and a different database, JMDC claims database, were used in our analysis. ESSA has been used to detect the association between drug exposure and the outcome of interest as it has demonstrated moderate sensitivity and high specificity for signal detection38. However, this method also has potential limitations such as varying prescription trends, time-varying confounders, detection bias or confounding by indication39. In the present study, we used JMDC claims database for conducting ESSA. In this database, the proportion of elderly patients aged ≥65 years is low as this database has been built by the employees’ health insurance system. The prevalence of cancer generally increases with increasing age; therefore, this restriction in JMDC claims database may reduce the generalizability of results. The diagnoses contained in the claims database were not validated, and cancer diagnosis was broadly classified in our analysis. There exist phenotypic and functional heterogeneities in cancers. For example, lung cancer is divided into non-small cell lung cancer and small cell lung cancer and uterine cancer into cervical and endometrial cancers. Furthermore, breast cancer has been classified into some subtypes. Thus, more detailed analyses are necessary for each cancer type. Detailed classification of cancers leads to reduce sample size. Therefore, a big database will be needed. In our analysis, the maximum interval duration of ESSA was 48 months. As oncogenesis requires long duration, it may be better to prolong an interval more. Reportedly, during the 10-year follow-up period, an increased risk of cancer was observed in patients receiving digoxin40. Another report demonstrated that the risk of cancers associated with both long- and short-term digoxin uses was controversial41. Using longer intervals for ESSA would introduce other confounding factors and lead to false positive signals42. A careful interpretation of the results is required if the interval is extended. Herein, the BSCE database was utilised to estimate new molecular mechanisms of digoxin in cancers. We speculated molecular mechanisms using the transcriptome database composed of in-vitro studies; however, these mechanisms may differ from mechanisms occurring in human body. Indeed, no scientific approach is all-powerful; the same is true of in silico approaches. As a basic premise to incorporate in silico approaches in drug evaluation, it should be emphasised that they are not a substitute for in vivo experiments and should be performed in parallel to basic or clinical studies.

The results of our integrative analysis using different methodologies, algorithms and large-scale databases suggests that digoxin use is inversely associated with at least four cancers. Furthermore, the possibility of digoxin’s anticancer effect via PPARα and apoptotic caspase cascade pathways was suggested. Although several studies have suggested that digoxin is a potential candidate anticancer agent43,44,45, no definitive evidence exists yet. Our results provide a framework for uncovering and validating previously overlooked/undetected associations between digoxin use and anticancer effects by using both clinical and bioinformatics databases. Further basic research and epidemiological studies are required to confirm our findings.

Materials and Methods

Analysis of the FAERS database

The FAERS clinical database was accessed from the US Food and Drug Administration’s website (http://www.fda.gov/Drugs/GuidanceComplianceRegulatoryInformation/Surveillance/AdverseDrugEffects/). The present study included data from the first quarter of 2004 through the end of 2016. A total of 99,108,600 reports were obtained. Preferred terms from the Medical Dictionary for Regulatory Activities (MedDRA® version 20.1) were used to classify adverse events.

Arbitrary drug names, including trade names and abbreviations, were mapped into unified generic names via text mining using the Martindale website (https://www.medicinescomplete.com/mc/login.htm). Digoxin was identified by linking this archive with the FAERS database. All records that included digoxin in DRUG files were selected, and relevant reactions from REACTION files were then identified. Adverse events in the FAERS database were coded using MedDRA® preferred terms, which are grouped by defined medical conditions of the area of interest. We identified preferred terms related to cancers using Standardized MedDRA® queries. Preferred terms were linked to High-Level Terms (Table S2).

ROR and IC in DPA were utilised to detect spontaneous report signals. ROR and IC with a 95% confidence interval (CI) were calculated according to methods previously described17. Briefly, signal scores were calculated using a case/non-case method. Those reports containing the event of interest were defined as cases; all other reports comprised the non-cases. For the ROR, an inverse signal was defined if the upper limit of the 95% two-sided CI was less than 1. For the IC, an inverse signal was defined if the upper limit of the 95% CI was less than 0. In the current study, two methods were used to detect inverse signals, and the association between digoxin use and the diagnosis of cancers was listed as an inverse signal (drug repositioning signal) when the two indices met the criteria outlined above.

Analysis of the JMDC claims database

The JMDC claims database is a large and chronologically organised claims database constructed by the JMDC Co., Ltd.46 using standardised disease classifications and anonymous record linkage. In total, this database includes approximately 3.8 million insured persons in Japan (approximately 3.2% of the population) and mainly comprises company employees and their family members. This database provides information on the beneficiaries, including encrypted personal identifiers, age, sex, International Classification of Diseases, 10th revision, procedure and diagnostic codes, as well as the name, dose and number of days supplied of the prescribed and/or dispensed drugs. All drugs were coded according to the Anatomical Therapeutic Chemical Classification of the European Pharmaceutical Market Research Association. An encrypted personal identifier was used to link claims data from different hospitals, clinics and pharmacies.

ESSA was performed to evaluate the association between digoxin use and cancers, and adjusted sequence ratios (SRs) were calculated with reference to a previous report18. Briefly, the ESSA evaluates asymmetry in the distribution of an incident event before and after initiation of a specific treatment. Asymmetry may indicate an association between a specific treatment of interest and the event42,47. The crude SR is defined as the ratio of the number of newly diagnosed cancer patients after initiation of digoxin relative to the number before initiation. In addition, the SRs were adjusted for temporal trends in digoxin and events using the method proposed by Hallas47. The probability that digoxin was prescribed first, in the absence of any causal relationship, can be estimated by a so-called null-effect SR. The null-effect SR generated by the proposed model may be interpreted as a reference value for the SR. Therefore, the null-effect SR is the expected SR in the absence of any causal association after accounting for incidence trends. By dividing the crude SR by the null-effect SR, an adjusted SR corrected for temporal trends is obtained. A slightly modified model was used to account for the limited time interval allowed between digoxin use and cancer diagnosis. In the present study, an adjusted SR of less than 1 was defined as an inverse association between digoxin use and risk of cancer (drug repositioning signal).

All incident users of digoxin and all newly diagnosed cancer cases were identified from January 2005 to February 2015. Target cancers were defined according to International Classification of Disease, 10th revision, codes (Table S3). Those patients were followed up until February 2015. Incidence was defined as the first prescription of digoxin. To exclude prevalent users of digoxin, the analysis was restricted to users whose first prescription was administered in July 2005 or later (after a run-in period of 6 months). Likewise, the analysis was restricted to cases whose first diagnosis was in July 2005 or later. Waiting time distribution analysis was performed to ensure that the analysis was restricted to incident users of digoxin and newly diagnosed cases of cancer48. An identical run-in period was also applied to patients enrolled in the cohort after June 2005. Incident users were identified by excluding those patients who received their first digoxin prescription before July 2005, and newly diagnosed cancer cases were identified by excluding those patients whose first diagnosis of cancer was before July 2005. Those patients who had initiated a new treatment with digoxin and whose first diagnosis of cancer was within 12, 24, 36 and 48 month periods (intervals) of treatment initiation were identified. Patients who had received their first digoxin prescription and whose first cancer diagnosis was within the same month were not included in determination of the SR. The 95% CI for the adjusted SR was calculated using a method for exact CIs for binomial distributions49.

Pathway enrichment analysis using the BSCE database

The BSCE bioinformatics database was used to investigate microarray gene expression profiles. This engine can search curated gene expression profiles of compounds or diseases of interest available from an open source: Gene Expression Omnibus. Target drugs/compounds and cancer names were used as queries to filter gene expression datasets of subsequent gene expression profiling. For each query term, we identified microarray datasets that met the inclusion criteria of differential mRNA expression data for humans acquired by analyses of a perturbed condition and unaffected control with a high signal-to-noise ratio. We utilised publicly available microarray datasets shown in Table S4. Gene expression datasets (i.e. biosets) were extracted from the BSCE database and subjected to pathway enrichment analysis. To profile the results of pathway enrichment analysis, we performed text mining and used curated pathways. We derived commonly differential regulated pathways when the query drug/compound was administered with three human-derived cancer cell lines (HL60, MCF7 and PC3). We also derived differentially regulated pathways in four cancers (gastric cancer, colorectal cancer, prostate cancer and haematological malignancy) compared with control. If a drug/compound had a signature with up- or downregulated pathways opposite that of a disease signature (down- or upregulated pathways), that drug/compound had a molecular mechanism involving these pathways and could potentially be used as a treatment for that disease.

Combination analysis using the BSCE database and Connectivity Map

We focused on DEGs induced by target drug/compound administration in HL60, MCF7 and PC3 cell lines. DEG probes commonly expressed in these three cell lines with a P < 0.05 and |Fold change| > 1.5 were obtained from the BSCE database, and these probes were included in the Connectivity Map analysis50. When the reference signature for a drug matched the digoxin signature with a P < 0.05, it was short-listed with the connectivity score to indicate its correlation with a query drug/compound.

Data management

Data management and analysis were performed using Visual Mining Studio software (version 8.1; NTT DATA Mathematical Systems Inc., Tokyo, Japan). Quantitative data are expressed as means ± standard deviations and categorical data as frequencies (percentages).