Multimodal CNN-DDI: using multimodal CNN for drug to drug interaction associated events

Asfand-e-yar, Muhammad; Hashir, Qadeer; Shah, Asghar Ali; Malik, Hafiz Abid Mahmood; Alourani, Abdullah; Khalil, Waqar

doi:10.1038/s41598-024-54409-x

Download PDF

Article
Open access
Published: 19 February 2024

Multimodal CNN-DDI: using multimodal CNN for drug to drug interaction associated events

Muhammad Asfand-e-yar¹,
Qadeer Hashir¹,
Asghar Ali Shah²,
Hafiz Abid Mahmood Malik³,
Abdullah Alourani⁴ &
…
Waqar Khalil¹

Scientific Reports volume 14, Article number: 4076 (2024) Cite this article

1502 Accesses
1 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Drug-to-drug interaction (DDIs) occurs when a patient consumes multiple drugs. Therefore, it is possible that any medication can influence other drugs’ effectiveness. The drug-to-drug interactions are detected based on the interactions of chemical substructures, targets, pathways, and enzymes; therefore, machine learning (ML) and deep learning (DL) techniques are used to find the associated DDI events. The DL model, i.e., Convolutional Neural Network (CNN), is used to analyze the DDI. DDI is based on the 65 different drug-associated events, which is present in the drug bank database. Our model uses the inputs, which are chemical structures (i.e., smiles of drugs), enzymes, pathways, and the target of the drug. Therefore, for the multi-model CNN, we use several layers, activation functions, and features of drugs to achieve better accuracy as compared to traditional prediction algorithms. We perform different experiments on various hyperparameters. We have also carried out experiments on various iterations of drug features in different sets. Our Multi-Modal Convolutional Neural Network - Drug to Drug Interaction (MCNN-DDI) model achieved an accuracy of 90.00% and an AUPR of 94.78%. The results showed that a combination of the drug’s features (i.e., chemical substructure, target, and enzyme) performs better in DDIs-associated events prediction than other features.

Prediction of drug-drug interaction events using graph neural networks based feature extraction

Article Open access 16 September 2022

Identifying the serious clinical outcomes of adverse reactions to drugs by a multi-task deep learning framework

Article Open access 24 August 2023

DeepARV: ensemble deep learning to predict drug-drug interaction of clinical relevance with antiretroviral therapy

Article Open access 06 May 2024

Introduction

A DDI occurs when two or more drugs are taken together and result in adverse effects on the organism. It is feasible to manage more drugs, but some diseases are complex and can be treated with only one. This is because many diseases are justified by multiple medications. Therefore, DDIs pose risks and can be deadly if not treated properly. However, it also has curative properties. These risks have been identified in many studies, including research aimed at identifying whether the intake of two or more drugs will be safe. In pharmaceutical research/settings, DDIs are identified through thorough experimental and clinical testing. Despite high-throughput methods, the sheer number of DDIs makes experimental testing challenging and expensive ?. Computational methods can be used to address this problem by predicting potential DDIs, which can be an effective and fast alternative based on already-known knowledge of DDIs. In the last few decades, the speedy growth in drug development has provided medical practitioners with additional options for treating the diseases of patients. Therefore, using multiple drugs together can lead the patient to a severe condition, or it can be a cause of death. A doctor or physician prescribing multiple drugs to a patient may cause a drug-to-drug interaction, or we can say the drug-to-drug interaction is a medication error¹. A drug interaction can occur when one drug changes the action of other drugs, which may result in harmful or boost effects of the second agent (i.e., drug)².

There are three types of drug-to-drug interactions.

Does not react.
Antagonistic (i.e., it is a type of interaction between multiple drugs that produces an adverse effect). This means that this type of reaction may adversely affect the patient.
Synergistic (i.e., this type of interaction occurs between multiple drugs which may boost the effect on the body)^3,4,5.

Therefore, to improve the drug discovery process and patient recovery process for deadly diseases (for example, Cancer, Aids, Asthma, etc.), it is significant that we should know more about DDI. Discovering the DDI’s through the wet lab experiments is time-consuming and intensive work required⁶. Drug interaction prediction usually seeks to detect potential interactions between drugs that may lead to adverse effects or reduced efficacy⁷. While Drug combination prediction is intended to improve treatment effectiveness, drug interaction prediction identify possible side effects or difficulties that may arise from the use of particular drugs in combination⁸.To overcome the above problems ML and DL techniques are applied to predict DDI. Machine learning and deep learning have made important developments in the last few years in predicting DDIs. These methods are proposed for identifying latent DDIs between drug combinations. Yet, the majority of these methods limit themselves to using drugs as input and performing the DDI prediction task. We use multiple features of drug targets, enzymes, pathways, and chemical substructures for our experiments; and we calculate similarities of drugs using Jaccard similarity measures. Our proposed method is MMCNN-DDI for the DDI’s. It is based on a Multi-Modal Convolutional Neural Network for the event prediction between DDIs. Therefore, we use four CNN sub-models for each feature of drugs and then combine these sub-models to predict drug-drug interaction events. We used DDIs using Multi-Modal Deep Learning (b) to create a dataset for the prediction; the data set has 572 drugs and various features like chemical substructures (SMILES), enzymes, pathways, and targets, 74,528 interactions, and 65 types of drug-drug interaction events. We perform various experiments on different hyperparameters. Initially, we used a different number of layers and activation functions using the dense layer and input layer according to the Jaccard similarity. We used a 1D CNN input layer with a filter size of 1 and 5 kernels, three dense layers of 1024, 512, and 256 neurons, and an output layer of 65 neurons. Our MCNN-DDI model reached an accuracy of 90.00% and an AUPR of 94.78%. The experimental outcomes show that our method has achieved high accuracy and performs well against existing methods.

Related work

In the past decade, scientists have used ML and DL for drug development purposes to make drug development quick and help the drug industries develop drugs quickly because using wet-lab experiments is time-consuming and expensive, as discussed in some studies. The research⁹ proposed a method named Semi-Nonnegative Matrix Factorization (DDINMF) for degressive and enhancive calculation of DDI’s, and the proposed model is created on semi-non-negative matrix factorization. The study¹⁰ integrated various structures of drugs like chemical substructure, enzymes, pathways, and targets and proposed a method named Sparse Feature Learning ensemble method with Linear Neighborhood Regularization (SFLLN) for predicting DDI’s. Authors¹¹ integrate three different deep learning techniques, which are Recurrent Neural Networks (RNNs), CNNs, and Mixture Density Neural Networks (MDNs), for the efficient prediction of DDI. The research¹² proposed a novel method named Deep Predictor (DPDDI) for the DDI prediction in which Graph Convolutional Network (GCN) is used for the extraction of network structure features of medicine from the network of DDI’s, and a model of Deep Neural Network (DNN) is working as a predictor. Authors¹³ developed a new model that uses deep feed-forward networks and autoencoders and trained on three different similarity profiles, which are Target, Gene, and Similarity Profiles, Gene Ontology term similarity profiles, and SSP for predicting DDI’s effect.

Study¹⁴ proposed a method named the Neural Network-Based Method for DDIs (NDD). The model first calculates the diverse similarity of drugs, for example, pathways, side effects, target, transporter, and substructure, and then uses neural networks to predict DDIs. In 2019 authors¹⁵ used multi-modal deep autoencoders, which learn from multiple drug feature networks simultaneously a unified representation of drugs. Several operations are adopted on drug embedding, which is retained for the drug-drug pairs representation and then used in the random forest (RF) for the DDI’s prediction. The study¹⁶ use multiple drug data sources like Drug Bank¹⁷, KEGG drugs¹⁸, and PharmGKB¹⁹ and use 12000 features of drugs and integrate these features using Knowledge Graphs (KGs). Different embedding approaches are used to train the prediction model, leading to a best-performing combination: The ComplEX embedding method. Xinyu et al.²⁰ generated features of 5000 drugs from a drug bank database and built a DNN model to predict 80 types of DDI’s using 5000 drug features; these features of drugs were produced using SMILES. Deng et al.²¹ used a drug bank database and collected DDI from that and then used event trimming and dependency analysis to extract 65 different categories of DDI and then proposed a novel framework named Drug-Drug Interaction Multimodal Deep Learning (DDIMDL), which used a sub-model for every feature of drugs and then concatenated the sub-models for the prediction of the DDI. This research²² used a One-Class Support Vector Machine (OCSVM) for reliable negative seed generations, then used all the positive and negative labels for training and an iterative Support Vector Machine (SVM) to identify all negative from and un-label samples to predict DDIs. The study²³ explored graph outcomes knowledge for the DDI prediction to overcome the following two problems: the first is to achieve a good performance, and the second is to keep certain interpretability.

In this research authors²⁴ created a DDI prediction framework based on Knowledge Graph (KG) embeddings and introduced a Gumbel-SoftMax and Wasserstein Distances-Based Adversarial Autoencoders (AAE’s) where the autoencoder is used to make high-quality negative samples. In 2020 study²⁵ extracted various features of drugs which include targets, categories, enzymes, and pathways, focused on 65 types of DDI events, and proposed a CNN-based model named CNN-DDI for the drug-drug interaction prediction. Study²⁶ introduces a multi-scale feature fusion method to fuse multi-modal features well using scalar and cross-level components. In research authors²⁷ proposed a technique called Knowledge Graph Neural Network (KGNN) for the DDI’s prediction. This method learns for each entity from neighborhoods, and then the neighborhood’s information is integrated with bias from the current entity representations. Study²⁸ used drug embedding and Graph autoencoders with multiple knowledge sources to effectively predict DDI’s. To learn the embeddings of drugs, they used a Drug Target Interaction network and a variational autoencoder to gain rich chemical structure representation.

In this research authors²⁹ used seven types of drug pair similarities to create feature vectors and then proposed a model based on logistic regression for DDI predictions. the research³⁰ and³¹ used interaction profile fingerprint and structural similarity for the DDI’s prediction. Most previous studies focuOne-Classher two drugs interacting with each other or not, and now most researchers focus on the deep learning prediction model using deep neural networks; compared with simple CNN and DNN, the multi-modal CNN performed well, which can also effectively overcome the overfitting problem as compared to simple DNN. This study developed a novel method based on a Multi-Modal CNN named (MMCNN-DDI). First, four features of drug targets, enzymes, pathways, and chemical substructures. Then we use different similarity measures like the Jaccard and Cosine similarity matrix to calculate drug pairs’ similarities. Then construct four sub-models based on a CNN for each feature of drugs and feed these similarities matrix to sub-models last; these sub-models are concatenated for predicting DDI’s. The study³² presents DANN-DDI, a deep attention neural network framework to predict unknown DDIs by integrating multiple drug features. The model uses a graph representation learning approach to obtain medicine embeddings, followed by an attention neural network (ANN) to acquire representations of drug-drug pairs. The technique outperforms several state-of-the-art prediction methods and can recognize innovative connections and DDI-associated events. This study³³ presents a new method called DDI-IS-SL for predicting drug-drug interactions created on combined similarity and semi-supervised learning (SSL). DDI-IS-SL integrates drug biological, chemical, and phenotype data to compute drug similarity and uses the Regularized Least Squares classifier to predict interaction possibility scores of drug pairs. The paper³⁴ discusses DDIs which occur when drugs affect each other, leading to unexpected or severe side effects. DDIs are important to consider for drug-related studies, for example, drug repurposing, and drug-target interaction. The paper introduces DDIPred, a new technique for DDI prediction that utilizes drug chemical building embedding and graph convolutional networks (GCNN). In this study³⁵, a new single-stage finder model has been developed. The model comprises a base network, which is then followed by several multiscale feature map blocks. This design allows for the output of the base network to be transformed into larger feature maps, which in turn generates more anchor boxes to detect smaller objects. As a result, the size of the feature maps is decreased, allowing for more precise detection of labeled objects. This paper³⁶ discusses the use of DL and ML algorithms to predict (DDIs), which is a low-cost and real method. The paper also highlights the need for further research in this area to reduce the number of interactions and their adverse effects. The recent use of deep learning techniques for recognizing relations among different medicines to evade adverse effects is also discussed, and the importance of increased accuracy and performance in predicting DDIs is emphasized. The paper concludes by suggesting that future research should consider drug-food interactions in addition to DDIs.

This research³⁷ Predicting multiple interactions that a drug may encounter is crucial for drug development and safety. Artificial Intelligence (AI) has offered innovative methods to predict these interactions efficiently compared to traditional labor-intensive approaches. This research systematically examines AI applications in predicting drug-drug, drug-food (excipients), and drug-microbiome interactions. It outlines common model methods, evaluation indicators, algorithms, and databases used for these interactions. Particularly, ML models focusing on metabolic en- zyme P450, drug similarity, and drug targets are discoursed. The research³⁸ study outlines progress in AI for each type, summarizing data sets and methods. It introduces common databases, presents research advancements, and traces the timeline of DDI prediction events. The paper also discusses the challenges and potential of AI in enhancing clinical decision-making and patient outcomes in DDI prediction.

In this research³⁹ authors employ machine learning to predict drug risk levels based on Adverse Drug Reactions (ADRs). Using a dataset of 985,960 ADR reports from the Chinese spontaneous reporting database, we address class imbalance with the Synthetic Minority Oversampling Technique (SMOTE). The approach involves a multi-classification framework, utilizing ADR signal values and four different classifiers. The optimal combination, PRR-SMOTE-RF, achieved an accuracy rate of 95%. This study has potential applications in assisting experts in assessing the transition of prescription drugs to over-the-counter status. In this study⁴⁰ the authors employ artificial neural networks and factor propagation over graph nodes, presenting two innovative methods: adjacency matrix factorization (AMF) and adjacency matrix factorization with propagation (AMFP). These findings emphasize the potential of AMF, AMFP, and the ensemble-based classifier in providing vital information for drug development and prescription, even with partial or noisy data. The study also underscores the importance of the drug interaction network as a valuable data source for identifying potential DDIs.

This study⁴¹ addresses the importance of predicting interactions between G protein-coupled receptors (GPCRs) and drugs, a crucial aspect of drug development. The results demonstrate improved predictive performance compared to existing models, offering potential benefits for drug development efforts. The study⁴² focuses on Herbs and their partnership with medicines which become popular worldwide. This study collects all the facts about Panax notoginseng and medicines, helping doctors and patients make better choices for their health.

The research⁴³ examined, a novel ensemble neural network model, proposed to improve the accuracy of predicting drug-drug interactions. In this study, the authors introduce a super-smart computer model that can predict interactions between 86 different drugs with almost 94% accuracy. A comparative analysis table of the discussed work is presented in Table 1.

Table 1 Comparison with the related work.

Full size table

The goal of this study⁴⁴ is to give a comprehensive review of the current status and trends in drug-target interaction prediction. It lists several databases and web servers with data on drug space, target space, the drug-target interaction network, and side effect networks. The paper⁴⁵ deals with the function of small molecules and microRNAs (miRNAs) in cellular biology. The authors cover four experimental methods that have been employed in the last few years to look for small molecule inhibitors of miRNAs as well as three classes of models that can be used to predict whether a compound binds with a certain miRNA. The study⁴⁶ calls for more effective drugs to combat complex human diseases. The paper explains the background of the drug and introduces the concept of drug-pathway associations. The authors⁴⁷ present a Multi-Channel Feature Fusion model for multi-typed DDI prediction (MCFF-MTDDI). They extracted drug chemical structure features, drug pairs’ extra label features, and KG features of drugs. A multi-channel feature fusion module was then used to fuse these various features. The study⁴⁸ compared ML and PBPK models in predicting drug-drug interactions (DDIs). Data-driven in nature, and able to handle huge datasets with complex relations, ML models are thus well suited for predicting DDIs from disparate databases. The research proposed an integrated approach that combines ML with PBPK models, thus increasing the accuracy and efficiency of DDI predictions while also making the process more interpretable.

Materials and methods

The dataset used in this study was provided by the DDIMDL²¹ this dataset has 572 drugs and 74528 interactions of drug pairs according to DDIMDL. Remove²¹ the repeated interaction of drug pairs then in the dataset 37264 are left. They collected DDI’s from drug bank which was in the descriptive format and applied NLP techniques for a better understanding of DDI’s. Then four different features of drug targets, enzymes, pathways, and chemical substructures from the Drug Bank database. Drug Bank¹⁷database contains 12,151 drugs and its broad information which includes the drug name, chemical substructure (i.e., SMILES) or we can say the chemical formula of drugs, targets, enzymes, pathways, description, protein, etc., also contains 3844 drugs approved from Food and Drug Administration (FDA) and 5867 experimental drugs.

Methods overview

Our proposed framework, as shown in Fig. 1, has two components. Initially, features are extracted then pass the required data in the models, to train the model. Only required preprocessing techniques are applied in the feature extraction component. Then we used feature engineering techniques on the drug’s data. In the second component, we applied the CNN to train our model and evaluated that model with multiple performance measures, for example, accuracy, precision, recall, and F1 score. In this study, we proposed a method-based multi-modal CNN for the DDIs associated events predictions, as shown in Fig.2. First, we input four features of drug targets, enzymes, and pathways Schemes follow the same formatting. If there are multiple panels, they are listed below:

First, we input four features of drug targets, enzymes, pathways, and chemical substructure.
Second we use encoding and get binary vectors of features where 1 represents the presence of a compound and 0 represents the absence of a compound in the drug and then use similarity measure to calculate the similarity matrix of each drug pairs.
In the third step we create sub-models based on CNN for the prediction of DDI’s events.

Similarity of drug pairs

For drug pairs similarity calculation, we use the Jaccard similarity in our proposed architecture. The Jaccard similarity compared a compound of two drugs to check whether the compound is shared or not. Mathematical formulas of the Jaccard similarity measure are given in Eq. (1). Jaccard Similarity Formula

$$\begin{aligned} J_{\text {sim}}(J, K)= & {} \frac{|J \cap K|}{|J \cup K|} \nonumber \\= & {} \frac{\text {Total number of compounds in drug pairs}}{\text {Number of compounds in either set}} \end{aligned}$$

(1)

Where:

J is the bit vector for the first set (drug pair)
K is the bit vector for the second set (drug pair)
$|J \cap K|$ represents the intersection of J and K
$|J \cup K|$ represents the union of J and K

Table 2 MCNN-DDI results on different layers and activation function with Jaccard Similarity.

Full size table

Table 3 MCNN-DDI results on different sets of features.

Full size table

Extracting drug features

We have four different drug feature which includes targets, enzymes, pathways, and chemical substructures, as shown in the layer of “Drug Features” in Fig. 1. We used the encoding layer to create a bit vector of each drug where 1 represents the presence of the compound in the drug and 0 represents the absence of the compound. For example, pathways can be represented by a 957-dimensional bit vector which is defined by the PubChem chemical molecule database⁴⁹. PubChem defines a 202-dimensional bit vector for enzymes, an 1162-dimensional bit vector for targets, and an 808-dimensional bit vector for chemical substructure. The selected four drugs’ features for the experiment have high dimensions and most of the values are 0 in every bit vector of the drug. The value is 0 if a chemical compound is missing in a drug. Due to a maximum number of 0’s the dimension of a drug vector increases. Therefore, to reduce the drug vector sparsity we use Jaccard similarity measures. This similarity measure calculates a drug pair similarity matrix from a bit vector. Hence, we get 572 $\times$ 572 matric for each of the four drug features. This metric is used as a representation of drugs.

Let M = (Ajk) where M is the metric A is a drug and j and k are in the range of 0 and 1. Therefore, when the j and k are higher then the similarity between pairs of drugs is higher. Afterward, we feed these 572 $\times$ 572 as input of every drug feature to the sub-model based on CNN.

Multimodal CNN for prediction

As we are using different drug features in our study, we created sub-models based on convolutional neural networks for every feature of drugs.

CNN is a type of deep neural network⁵⁰. CNN was developed in the mid-1980s⁵¹. The CNN consists of three types of layers first one is input layers, second is hidden layers, and in the last output layers. The hidden layer in CNN includes layers that perform convolutions. Commonly the hidden layer of CNN includes a layer that performs convolution kernel dot product with the input matrix layer’s commonly used activation function is ReLU⁵² and usually the product is Frobenius inner product. Features maps are generated by the convolutional operations which act as an input to the next layer and then followed by other layers like normalization layers, fully connected layers, and pooling layers.

Our model architecture was inspired by DDIMDL which uses a simple multi-modal deep neural network for DDI prediction and achieved an accuracy of 88.5%.

In the CNN model, we use the input layer with the filter size of 1 and the kernel size of 5 with the “tanh“ activation function. We use the “tanh“ as an activation function because the similarity matrix using the Jaccard similarity measures produces some negative values, therefore we use “tanh” to use both positive and negative values. Then we use the flattening layer to convert the data into a dimensional array for inputting to the next layer. After flattening layer uses a dense layer of 1024,512,256 neurons and uses an “elu” as an activation function⁵³. With every dense layer, we use the “Bach Normalization” layer⁵⁴ for the normalization of the previous output layer. To avoid over-fitting of the model we also use a drop-out layer⁵⁵ of value 0.3 with every dense layer. Then we use a dense layer of 65 neurons as an output layer because we are classifying 65 different types of drug-drug interaction with the “SoftMax” activation function. We use Adam⁵⁶ as an optimizer in our model, and for the loss function, we use categorical cross entropy⁵⁷.

Results

In this section, we delve into the methods we employed and the exciting results we obtained in our study on drug-to-drug interactions (DDIs). Our approach harnessed the power of machine learning, specifically utilizing a Convolutional Neural Network (CNN) to analyze 65 unique drug-associated events extracted from the DrugBank database. Our model considers a wide range of input data, including chemical structures represented as SMILES, information about enzymes, pathways, and drug targets. Through extensive experimentation, where we fine-tuned various model parameters and explored different combinations of drug features, our Multi-Modal Convolutional Neural Network-Drug to Drug Interaction (MCNN-DDI) yielded impressive results.

Furthermore, we have carried out two different statistical tests to evaluate the performance of different methods. A paired t-test is used to compare our proposed method with other methods. This test determines if the mean difference in performance metrics is significant. For example, accuracy or AUC-ROC statistics between our method and baseline methods, the t-test shows that our proposed methods perform better than other methods, as shown in Table 4. The second ANOVA statistical test determines the effect of performance on dependent variables and improvement in evaluation metrics. A significance level $\alpha = 0.05$ was used to perform both tests. The statistical significance is indicated by the p-value if it is less than $\alpha = 0.05$.

Performance of model on different hyperparameters

The hyperparameters can affect the performance of the model. Hence, we perform different experiments on different hyperparameters. First, we used different numbers of layers and different activation functions in the dense layer and input layer with the Jaccard similarity matrices the number of layers and the activation function results with Jaccard similarity are given in Table 2.

Evaluating MNN-DDI performance on different set of features

For the results, we generate a classification report in our study, and the evaluation measures include accuracy score, F1-score, precision, and recall on both content and context results. All the above measurements are used for the model results evaluations. We have also done some experiments with different sets of drug features to evaluate our model performance on individual drug features and a combination of different drug features. The results on a single and a set of features are given in Table 3. The chemical substructure achieves an accuracy of 0.8861 which is more informative as compared to other features while the pathways achieve an accuracy of 0.8317 and the model train on target achieves an accuracy of 0.8441. The enzyme features of drugs individually achieve an accuracy of 0.6808 which show that this feature of drugs is not informative as compared to other. When used individually in combination with other features it produces very good results. Whenever using two sets of features the smiles and pathways achieve an accuracy of 0.8993 which is greater than all of the other drug’s features used in a set of two. Using three sets of features among all the features the following features chemical substructures, Targets, and Enzymes achieved an accuracy of 0.9000. The model is trained on all four combined features and achieved an accuracy of 0.8953. All the experiments show that using all four features of drugs combined can’t perform as well as the above three features perform.

Evaluating the performance of our model more in-depth, we investigated how often DDIs among the top 100 most highly scored predictions were correctly predicted. Therefore, we used a ranking strategy, to analyze how well our model prioritizes true positives. Hence, methodologically, we computed prediction scores for all the DDIs and ranked them based on these predictions. So, we analyze the precision predictions between different drug interaction’s ranking particularly in the best 100. By this, we learned more about the model’s ability to identify the most pertinent DDIs. Our model has demonstrated the ability to prioritize DDIs accurately as per our expectations from this top-best 100 analysis. Our model shows good performance in predicting interactions between different types of drugs such as cardiovascular, infectious, and cancerous diseases. In addition, our results were compared with the other approaches and they confirmed the effectiveness and reliability of our model to rank DDIs.

We also examined whether the model provided equal accuracy of prediction DDIs across cardiovascular, infectious, and cancer diseases. After a detailed evaluation, we noted a consistency in the performance of our model within the cardiovascular, infectious, and cancer diseases. The consistency shows the strength of our model. Our model consistently placed true positive interactions among the first-ranked predictions, indicating its capability to accurately detect clinically relevant DDIs.

Discussion

To show the robustness of our model MMCNNDDI we compare our model (as shown in Fig. 1) with the following models which are DDIMDL, DeepDDI, RF, K-nearest Neighbor (KNN), and Logistic Regression (LR). All the given methods use using same drug features like targets, enzymes, pathways, and chemical substructure except DeepDDI (as shown in 2). For the random forest, we set the decision tree values to 100 and for KNN we set the neighbor value to 4. The experiment results of all the methods are shown in the Table 4.

Table 4 MCNN-DDI results comparison with other models.

Full size table

In Table 4, it can be observed that our model outperforms all the other models in the four assessments, while the LR model is poorly related to all the other models. Our model achieves high accuracy and AUPR values of 0.9000 and 0.9478, respectively, demonstrating its superiority (as shown in Table 3). When compared to DDIMDL, our model shows an improvement in accuracy score from 0.8852 to 0.9000 and an increase in AUPR value from 0.9208 to 0.9478, along with improvements in other evaluation metrics.

Conclusion

Recently deep learning techniques have been used for the prediction of drug-drug interactions but mostly these studies concentrate on one feature of drugs or whether one drug interacts with another or not. In this research study, we use deep learning multi-modal CNN techniques on the drug bank database which was created by DDIMDL for the prediction of drug-drug interaction events. The data set has 572 drugs and their diverse features like chemical substructures (SMILES), enzymes, pathways, and targets, 74528 interactions, and 65 types of drug-drug interaction events. Our MCNN-DDI model achieved an accuracy of 90.00% and an AUPR of 94.78% as shown in Table 3. Our MCNN-DDI model has a 1D CNN input layer with a filter size of 1 and 5 kernels size, three dense layers of 1024, 512, and 256 neurons and an output layer of 65 neurons, 1 Flatten layer, and Bach Norm and Dropout layer with 0.3 value and use a combination of four drug features for the input of the model. We used four CNN sub-models for every feature of drugs and then in the last, we combined these sub-models for the prediction of drug-drug interaction events.

Data availability

All data generated or analyzed during this study are cited in Reference¹⁹ and are also publicly available:https://go.drugbank.com/.

References

Bhaskar, K. et al. Incidence of potential drug-drug interactions in a limited and stereotyped prescription setting-comparison of two free online pharmacopoeias. Cureus 8, 859 (2016).
Google Scholar
Van-Dijk, K., de-Vries, C. S., van-Den-Berg, P., Brouwers, J. & De-Van-den-Berg, L. J. Occurrence of potential drug-drug interactions in nursing home residents. Int. J. Pharm. Pract. 9, 45–52 (2001).
Article Google Scholar
Liu, S. et al. Drug-drug interaction extraction via convolutional neural networks. Comput. Math. Methods Med. 2016, 145 (2016).
Article Google Scholar
Kusuhara, H. How far should we go? Perspective of drug-drug interaction studies in drug development. Drug Metab. Pharmacokinet. 29, 227–228 (2014).
Article CAS PubMed Google Scholar
Percha, B. & Altman, R. B. Informatics confronts drug-drug interactions. Trends Pharmacol. Sci. 34, 178–184 (2013).
Article CAS PubMed Google Scholar
Bjornsson, T. D. et al. The conduct of in vitro and in vivo drug-drug interaction studies: A pharmaceutical research and manufacturers of america (phrma) perspective. Drug Metab. Dispos. 31, 815–832 (2003).
Article CAS PubMed Google Scholar
Li, T.-H., Wang, C.-C., Zhang, L. & Chen, X. Snrmpacdc: Computational model focused on siamese network and random matrix projection for anticancer synergistic drug combination prediction. Brief. Bioinform. 24, bbac503 (2023).
Article PubMed Google Scholar
Chen, X. et al. Nllss: Predicting synergistic drug combinations based on semi-supervised learning. PLoS Comput. Biol. 12, e1004975 (2016).
Article PubMed PubMed Central Google Scholar
Yu, H. et al. Predicting and understanding comprehensive drug-drug interactions via semi-nonnegative matrix factorization. BMC Syst. Biol. 12, 101–110 (2018).
Article Google Scholar
Zhang, W. et al. Sflln: A sparse feature learning ensemble method with linear neighborhood regularization for predicting drug-drug interactions. Inf. Sci. 497, 189–201 (2019).
Article ADS CAS Google Scholar
Kumar Shukla, P. et al. Efficient prediction of drug-drug interaction using deep learning models. IET Syst. Biol. 14, 211–216 (2020).
Article PubMed PubMed Central Google Scholar
Feng, Y.-H., Zhang, S.-W. & Shi, J.-Y. Dpddi: A deep predictor for drug-drug interactions. BMC Bioinform. 21, 1–15 (2020).
Article Google Scholar
Lee, G., Park, C. & Ahn, J. Novel deep learning model for more accurate prediction of drug-drug interaction effects. BMC Bioinform. 20, 1–8 (2019).
Article Google Scholar
Rohani, N. & Eslahchi, C. Drug-drug interaction predicting by neural network using integrated similarity. Sci. Rep. 9, 13645 (2019).
Article ADS PubMed PubMed Central Google Scholar
Liu, S., Huang, Z., Qiu, Y., Chen, Y.-P. P. & Zhang, W. Structural network embedding using multi-modal deep auto-encoders for predicting drug-drug interactions. In 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 445–450 (IEEE, 2019).
Karim, M. R. et al. Drug-drug interaction prediction based on knowledge graph embeddings and convolutional-lstm network. In Proceedings of the 10th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics 113–123 (2019).
Wishart, D. S. et al. Drugbank 5.0: A major update to the drugbank database for 2018. Nucleic Acids Res. 46, D1074–D1082 (2018).
Article CAS PubMed Google Scholar
Kanehisa, M., Goto, S., Furumichi, M., Tanabe, M. & Hirakawa, M. Kegg for representation and analysis of molecular networks involving diseases and drugs. Nucleic Acids Res. 38, D355–D360 (2010).
Article CAS PubMed Google Scholar
Thorn, C. F., Klein, T. E. & Altman, R. B. Pharmgkb: The pharmacogenomics knowledge base. Pharmacogenom. Methods Protocols 2013, 311–320 (2013).
Article Google Scholar
Hou, X., You, J. & Hu, P. Predicting drug-drug interactions using deep neural network. In Proceedings of the 2019 11th International Conference on Machine Learning and Computing, 168–172 (2019).
Deng, Y. et al. A multimodal deep learning framework for predicting drug-drug interaction events. Bioinformatics 36, 4316–4322 (2020).
Article CAS PubMed Google Scholar
Zheng, Y. et al. Ddi-pulearn: A positive-unlabeled learning method for large-scale prediction of drug-drug interactions. BMC Bioinform. 20, 1–12 (2019).
Article CAS Google Scholar
Chen, X., Liu, X. & Wu, J. Drug-drug interaction prediction with graph representation learning. In 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 354–361 (IEEE, 2019).
Dai, Y., Guo, C., Guo, W. & Eickhoff, C. Drug-drug interaction prediction with wasserstein adversarial autoencoder-based knowledge graph embeddings. Brief. Bioinform. 22, bbaa256 (2021).
Article PubMed Google Scholar
Zhang, C. & Zang, T. Cnn-ddi: A novel deep learning method for predicting drug-drug interactions. In 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 1708–1713 (IEEE, 2020).
Chen, Y. et al. Muffin: Multi-scale feature fusion for drug-drug interaction prediction. Bioinformatics 37, 2651–2658 (2021).
Article CAS PubMed Google Scholar
Lin, X., Quan, Z., Wang, Z.-J., Ma, T. & Zeng, X. Kgnn: Knowledge graph neural network for drug-drug interaction prediction. IJCAI 380, 2739–2745 (2020).
Google Scholar
Purkayastha, S., Mondal, I., Sarkar, S., Goyal, P. & Pillai, J. K. Drug-drug interactions prediction based on drug embedding and graph auto-encoder. In 2019 IEEE 19th International Conference on Bioinformatics and Bioengineering (BIBE) 547–552 (IEEE, 2019).
Gottlieb, A., Stein, G. Y., Oron, Y., Ruppin, E. & Sharan, R. Indi: A computational framework for inferring drug interactions and their associated recommendations. Mol. Syst. Biol. 8, 592 (2012).
Article PubMed PubMed Central Google Scholar
Vilar, S. et al. Drug-drug interaction through molecular structure similarity analysis. J. Am. Med. Inform. Assoc. 19, 1066–1074 (2012).
Article PubMed PubMed Central Google Scholar
Vilar, S. et al. Similarity-based modeling in large-scale prediction of drug-drug interactions. Nat. Protoc. 9, 2147–2163 (2014).
Article CAS PubMed PubMed Central Google Scholar
Liu, S. et al. Enhancing drug-drug interaction prediction using deep attention neural networks. IEEE/ACM Trans. Comput. Biol. Bioinf. 20, 976–985 (2023).
Article Google Scholar
Yan, C. et al. Predicting drug-drug interactions based on integrated similarity and semi-supervised learning. IEEE/ACM Trans. Comput. Biol. Bioinf. 19, 168–179 (2020).
Article Google Scholar
Sadeghi, S. & Ngom, A. Ddipred: Graph convolutional network-based drug-drug interactions prediction using drug chemical structure embedding. In 2022 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB) 1–6 (IEEE, 2022).
Vijayan, A. & Chandrasekar, B. Advance single stage convolutional neural network for drug-drug interactions. In 2022 Fourth International Conference on Cognitive Computing and Information Processing (CCIP) 1–6 (IEEE, 2022).
Sivakumar, B. et al. Drug interaction prediction using various methods to reduce adverse effects. In 2022 6th International Conference on Trends in Electronics and Informatics (ICOEI) 123–127 (IEEE, 2022).
Chen, S. et al. Artificial intelligence-driven prediction of multiple drug interactions. Brief. Bioinform. 23, bbac427 (2022).
Article PubMed Google Scholar
Zhang, Y., Deng, Z., Xu, X., Feng, Y. & Junliang, S. Application of artificial intelligence in drug-drug interactions prediction: A review. J. Chem. Inf. Model. 2023, 854 (2023).
Google Scholar
Wei, J., Lu, Z., Qiu, K., Li, P. & Sun, H. Predicting drug risk level from adverse drug reactions using smote and machine learning approaches. IEEE Access 8, 185761–185775 (2020).
Article Google Scholar
Shtar, G., Rokach, L. & Shapira, B. Detecting drug-drug interactions using artificial neural networks and classic graph similarity measures. PLoS ONE 14, e0219796 (2019).
Article CAS PubMed PubMed Central Google Scholar
Qiu, W., Lv, Z., Hong, Y., Jia, J. & Xiao, X. Bow-gbdt: A gbdt classifier combining with artificial neural network for identifying gpcr-drug interaction based on wordbook learning from sequences. Front. Cell Dev. Biol. 8, 623858 (2021).
Article PubMed PubMed Central Google Scholar
Xie, Y. & Wang, C. Herb-drug interactions between panax notoginseng or its biologically active compounds and therapeutic drugs: A comprehensive pharmacodynamic and pharmacokinetic review. J. Ethnopharmacol. 2023, 116156 (2023).
Article Google Scholar
Vo, T. H., Nguyen, N. T. K. & Le, N. Q. K. Improved prediction of drug-drug interactions using ensemble deep neural networks. Med. Drug Discov. 17, 100149 (2023).
Article CAS Google Scholar
Chen, X. et al. Drug-target interaction prediction: Databases, web servers and computational models. Brief. Bioinform. 17, 696–712 (2016).
Article CAS PubMed Google Scholar
Chen, X., Guan, N.-N., Sun, Y.-Z., Li, J.-Q. & Qu, J. Microrna-small molecule association identification: From experimental results to computational models. Brief. Bioinform. 21, 47–61 (2020).
CAS PubMed Google Scholar
Wang, C.-C., Zhao, Y. & Chen, X. Drug-pathway association prediction: From experimental results to computational models. Brief. Bioinform. 22, bbaa061 (2021).
Article PubMed Google Scholar
Han, C.-D., Wang, C.-C., Huang, L. & Chen, X. Mcff-mtddi: Multi-channel feature fusion for multi-typed drug-drug interaction prediction. Brief. Bioinform. 2023, bbad215 (2023).
Article Google Scholar
Gill, J. et al. Comparing the applications of machine learning, pbpk, and population pharmacokinetic models in pharmacokinetic drug-drug interaction prediction. CPT Pharmacometr. Syst. Pharmacol. 11, 1560–1568 (2022).
Article CAS Google Scholar
Kim, S. et al. Pubchem substance and compound databases. Nucleic Acids Res. 44, D1202–D1213 (2016).
Article CAS PubMed Google Scholar
Kim, P. Convolutional neural network bt-matlab deep learning: With machine learning. Neural Netw. Artif. Intell. 2023, 121–147 (2023).
Google Scholar
Hecht-Nielsen, R. Theory of the backpropagation neural network. In Neural Networks for Perception 65–93 (Elsevier, 1992).
Krizhevsky, A., Sutskever, I. & Hinton, G. E. Imagenet classification with deep convolutional neural networks. Commun. ACM 60, 84–90 (2017).
Article Google Scholar
Ding, B., Qian, H. & Zhou, J. Activation functions and their characteristics in deep neural networks. In 2018 Chinese Control and Decision Conference (CCDC) 1836–1841 (IEEE, 2018).
Ioffe, S. & Szegedy, C. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International Conference on Machine Learning 448–456 (pmlr, 2015).
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I. & Salakhutdinov, R. Dropout: A simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014).
MathSciNet Google Scholar
Kinga, D., Adam, J. B. et al. A method for stochastic optimization. In International Conference on Learning Representations (ICLR), vol. 5 6 (San Diego, 2015).
Koidl, K. Loss functions in classification tasks. In School of Computer Science and Statistic Trinity College, Dublin 1–5 (2013).

Download references

Acknowledgements

Researchers would like to thank the Deanship of Scientific Research, Qassim University for funding publication of this project.

Author information

Authors and Affiliations

Department of Computer Science, CoE-AI, Center of Excellence Artificial Intelligence, Bahria University, Islamabad, Pakistan
Muhammad Asfand-e-yar, Qadeer Hashir & Waqar Khalil
Department of Computer Science, Bahria University, Islamabad , Pakistan
Asghar Ali Shah
Florida International University, Miami, USA
Hafiz Abid Mahmood Malik
Department of Management Information Systems and Production Management, College of Business and Economics, Qassim University, Buraydah 51452, Saudi Arabia
Abdullah Alourani

Authors

Muhammad Asfand-e-yar
View author publications
You can also search for this author in PubMed Google Scholar
Qadeer Hashir
View author publications
You can also search for this author in PubMed Google Scholar
Asghar Ali Shah
View author publications
You can also search for this author in PubMed Google Scholar
Hafiz Abid Mahmood Malik
View author publications
You can also search for this author in PubMed Google Scholar
Abdullah Alourani
View author publications
You can also search for this author in PubMed Google Scholar
Waqar Khalil
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.A., Q.H., A.A.S. and H.A.M.M. envisioned the idea for research designed, wrote and discussed the results. A.A. and W.K. worked on the literature and discussion section. All authors provided critical feedback, reviewed the paper, and approved the manuscript

Corresponding authors

Correspondence to Hafiz Abid Mahmood Malik or Abdullah Alourani.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Asfand-e-yar, M., Hashir, Q., Shah, A.A. et al. Multimodal CNN-DDI: using multimodal CNN for drug to drug interaction associated events. Sci Rep 14, 4076 (2024). https://doi.org/10.1038/s41598-024-54409-x

Download citation

Received: 05 December 2023
Accepted: 12 February 2024
Published: 19 February 2024
DOI: https://doi.org/10.1038/s41598-024-54409-x

Keywords

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.