Active learning graph neural networks for partial charge prediction of metal-organic frameworks via dropout Monte Carlo

Thaler, Stephan; Mayr, Felix; Thomas, Siby; Gagliardi, Alessio; Zavadlav, Julija

doi:10.1038/s41524-024-01277-8

Download PDF

Article
Open access
Published: 03 May 2024

Active learning graph neural networks for partial charge prediction of metal-organic frameworks via dropout Monte Carlo

npj Computational Materials volume 10, Article number: 86 (2024) Cite this article

543 Accesses
3 Altmetric
Metrics details

Subjects

Abstract

Metal-organic frameworks (MOF) are an attractive class of porous materials due to their immense design space, allowing for application-tailored properties. Properties of interest, such as gas sorption, can be predicted in silico with molecular mechanics simulations. However, the accuracy is limited by the available empirical force field and partial charge estimation scheme. In this work, we train a graph neural network for partial charge prediction via active learning based on Dropout Monte Carlo. We show that active learning significantly reduces the required amount of labeled MOFs to reach a target accuracy. The obtained model generalizes well to different distributions of MOFs and Zeolites. In addition, the uncertainty predictions of Dropout Monte Carlo enable reliable estimation of the mean absolute error for unseen MOFs. This work paves the way towards accurate molecular modeling of MOFs via next-generation potentials with machine learning predicted partial charges, supporting in-silico material design.

Benchmarking graph neural networks for materials chemistry

Article Open access 03 June 2021

CHGNet as a pretrained universal neural network potential for charge-informed atomistic modelling

Article Open access 14 September 2023

Graph neural networks for materials science and chemistry

Article Open access 26 November 2022

Introduction

Metal-organic-Frameworks (MOF) have become a centerpiece of the research on porous materials, building upon a tradition of research often centered around Zeolites. The large surface area and pores allow for novel applications such as physisorption for storage or filtering—both in gaseous^1,2 and solvent environments^3,4, hosting catalytic processes⁵ or innovative sensing mechanisms⁶. Compared to inorganic zeolites found in similar applications, MOFs are characterized by a mesostructural arrangement of metal nodes and organic linker-molecules, which form a network. While these can mimic the network topology of zeolites, such as in the ZIF-family of MOFs⁷, continuous research has allowed for an ever wide range of structures made possible by building block-like attachment chemistries of linker molecules on metal-clusters^8,9. Once a working experimental mechanism has been established to create a MOF from a specific set of precursor molecules, the subsequent modification of linker precursors allows for a rapid combinatorial increase of possible structures with varying properties¹⁰.

For standardized theoretical experiments across this diversity of structures, computational scientists have started to collate datasets ready for high-throughput computation. The earliest effort was the CoreMOF dataset^11,12 created from structures deposited in the Cambridge Structural Database (CSD)¹³. The QMOF database significantly improved on this, ensuring data provenance and proper deduplication of the CSD-data^14,15 while also adding structures from the growing collections of hypothetical MOFs^16,17,18. The latter idea is fully embraced in the MOFX¹⁹ and ARC-MOF²⁰ projects, which both collate and standardize structures across databases of hypothetical MOF-structures and zeolites.

Typically, high-throughput studies built on these databases target the sorption capacities of the structures for a set of standard, commercially relevant gases (e.g. CO2, N2, H2), which are computed via time-consuming grand-canonical molecular mechanics simulations²¹. These simulations are typically performed using generic force field parameters, empirical partial charge models for coulombic interactions as well as rigid framework structures²². However, the fidelity of this approach is hampered by the accuracy of available classical force fields^23,24 as well as the rigid framework assumption, given that experiments show that properties can change drastically by framework flexibility and naturally occurring defects^25,26. Direct surrogate modeling using recent Machine Learning (ML) methods circumvents the cost of molecular mechanics simulations, but the underlying classical force field data limits the accuracy of these ML estimators^27,28,29.

ML potentials promise significantly increased accuracy compared to classical force fields^30,31,32,33 and are increasingly used as a drop-in replacements in molecular mechanics simulations^{34,35,36,37,38}. Encouraging first results in MOF applications³⁹ and growing evidence that with increasing system size, separating the interactions into a short-range ML potential component and a classical long-range electrostatics component is beneficial to improve accuracy and computational efficiency^40,41,42, emphasize the importance of a reliable partial charge assignment modeling mechanism. Standalone, such a model can be used for diverse purposes, ranging from computing observables, such as dipole momenta for small molecules^43,44, to replacing older empirical charge models used in REAX-FF-MD⁴⁵. With an appropriate loss, the model can also be trained end-to-end to model potential energy surfaces for ionic materials^46,47.

Recent work has shown that ML partial charge predictors can also be used to replace computationally expensive density functional theory (DFT) computations for MOFs^48,49,50. However, due to the high dimensionality of chemical space, achieving a sufficient training data coverage by labeling randomly generated structures tends to be computationally prohibitive. This applies particularly to the context of screening for compounds, where the ML model needs to predict the labels for parts of chemical space, where by definition, there are no experimental or simulation data yet⁵¹. Active learning^52,53 (AL) promises to efficiently generate diverse training datasets by only labeling configurations for which the model is uncertain, maximizing the information content per structure to reduce the required amount of labeled data significantly. To achieve this goal, AL builds the training data in an iterative manner by screening for high uncertainty inputs, labeling them and re-training the model on the extended training data. Consequently, AL efficacy critically depends on the quality of the employed uncertainty quantification (UQ) scheme to estimate the prediction error. For state-of-the-art graph neural network (GNN) surrogate models, popular UQ schemes include the Deep Ensemble method^54,55, stochastic-gradient Markov chain Monte Carlo (SG-MCMC)^56,57,58,59 and Dropout Monte Carlo^60,61 (DMC). However, the former two methods are inefficient for AL given that they require retraining of several models at each AL iteration.

In this work, we investigate the efficacy of DMC for AL of MOF partial charges. To that end, we predict partial charges using a charge neutrality-enforcing⁴⁸ GNN model, which we augment with Dropout layers to enable UQ via DMC. DMC represents a computationally efficient UQ scheme in an AL setting given that only a single parameter set needs to be re-trained at each AL iteration. We showcase the efficacy of DMC-based AL by training the GNN on the QMOF database and comparing its performance with an optimal AL oracle and a random baseline. Beyond AL, DMC can assess the model validity in the face of novel datapoints – allowing practitioners to estimate the expected error of the prediction. To evaluate the generalization and UQ capability of the obtained GNN under distribution shift, we benchmark the model on a subset of ARC-MOF and IZA-Zeolite.

Results

Active learning scheme

Figure 1a visualizes the AL pipeline used to train the GNN partial charge prediction model. The GNN enforces charge neutrality and features Dropout⁶² with a probability of p = 10% at all linear layers to enable UQ via DMC⁶⁰ (more details in the methods section). We split the available training data into two sets: An initially small labeled set and an initially large pool set that is considered to be unlabeled. The AL cycle starts by training the GNN on the labeled set. Afterwards, we evaluate the uncertainty of the model with respect to all MOFs in the pool set. To this end, we predict the partial charges q associated with the atoms of each MOF for D = 8 different random Dropout configurations to estimate the mean μ and standard deviation σ across the D different predictions as

$${{{\boldsymbol{\mu }}}}=\frac{1}{D}\mathop{\sum }\limits_{d=1}^{D}{{{{\bf{q}}}}}_{d}\quad ;\quad {{{\boldsymbol{\sigma }}}}=\frac{1}{D-1}\mathop{\sum }\limits_{d=1}^{D}\sqrt{{({{{{\bf{q}}}}}_{d}-{{{\boldsymbol{\mu }}}})}^{2}}\,,$$

(1)

where q_d are the predicted partial charges corresponding to Dropout configuration d.

**Fig. 1: Graph neural network training and testing.**

From the predicted standard deviation σ for each atom, we compute the average standard deviation of each MOF

$${\delta }_{{{{\rm{MOF}}}}}=\frac{1}{{N}_{{{{\rm{atoms}}}}}}\mathop{\sum }\limits_{k=1}^{{N}_{{{{\rm{atoms}}}}}}{\sigma }_{k}\,,$$

(2)

where N_atoms is the number of atoms in the MOF. This allows to select the 16 MOFs with the largest δ_MOF in the pool set. As an alternative to δ_MOF, the MOFs could also be selected based on the maximum σ_k. Afterwards, we retrieve the labels of the selected MOFs, which emulates computing partial charge labels via DFT in practice. The newly labeled MOFs are added to the labeled set and retraining closes the AL loop. Training ends after a pre-defined accuracy threshold is reached or the whole pool set has been added to the labeled set. Refer to the methods section for a full set of AL hyperparameters.

After finishing training, the goal is to test the performance of the obtained model and its uncertainty estimates under the realistic scenario of distribution shift. To this end, we benchmark the model with respect to three datasets distinct from the training data (Fig. 1b): first, structurally bigger MOFs from the same distribution, second, MOFs from a different distribution and last, Zeolites representing related, but different structures.

Datasets

We deploy v13 of the QMOF dataset (as documented on the project homepage: https://github.com/Andrew-S-Rosen/QMOF/blob/main/updates.md), which contains 20375 MOFs with 77 different atom types. To test the generalization capabilities of the obtained NN model with respect to large structures of the same distribution, we hold-out all MOFs with more than 100 atoms (Fig. 2a). Given that GNNs cannot be expected to accurately model atoms that are not contained in the training data, we remove all MOFs containing atom types that occur less than 10 times in the < 100 atom MOF data subset. Consequently, the number of atom types reduces to 74, the set of small MOFs is reduced by 4 to 11173 and the set of large MOFs is reduced by 42–9156. Finally, we obtain the training dataset by applying a random 80%–8%–12% training-validation-test split to the small MOF dataset.

To further evaluate the model performance with regards to inputs from a different distribution of MOFs, we consider the ARC-MOF database²⁰. The database collates structures from 14 pre-existing datasets and includes additional synthetic structures generated with pormake¹⁷. We neglected the ARC-MOF databases 12 and 14 to minimize the overlap with the QMOF dataset to create real-world out-of-sample distributions. We chose a stratified sampling scheme that selects a minimum of 5% or a maximum of 20 structures per database to distinguish performance discrepancies among distinct datasets. We have included further comparisons illustrating the difference between these distributions in the Supplementary Figs. 1–3. Finally, we benchmark our model on 20 randomly selected structures from the IZA-Zeolite dataset acquired via MOFX-DB¹⁹. Zeolites have a different atomic structure than MOFs, but are commonly used in similar application areas, providing a challenging benchmark for generalization. In order to maintain consistency with the QMOF dataset, we computed partial charge labels of the structures sampled from ARC-MOF and IZA-Zeolite using the DFT settings used in QMOF as outlined in the methods section.

Evaluation metrics

Due to the chemical structure of MOFs, the majority of atoms correspond to organic linkers. Only four atom types (H, C, N, O) constitute 92.6% of the atoms in the QMOF dataset (Fig. 2b). Consequently, partial charge prediction for MOFs is a highly class-imbalanced ML problem.

The mean absolute error (MAE) is most commonly used to evaluate the performance of partial charge predictions for MOFs⁵⁰, likely due to its straightforward interpretability. However, given the class imbalance of MOF atoms, MAE is dominated by organic linker atoms and is therefore an insufficient metric to evaluate model performance across the broad range of metal nodes. We therefore propose the per-species MAE (SMAE) as an alternative, interpretable metric to evaluate model performance for highly class-imbalanced datasets:

$${{{\rm{SMAE}}}}=\frac{1}{{N}_{{{{\rm{species}}}}}}\mathop{\sum }\limits_{i=1}^{{N}_{{{{\rm{species}}}}}}{{{{\rm{MAE}}}}}^{(i)}\quad {{{\rm{with}}}}\quad {{{{\rm{MAE}}}}}^{(i)}=\frac{1}{{N}_{{{{\rm{i}}}}}}\mathop{\sum }\limits_{k=1}^{{N}_{{{{\rm{i}}}}}}\left| {\mu }_{k}^{(i)}-{\hat{q}}_{k}^{(i)}\right| \,,$$

(3)

where N_species is the number of different atom types, N_i is the number of atoms of atom type i in the data (sub)set and ${\hat{q}}^{(i)}$ is the corresponding partial charge label. All atom types contribute equally to SMAE, effectively increasing the weight of metal nodes compared to the MAE.

Partial charge prediction performance

We evaluate the performance of the GNN for partial charge prediction by training the model on the full training data for 500 epochs without Dropout. On the test set, the resulting model yields MAE = 0.0083e and SMAE = 0.0339e, respectively (Table 1). This is in line with reported performance of partial charge prediction for MOFs in the literature with MAE values of 0.0192e⁵⁰ and 0.025e⁴⁸ on the CoRE MOF-2019 dataset¹². Training the GNN with a Dropout probability of p = 10% yields test set errors of MAE = 0.0115e and SMAE = 0.0437e. We found that increasing p monotonously increases the test set errors (Supplementary Fig. 4). Consequently, we selected a small Dropout rate of p = 10% for the subsequent active learning task to minimize the model accuracy penalty from Dropout. Throughout the manuscript, we consider the mean over D = 8 Dropout configurations μ as the prediction of GNN models with Dropout. We choose this evaluation scheme because generating a single prediction by simply deactivating the Dropout layers at inference time yields significantly worse predictions across all our experiments.

Table 1 Predictive performance summary

Full size table

To set these metrics into perspective and judge whether the performance of the GNN is sufficient for surrogate modeling, we compare them to the error resulting from using a computationally less expensive DFT simulation with Γ-only sampling (as performed by Kancharlapalli et al.⁵⁰) as a predictor for the results with a finer k-point grid (as in the work of Nazarian et al.⁶³ and the QMOF dataset¹⁴). Using the 1260 identical structures that are shared between the works of Kancharlapalli et al.⁵⁰ and Nazarian et al.⁶³, we find MAE = 0.058e and SMAE = 0.136e. Accounting for possible outliers and noise, we set the target accuracy for our model to 50% of those values (Fig. 3). Thus, the achieved accuracy levels of our models with and without Dropout are more than sufficient for surrogate modeling.

Active learning

The goal of AL is to train a ML model that achieves the target accuracy while requiring the least amount of labeled training data. We benchmark the DMC-based AL method against a baseline scheme that randomly selects MOFs to be added to the training set as well as an AL oracle that uses the (in practice unavailable) true MAE of the MOF as the AL selection criterion:

$${{{{\rm{MAE}}}}}_{{{{\rm{MOF}}}}}=\frac{1}{{N}_{{{{\rm{atoms}}}}}}\mathop{\sum }\limits_{k=1}^{{N}_{{{{\rm{atoms}}}}}}| {\mu }_{k}-\hat{{q}_{k}}| .$$

(4)

As visualized in Fig. 3a, Dropout AL outperforms the random selection baseline, reducing the required amount of training data to reach the SMAE accuracy target to 13% of the training data compared to 24% for the random selection. As expected, the AL oracle outperforms the Dropout AL, reaching the DFT accuracy target at 6% of the training data. Nonetheless, at more than 14% of the training data, Dropout AL closes this gap and achieves the same performance as the oracle.

The increase in data efficiency of AL becomes more pronounced if duplicate structures exist in the dataset: We applied the AL pipeline to the original CoreMOF-2019¹² dataset containing 817 duplicate structures, which we detected via an improved de-duplication screening using the method described by Rosen et al.¹⁴. For this dataset, the random selection baseline required almost 4 times as many labeled samples as the Dropout AL scheme to reach the accuracy target (Supplementary Fig. 5). Interestingly, when considering MAE, AL appears to provide no significant benefit - even with the true error as uncertainty metric (Fig. 3b). The performance measured by MAE seems to be simply a function of the amount of training data, irrespective of the diversity of the structures.

Test set performance analysis

We analyze the performance of the GNN at the end of the active learning training (100% of the training data, termed AL GNN) on the small MOF test set. The AL GNN achieves a test set accuracy (MAE = 0.0136e, SMAE = 0.0325e) similar to the GNN directly trained on all the training data (Table. 1). Among organic linker atom types, the AL GNN features very small MAE only for hydrogen (MAE = 0.0063e). Interestingly, despite the large amount of atoms in the training data, the resulting MAE of carbon (0.0143e), nitrogen (0.0180e) and oxygen (0.0174e) atoms are in line with the MAE of many metal nodes (Supplementary Fig. 6). There are 4 atom types with large MAE > 0.1e: Hf, Nb, Fe, and, Sn (in error decreasing order). The large MAE of Hf and Nb can be explained by a lack of training data because both atom types appear less than 10 times in the training data. In contrast, there are 354 Fe and 233 Sn atoms in the training set. The cause for the large MAE of these more common metals is a topic for further research – a cursory visual inspection of 6 samples with individual per-atom-errors larger than 0.2e did not reveal any obvious commonalities, except 83% of samples exhibiting nonstandard-topologies that evaded classification with moffragmentor⁶⁴.

Even though the per-species MAE of organic linker atoms is comparatively small, there are several atoms in the test data that exhibit very large errors (Fig. 4). Interestingly, these large error atoms tend to cluster together at similar DFT charge values (e.g. $\hat{q}\,\approx\, 0.65$ e or $\hat{q}\,\approx\, -0.15$ e for carbon and $\hat{q}\,\approx\, -0.55$ e or $\hat{q}\,\approx\, -0.95$ e for oxygen). The AL GNN appears to contain little information about these clusters given that the model predicts almost the full range of partial charges found for this atom type (see Supplementary Fig. 7 for dedicated parity plots for each organic atom type). Manual inspection of structures with very large error ( > 0.2e per atom) reveals several interesting features: First, 20 of the 24 structures exhibit both abnormal errors for oxygen and carbon, while 70% of those are classified as the lvt-topology. Visual inspection reveals that nearly every sample exhibits characteristic wire-like structures of carbon-atoms (see a sample in Supplementary Fig. 8), which appears to reduce the ability of our GNN to model this structure. Further research could help elucidate if these structures are physically feasible or if database screening criteria need to be refined.

Dropout Monte Carlo inference

Next, we investigate the generalization capabilities and the quality of uncertainty estimates of the AL GNN. The uncertainty estimate in Eq. (1) typically underestimates uncertainties for held-out data⁶⁵. This is particularly true for the GNN in this work due to the small Dropout probability p = 10%. While a grid search over the Dropout rate could be performed to improve the calibration of the uncertainty estimates via a higher Dropout rate^60,66, this higher rate would decrease the predictive accuracy (Supplementary Fig. 4). To avoid increasing the Dropout rate, we compute a calibrated uncertainty $\tilde{{{{\boldsymbol{\sigma }}}}}$ by scaling the distribution of predicted uncertainties σ (Eq. (1) such that the variance in the uncertainty estimates matches the variance of the error on the validation set, as is common in the literature⁶⁵:

$$\tilde{{{{\boldsymbol{\sigma }}}}}=\alpha * {{{\boldsymbol{\sigma }}}}\,\,\,{{{\rm{with}}}}\,\,\,\alpha =\sqrt{-\frac{1}{D}+\frac{D-3}{D-1}\frac{1}{{N}_{{{{\rm{val}}}}}}{\sum }_{l=1}^{{N}_{{{{\rm{val}}}}}}\frac{{({\mu }_{l}-{\hat{q}}_{l})}^{2}}{{\sigma }_{l}^{2}}}\,,$$

(5)

where N_val is the total number of partial charge labels ${\hat{q}}_{l}$ in the validation set and μ_l and σ_l are the DMC mean and standard deviations of the corresponding atom (Eq. (1)). The calibration of the AL GNN via Eq. (5) yields α = 1.504.

We investigate the model errors as well as the corresponding uncertainty estimates on three levels for each dataset: First, we analyse the MAE and SMAE for the whole dataset. Second, we compute MAE_MOF (Eq. (4)) for each MOF to assess the performance of the UQ scheme to provide reliable uncertainty estimates for individual MOFs. To this end, we estimate MAE_MOF via the mean calibrated standard deviation ${\tilde{\delta }}_{{{{\rm{MOF}}}}}$:

$${\tilde{\delta }}_{{{{\rm{MOF}}}}}=\frac{1}{{N}_{{{{\rm{atoms}}}}}}\mathop{\sum }\limits_{k=1}^{{N}_{{{{\rm{atoms}}}}}}{\tilde{\sigma }}_{k}\,.$$

(6)

Last, we investigate the absolute errors on the atom level $| \mu -\hat{q}|$ to assess whether the calibrated standard deviation of the atom $\tilde{\sigma }$ is able to identify regions of high error within the MOF.

On the test set, we find that the calibrated DMC can estimate aggregate error metrics well: While the mean $\tilde{\sigma }$ of 0.0159e slightly overestimates the MAE of 0.0136e, the SMAE of 0.0354e is slightly underestimated with 0.0325e. On the level of single MOFs, the DMC uncertainty estimate is reliable in identifying large MAE_MOF (Fig. 5a): The vast majority of MAE_MOF errors are contained within the $3{\tilde{\delta }}_{{{{\rm{MOF}}}}}$ credible interval and a correlation coefficient of 0.94 indicates a strong correlation between the two. However, this UQ performance does not translate to the atom level, where the correlation coefficient between $| \mu -\hat{q}|$ and $\tilde{\sigma }$ is only 0.67. In addition, many atoms are not contained in the $3\tilde{\sigma }$ credible interval, which indicates that DMC cannot reliably predict error bounds for single atoms (Fig. 5b).

Quality of dropout Monte Carlo under distribution shift

First, we assess the generalization capabilities of the GNN with respect to large MOFs of the same distribution by predicting the partial charges of MOFs with more than 100 atoms in the QMOF dataset (Fig. 2a). The resulting errors of MAE = 0.0114e and SMAE = 0.0468e demonstrate that the obtained GNN generalizes to large MOFs very well. Compared to the small MOF test set, the MAE is slightly lower, while the SMAE is slightly larger. We attribute the smaller MAE in parts to the larger ratio of organic linker atoms of 94.5% compared to 92.6% in the small MOF test set. The DMC uncertainty estimates also show similar properties: The MAE is slightly overestimated with a prediction of 0.0140e, the SMAE is slightly underestimated with a prediction of 0.0405e and the vast majority of MOFs are included in the $3\tilde{\sigma }$ credible interval (Fig. 5c).

Next, we evaluate the prediction errors for MOFs with more than 100 atoms from the QMOF dataset that contain unseen atom types. As expected, with an MAE of 0.0316e and a SMAE of 0.1485, the prediction error is substantially larger compared to the case without unknown atom types. However, the corresponding UQ estimates for MAE and SMAE of 0.0141e and 0.0389e significantly underestimate the errors. On the MOF level, the errors are mostly contained within the $3{\tilde{\delta }}_{{{{\rm{MOF}}}}}$ credible interval (Fig. 5c). However, the DMC UQ estimate does not assign large uncertainties to these MOFs. This becomes particularly clear at the atomic level (Fig. 5d): Even though, as expected, the predictions for atom types unknown to the model are highly inaccurate, the predicted uncertainty is comparatively small, resulting in significant overconfidence.

Last, we test the generalization capacity of the obtained GNN to different structures by evaluating the model on our DFT-labeled subsets of the ARC-MOF database²⁰ and the IZA-Zeolite dataset. As expected, the prediction error increases in both cases due to the distribution shift: The model yields errors of MAE = 0.0239e, SMAE = 0.0696e and MAE = 0.0368e, SMAE = 0.0386e for the ARC-MOF and IZA Zeolite datasets, respectively. Considering that these errors remain in the range of the target accuracy, the generalization capabilities of the obtained GNN appear to be sufficient for its application under distribution shift in practice. In the case of ARC-MOF data, the UQ prediction underestimates these aggregate error metrics, with predictions of 0.0153e and 0.0369e for MAE and SMAE, respectively. Nonetheless, the vast majority of MOFs are contained in the $3{\tilde{\delta }}_{{{{\rm{MOF}}}}}$ credible interval (Fig. 5e). In the case of IZA Zeolites, with uncertainty estimates of 0.0887e and 0.0914e for MAE and SMAE, DMC overestimates these aggregate error metrics. The DMC UQ clearly highlights that predictions for these Zeolites are uncertain (Fig. 5e), enabling practitioners to interpret obtained partial charge predictions accordingly or re-train the GNN with additional Zeolite training data. These findings are reflected at the atomic level, where DMC is overconfident in ARC-MOF prediction and slightly underconfident in IZA Zeolite predictions (Fig. 5f).

The Zeolite example with its large uncertainty predictions with respect to a shift in input structures is in stark contrast to the low uncertainty predictions with respect to unknown atom types (Fig. 5d). While the latter UQ error is more severe due to providing overconfident predictions, it can be easily avoided by only applying the model to MOFs with known atom types. The reason behind the different UQ results can be likely found in the GNN architecture: Evaluating DimeNet++^33,67 on unknown atom types retrieves a randomly initialized atom type embedding from a look-up table, while changes in the input configuration propagate through the network via perturbations in the radial and spherical Bessel functions⁶⁷. The variance in GNN activations introduced by DMC appears to be amplified more by unexpected radial and spherical Bessel function values than by unexpected atom type embeddings, leading to comparatively smaller uncertainty predictions in the latter case. However, further research is required to uncover the detailed mechanisms that cause the difference in UQ estimates.

Discussion

The obtained GNN model yields partial charge prediction errors significantly below our DFT-derived target, promoting its application to downstream computational studies: The GNN could be used out-of-the-box to extend the feature set of ML-based MOF screening studies such as those performed by Ren et al.⁶⁸ – while only studied noble gases are studied, ML-predicted partial charges could help simulations with more complex molecules or provide additional, tabulated features to the gradient-boosting model. Additionally, the obtained GNN is suitable to predict partial charges for neural network potentials that model long-range electrostatic interactions⁴⁰ or could be used as an enhancement for descriptor-based models⁶⁹. Importantly, high-quality partial charge predictions can be obtained even under distribution shift, given that the GNN is able to generalize to large MOF structures, hypothetical MOFs and Zeolites, with errors below or close to their target values. This is particularly important for in-silico screening tasks, where the training and inference distributions are inherently different, representing a challenging setting for ML models trained on existing benchmark datasets^51,70,71. Nonetheless, the GNN model could still be inaccurate in some cases, as shown for MOFs with unknown atom types, highlighting the need for reliable UQ.

This work demonstrated that DMC provides sensible uncertainty estimates: The $3\tilde{\delta }$ credible interval contains the MAE_MOF in the vast majority of cases, even for out-of-distribution settings, allowing practitioners to judge whether the expected error is sufficiently small for the application at hand. In addition, DMC is able to recognize zeolites as unknown structures. This is important because it indicates that MOFs with defects or with a perturbed framework might also be recognizable. On the other hand, this work has also revealed some shortcomings of DMC: First, adding Dropout to linear layers in GNNs tends to decrease predictive accuracy for larger Dropout probabilities. Second, in order to obtain the most accurate results with a DMC-trained model, we found that simply deactivating all Dropout layers was insufficient. Hence, approximating the mean over multiple Dropout configurations is necessary, which increases the computational cost at inference time. However, it should be noted that inference is nonetheless several orders of magnitude faster than querying the DFT calculation. Retraining the GNN model without Dropout on the dataset obtained by AL would be a straightforward solution to these issues, avoiding any penalty on accuracy and obtaining predictions in a single forward pass. While the retrained model does not have UQ capabilities on its own, the DMC uncertainty estimates might be distilled into it via student-teacher learning^72,73,74. Alternatively, Gaussian Processes are attractive models for UQ because they provide uncertainty estimates without computational overhead⁷⁵. In particular, recent method developments such as Deep Kernel Learning^76,77 and linear scaling Gaussian Processes using graph-based feature-extractors⁷⁸ have the potential to increase the efficiency of UQ without compromising predictive performance compared to state-of-the-art GNNs. Benchmarking their performance, in particular in an AL context, against GNN baselines such as DMC⁶⁰ and the Deep Ensemble method⁵⁵ is an interesting direction for future research.

Our results show that UQ is not only important for trustworthy predictions, but also allows to train ML models efficiently using AL. However, the benefits of AL only become apparent if an evaluation metric is used that accounts for the strong class imbalance of MOFs, highlighting the importance of selecting appropriate evaluation criteria. In the context of MOFs, while DFT-labeled databases kept increasing in size over the past years^11,12,14,20, they will never contain all possible MOF structures. Based on our emulated database-building experiment, where AL clearly outperformed the random selection baseline, AL could play an important role in constructing the next iteration of MOF databases, maximizing the information content per structure. In addition, AL may prove valuable in addressing deduplication problems in MOF datasets: These range from failed deduplication in widely used databases¹² to the more subtle problem of different compounds containing the same structural motifs. AL may provide an elegant solution, as motifs with similar properties will be identified rapidly. MOF databases with broad structural coverage are the foundation to obtain transferable ML partial charge prediction models, paving the way towards accurate molecular modeling of MOFs, e.g. for in-silico material design.

Methods

Probabilistic GNN for partial charge prediction

To predict partial charges, we build on our custom DimeNet++^33,67 implementation⁷⁹ with default hyperparameters, which learns for each atom type an embedding vector. We augment the original backbone architecture via a scheme that enforces charge neutrality. Given that the output blocks of DimeNet++ predict per-atom scalars, we consider these scalars to be raw partial charges ${\bar{q}}_{j}$. In order to enforce charge neutrality, we subtract the average net charge of the MOF from each atom to obtain the final partial charges⁴⁸

$${q}_{j}={\bar{q}}_{j}-\frac{1}{{N}_{{{{\rm{atoms}}}}}}\mathop{\sum }\limits_{k=1}^{{N}_{{{{\rm{atoms}}}}}}{\bar{q}}_{k}\,.$$

(7)

The performance of the simple charge neutrality approach in Eq. (7) has been shown to be similar to more sophisticated schemes available in the literature⁴⁸.

Supplementary Fig. 9 shows the net charge per atom of the AL GNN for MOFs in the small MOF test set when deactivating the charge neutrality scheme. Given that eq. (7) corrects shifts in the net charge, the GNN mean net charge prediction becomes arbitrary. In the case of the AL GNN, the mean net charge per atom is shifted to − 0.0284e. The average charge redistribution per atom of the AL GNN is comparatively small with 0.0048e per atom.

To enable DMC, we apply Dropout to all output neurons of each linear layer in the DimeNet++ architecture across all Embedding, Interaction, Residual and Output blocks (see fig. 1 in the original manuscript⁶⁷). In particular, we do not apply Dropout to the message passing connectivity, nor to the initial edge embedding vectors. This way, we leave the molecular graph representation unaltered and only introduce noise into the learned transformations.

Active learning hyperparameters

We initialize the labeled set with 89 randomly selected MOFs from the training data (1% of the overall training data). We select a batch size of 8 MOFs and initially train the GNN model for 1000 epochs on the labeled set. After adding newly labeled MOFs, we preferentially train on these data points given that the model is already trained on the remainder of the labeled set. To this end, we employ a two-step training procedure: First, we train the model for 200 epochs of the newly labeled data on mini-batches consisting of 2 newly labeled MOFs and 6 randomly drawn MOFs from the labeled set. This represents a trade-off between avoiding overfitting to the newly labeled data and improved computational efficiency due to preferential training. Second, we train the GNN for 3 more epochs on the full labeled set to ameliorate the bias towards newly labeled MOFs from the previous preferential training.

Density functional theory setup

For computing partial charge labels, we employ the Vienna ab initio Simulation Package (VASP)⁸⁰ version 6.2.1 and chargemol v09_02_2017⁸¹. Using version 54 of the Projector-Augmented-Wave (PAW) pseudopotentials, we perform simulations using the PBE-functional, a kinetic energy cutoff of 520 eV, Gaussian smearing for the band occupancies of 0.01 eV and an energy convergence criterion of 1e-6. We choose k-point sampling such that (#k-points ⋅ real space cell-vector) > = 24, in line with chargemol best practices. The partial charges are then assigned from the resulting scf-charge density using the Density Derived Electrostatic and Chemical (DDEC)-charges assignment scheme⁸¹. We verified that these settings reproduce the partial charges from the QMOF-settings to numerical accuracy.

Data availability

Data (weights of the final trained model and data used for training and evaluation) is publicly available at: https://github.com/tummfm/mof-al. Instructions to acquire the QMOF-dataset including structural data and DFT-based labels are available here: https://github.com/Andrew-S-Rosen/QMOF. Structures for the ARCMOF-database can be found at https://doi.org/10.5281/zenodo.10818822. For the IZA-database they where acquired via the MOFDB-X-project: https://mof.tech.northwestern.edu/databases (version dc8a0295db).

Code availability

The active learning code is publicly available at the following GitHub repository: https://github.com/tummfm/mof-al.

References

Murray, L. J., Dincă, M. & Long, J. R. Hydrogen storage in metal-organic frameworks. Chem. Soc. Rev. 38, 1294 (2009).
Article CAS PubMed Google Scholar
DeSantis, D. et al. Techno-economic analysis of metal-organic frameworks for hydrogen and natural gas storage. Energy Fuels 31, 2024–2032 (2017).
Article CAS Google Scholar
Kobielska, P. A., Howarth, A. J., Farha, O. K. & Nayak, S. Metal-organic frameworks for heavy metal removal from water. Coord. Chem. Rev. 358, 92–107 (2018).
Article CAS Google Scholar
Li, R. et al. Efficient removal of per- and polyfluoroalkyl substances from water with zirconium-based metal-organic frameworks. Chem. Mater. 33, 3276–3285 (2021).
Article CAS Google Scholar
Kang, Y.-S. et al. Metal-organic frameworks with catalytic centers: From synthesis to catalytic application. Coord. Chem. Rev. 378, 262–280 (2019).
Article CAS Google Scholar
Li, H.-Y., Zhao, S.-N., Zang, S.-Q. & Li, J. Functional metal-organic frameworks as effective sensors of gases and volatile compounds. Chem. Soc. Rev. 49, 6364–6401 (2020).
Article CAS PubMed Google Scholar
Park, K. S. et al. Exceptional chemical and thermal stability of zeolitic imidazolate frameworks. Proc. Natl. Acad. Sci. USA 103, 10186–10191 (2006).
Article CAS PubMed PubMed Central Google Scholar
Perry, J. J., Perman, J. A. & Zaworotko, M. J. Design and synthesis of metal-organic frameworks using metal-organic polyhedra as supermolecular building blocks. Chem. Soc. Rev. 38, 1400 (2009).
Article CAS PubMed Google Scholar
Abednatanzi, S. et al. Mixed-metal metal-organic frameworks. Chem. Soc. Rev. 48, 2535–2565 (2019).
Article CAS PubMed Google Scholar
Lu, W. et al. Tuning the structure and function of metal-organic frameworks via linker design. Chem. Soc. Rev. 43, 5561–5593 (2014).
Article CAS PubMed Google Scholar
Chung, Y. G. et al. Computation-ready, experimental metal-organic frameworks: A tool to enable high-throughput screening of nanoporous crystals. Chem. Mater. 26, 6185–6192 (2014).
Article CAS Google Scholar
Chung, Y. G. et al. Advances, updates, and analytics for the computation-ready, experimental metal-organic framework database: CoRE MOF 2019. J. Chem. Eng. Data 64, 5985–5998 (2019).
Article CAS Google Scholar
Allen, F. H. The cambridge structural database: a quarter of a million crystal structures and rising. Acta Crystallogr. B 58, 380–388 (2002).
Article PubMed Google Scholar
Rosen, A. S. et al. Machine learning the quantum-chemical properties of metal–organic frameworks for accelerated materials discovery. Matter 4, 1578–1597 (2021).
Article CAS Google Scholar
Rosen, A. S.et al. High-throughput predictions of metal-organic framework electronic properties: theoretical challenges, graph neural networks, and data exploration. npj Comput. Mater. 8, 112 (2022).
Wilmer, C. E. et al. Large-scale screening of hypothetical metal-organic frameworks. Nat. Chem. 4, 83–89 (2011).
Article PubMed Google Scholar
Lee, S. et al. Computational screening of trillions of metal-organic frameworks for high-performance methane storage. ACS Appl. Mater. Interfaces 13, 23647–23654 (2021).
Article CAS PubMed Google Scholar
Nandy, A., Duan, C. & Kulik, H. J. Using machine learning and data mining to leverage community knowledge for the engineering of stable metal-organic frameworks. JACS 143, 17535–17547 (2021).
Article CAS Google Scholar
Bobbitt, N. S. et al. MOFX-DB: An online database of computational adsorption data for nanoporous materials. J. Chem. Eng. Data 68, 483–498 (2023).
Article CAS Google Scholar
Burner, J. et al. ARC-MOF: A diverse database of metal-organic frameworks with DFT-derived partial atomic charges and descriptors for machine learning. Chem. Mater. 35, 900–916 (2023).
Article CAS Google Scholar
Dubbeldam, D., Calero, S., Ellis, D. E. & Snurr, R. Q. RASPA: Molecular simulation software for adsorption and diffusion in flexible nanoporous materials. Mol. Simul. 42, 81–101 (2015).
Article Google Scholar
Altintas, C. et al. An extensive comparative analysis of two MOF databases: high-throughput screening of computation-ready MOFs for CH₄ and H₂ adsorption. J. Mater. Chem. A 7, 9593–9608 (2019).
Article CAS Google Scholar
Dürholt, J. P., Fraux, G., Coudert, F.-X. & Schmid, R. Ab initio derived force fields for zeolitic imidazolate frameworks: MOF-FF for ZIFs. J. Chem. Theory Comput. 15, 2420–2432 (2019).
Article PubMed Google Scholar
Weng, T. & Schmidt, J. R. Flexible and transferable ab initio force field for zeolitic imidazolate frameworks: ZIF-FF. J. Phys. Chem. A 123, 3000–3012 (2019).
Article CAS PubMed Google Scholar
Ghosh, P., Colón, Y. J. & Snurr, R. Q. Water adsorption in UiO-66: the importance of defects. Chem. Commun. 50, 11329–11331 (2014).
Article CAS Google Scholar
Han, C., Verploegh, R. J. & Sholl, D. S. Assessing the impact of point defects on molecular diffusion in ZIF-8 using molecular simulations. J. Phys. Chem. Lett. 9, 4037–4044 (2018).
Article CAS PubMed Google Scholar
Fernandez, M., Trefiak, N. R. & Woo, T. K. Atomic property weighted radial distribution functions descriptors of metal-organic frameworks for the prediction of gas uptake capacity. J. Phys. Chem. C. 117, 14095–14105 (2013).
Article CAS Google Scholar
Sun, Y. et al. Fingerprinting diverse nanoporous materials for optimal hydrogen storage conditions using meta-learning. Sci. Adv. 7 (2021).
Wu, X. et al. Mapping the porous and chemical structure-function relationships of trace CH₃I capture by metal-organic frameworks using machine learning. ACS Appl. Mater. Interfaces 14, 47209–47221 (2022).
Article CAS PubMed Google Scholar
Behler, J. & Parrinello, M. Generalized neural-network representation of high-dimensional potential-energy surfaces. Phys. Rev. Lett. 98, 146401 (2007).
Article PubMed Google Scholar
John, S. & Csányi, G. Many-body coarse-grained interactions using gaussian approximation potentials. J. Phys. Chem. B 121, 10934–10949 (2017).
Article CAS PubMed Google Scholar
Schütt, K. T. et al. SchNet: A continuous-filter convolutional neural network for modeling quantum interactions. Adv. Neural. Inf. Process. Syst. 30, 992–1002 (2017).
Gasteiger, J., Groß, J. & Günnemann, S. Directional message passing for molecular graphs. ICLR (2020).
Zeng, J., Cao, L., Xu, M., Zhu, T. & Zhang, J. Z. Complex reaction processes in combustion unraveled by neural network-based molecular dynamics simulation. Nat. Commun. 11, 5713 (2020).
Article CAS PubMed PubMed Central Google Scholar
Thaler, S., Stupp, M. & Zavadlav, J. Deep coarse-grained potentials via relative entropy minimization. J. Chem. Phys. 157, 244103 (2022).
Article CAS PubMed Google Scholar
Fu, X. et al. Forces are not enough: Benchmark and critical evaluation for machine learning force fields with molecular simulations. AI for Science: Progress and Promises Workshop, NeurIPS (2022).
Musaelian, A. et al. Learning local equivariant representations for large-scale atomistic dynamics. Nat. Commun. 14, 579 (2023).
Article CAS PubMed PubMed Central Google Scholar
Röcken, S. & Zavadlav, J. Accurate machine learning force fields via experimental and simulation data fusion. npj Comput. Mater. 10, 69 (2024).
Vandenhaute, S., Cools-Ceuppens, M., DeKeyser, S., Verstraelen, T. & Speybroeck, V. V. Machine learning potentials for metal-organic frameworks using an incremental learning approach. npj Comput. Mater. 9, 19 (2023).
Ko, T. W., Finkler, J. A., Goedecker, S. & Behler, J. A fourth-generation high-dimensional neural network potential with accurate electrostatics including non-local charge transfer. Nat. Commun. 12, 398 (2021).
Article CAS PubMed PubMed Central Google Scholar
Kosmala, A., Gasteiger, J., Gao, N. & Günnemann, S. Ewald-based long-range message passing for molecular graphs. ICML 724, 17544–17563 (2023).
Coste, A., Slejko, E., Zavadlav, J. & Praprotnik, M. Developing an implicit solvation machine learning model for molecular simulations of ionic media. J. Chem. Theory Comput. 20, 411–420 (2024).
Article CAS PubMed Google Scholar
Sifain, A. E. et al. Discovering a transferable charge assignment model using machine learning. J. Phys. Chem. Lett. 9, 4495–4501 (2018).
Article CAS PubMed Google Scholar
Veit, M., Wilkins, D. M., Yang, Y., DiStasio, R. A. & Ceriotti, M. Predicting molecular dipole moments by combining atomic partial charges and atomic dipoles. J. Chem. Phys. 153, 024113 (2020).
Kaymak, M. C. et al. JAX-ReaxFF: A gradient-based framework for fast optimization of reactive force fields. J. Chem. Theory Comput. 18, 5181–5194 (2022).
Article CAS PubMed Google Scholar
Ghasemi, S. A., Hofstetter, A., Saha, S. & Goedecker, S. Interatomic potentials for ionic systems with density functional accuracy based on charge densities obtained by a neural network. Phys. Rev. B 92, 045131 (2015).
Vondrák, M., Reuter, K. & Margraf, J. T. q-pac: A python package for machine learned charge equilibration models. J. Chem. Phys. 159, 054109 (2023).
Raza, A., Sturluson, A., Simon, C. M. & Fern, X. Message passing neural networks for partial charge assignment to metal-organic frameworks. J. Phys. Chem. C. 124, 19070–19082 (2020).
Article CAS Google Scholar
Altintas, C., Altundal, O. F., Keskin, S. & Yildirim, R. Machine learning meets with metal organic frameworks for gas storage and separation. JCIM 61, 2131–2146 (2021).
CAS Google Scholar
Kancharlapalli, S., Gopalan, A., Haranczyk, M. & Snurr, R. Q. Fast and accurate machine learning strategy for calculating partial atomic charges in metal–organic frameworks. J. Chem. Theory Comput. 17, 3052–3064 (2021).
Article CAS PubMed Google Scholar
Tossou, P., Wognum, C., Craig, M., Mary, H. & Noutahi, E. Real-world molecular out-of-distribution: Specification and investigation. J. Chem. Inf. Model. 64, 697–711 (2024).
Article CAS PubMed PubMed Central Google Scholar
Smith, J. S., Nebgen, B., Lubbers, N., Isayev, O. & Roitberg, A. E. Less is more: Sampling chemical space with active learning. J. Chem. Phys. 148, 241733 (2018).
Article PubMed Google Scholar
Zhang, L., Lin, D.-Y., Wang, H., Car, R. & Weinan, E. Active learning of uniformly accurate interatomic potentials for materials simulation. Phys. Rev. Mater. 3, 023804 (2019).
Article CAS Google Scholar
Hansen, L. & Salamon, P. Neural network ensembles. IEEE Trans. Pattern Anal. Mach. Intell. 12, 993–1001 (1990).
Article Google Scholar
Lakshminarayanan, B., Pritzel, A. & Blundell, C. Simple and scalable predictive uncertainty estimation using deep ensembles. Adv. Neural. Inf. Process. Syst. 6405–6416 (2017).
Welling, M. & Teh, Y. W. Bayesian learning via stochastic gradient langevin dynamics. ICML 681–688 (2011).
Nemeth, C. & Fearnhead, P. Stochastic gradient markov chain monte carlo. J. Am. Stat. Assoc. 116, 433–450 (2021).
Article CAS Google Scholar
Thaler, S., Doehner, G. & Zavadlav, J. Scalable bayesian uncertainty quantification for neural network potentials: Promise and pitfalls. J. Chem. Theory Comput. 19, 4520–4532 (2023).
Article CAS PubMed Google Scholar
Thaler, S., Fuchs, P., Cukarska, A. & Zavadlav, J. Jaxsgmc: Modular stochastic gradient mcmc in jax. SoftwareX 26, 101722 (2024).
Article Google Scholar
Gal, Y. & Ghahramani, Z. Dropout as a Bayesian approximation: Representing model uncertainty in deep learning. PMLR 48, 1050–1059 (2016).
Wen, M. & Tadmor, E. B. Uncertainty quantification in molecular simulations with dropout neural network potentials. npj Comput. Mater. 6, 124 (2020).
Article Google Scholar
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I. & Salakhutdinov, R. Dropout: A simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014).
Google Scholar
Nazarian, D., Camp, J. S. & Sholl, D. S. A comprehensive set of high-quality point charges for simulations of metal-organic frameworks. Chem. Mater. 28, 785–793 (2016).
Article CAS Google Scholar
Jablonka, K. M. Moffragmentor. https://github.com/kjappelbaum/moffragmentor (2024).
Imbalzano, G. et al. Uncertainty estimation for molecular dynamics and sampling. J. Chem. Phys. 154, 074102 (2021).
Article CAS PubMed Google Scholar
Gal, Y., Hron, J. & Kendall, A. Concrete dropout. Adv. Neural. Inf. Process. Syst. 30, 3584–3593 (2017).
Gasteiger, J., Giri, S., Margraf, J. T. & Günnemann, S. Fast and uncertainty-aware directional message passing for non-equilibrium molecules. Machine Learning for Molecules Workshop, NeurIPS (2020).
Ren, E. & Coudert, F.-X. Enhancing gas separation selectivity prediction through geometrical and chemical descriptors. Chem. Mater. 35, 6771–6781 (2023).
Article CAS Google Scholar
Xie, X., Persson, K. A. & Small, D. W. Incorporating electronic information into machine learning potential energy surfaces via approaching the ground-state electronic energy as a function of atom-based electronic populations. J. Chem. Theory Comput. 16, 4256–4270 (2020).
Article CAS PubMed Google Scholar
Christensen, A. S. & von Lilienfeld, O. A. On the role of gradients for machine learning of molecular energies and forces. Mach. Learn. Sci. Technol. 1, 045018 (2020).
Article Google Scholar
Stocker, S., Gasteiger, J., Becker, F., Günnemann, S. & Margraf, J. T. How robust are modern graph neural network potentials in long and hot molecular dynamics simulations? Mach. Learn. Sci. Technol. 3, 045010 (2022).
Article Google Scholar
Korattikara Balan, A., Rathod, V., Murphy, K. P. & Welling, M. Bayesian dark knowledge. Adv. Neural. Inf. Process. Syst. 3438–3446 (2015).
Wang, L. & Yoon, K.-J. Knowledge distillation and student-teacher learning for visual intelligence: A review and new outlooks. IEEE Trans. Pattern Anal. Mach. Intell. 44, 3048–3068 (2021).
Article Google Scholar
Kelvinius, F. E., Georgiev, D., Toshev, A. P. & Gasteiger, J. Accelerating molecular graph neural networks via knowledge distillation. Adv. Neural. Inf. Process. Syst. 36, 25761–25792 (2023).
Deringer, V. L. et al. Gaussian process regression for materials and molecules. Chem. Rev. 121, 10073–10141 (2021).
Article CAS PubMed PubMed Central Google Scholar
Wilson, A. G., Hu, Z., Salakhutdinov, R. R. & Xing, E. P. Stochastic variational deep kernel learning. Adv. Neural. Inf. Process. Syst. 29, 2594–2602 (2016).
Wollschläger, T., Gao, N., Charpentier, B., Ketata, M. A. & Günnemann, S. Uncertainty estimation for molecules: Desiderata and methods. ICML 37133–37156 (2023).
Parkinson, J. & Wang, W. Linear-scaling kernels for protein sequences and small molecules outperform deep learning while providing uncertainty quantitation and improved interpretability. JCIM 63, 4589–4601 (2023).
CAS Google Scholar
Thaler, S. & Zavadlav, J. Learning neural network potentials from experimental data via differentiable trajectory reweighting. Nat. Commun. 12, 6884 (2021).
Article CAS PubMed PubMed Central Google Scholar
Hafner, J. Ab-initio simulations of materials using VASP: Density-functional theory and beyond. J. Comput. Chem. 29, 2044–2078 (2008).
Article CAS PubMed Google Scholar
Manz, T. A. & Sholl, D. S. Improved atoms-in-molecule charge partitioning functional for simultaneously reproducing the electrostatic potential and chemical states in periodic and nonperiodic materials. J. Chem. Theory Comput. 8, 2844–2867 (2012).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We acknowledge support: from (1) the TUM Innovation Network “Artificial Intelligence Powered Multifunctional Material Design” (ARTEMIS) funded through the German Excellence Initiative and the state of Bavaria (2) the “EXC 2089: e-conversion” DFG-cluster of excellence [project number: 390776260], https://www.e-conversion.de.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Professorship of Multiscale Modeling of Fluid Materials, Department of Engineering Physics and Computation, TUM School of Engineering and Design, Technical University of Munich, Munich, Germany
Stephan Thaler & Julija Zavadlav
Valence Labs, Montreal, QC, Canada
Stephan Thaler
Professorship of Simulation of Nanosystems for Energy Conversion, Department of Electrical Engineering, TUM School of Computation, Information and Technology, Technical University of Munich, Munich, Germany
Felix Mayr, Siby Thomas & Alessio Gagliardi
Munich Data Science Institute, Technical University of Munich, Munich, Germany
Alessio Gagliardi & Julija Zavadlav

Authors

Stephan Thaler
View author publications
You can also search for this author in PubMed Google Scholar
Felix Mayr
View author publications
You can also search for this author in PubMed Google Scholar
Siby Thomas
View author publications
You can also search for this author in PubMed Google Scholar
Alessio Gagliardi
View author publications
You can also search for this author in PubMed Google Scholar
Julija Zavadlav
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.T. conceptualization, methodology, AL and GNN software, formal analysis, investigation, visualization, writing—original draft and review. F.M. data curation and analysis, DFT methodology and software automation, visualization, writing—original draft and review. S.T. writing—original draft. A.G. conceptualization, review, supervision, resources. J.Z. conceptualization, writing—review and editing, supervision, resources.

Corresponding author

Correspondence to Julija Zavadlav.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

SUPPLEMENTAL MATERIAL

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Thaler, S., Mayr, F., Thomas, S. et al. Active learning graph neural networks for partial charge prediction of metal-organic frameworks via dropout Monte Carlo. npj Comput Mater 10, 86 (2024). https://doi.org/10.1038/s41524-024-01277-8

Download citation

Received: 17 October 2023
Accepted: 19 April 2024
Published: 03 May 2024
DOI: https://doi.org/10.1038/s41524-024-01277-8