Abstract
Vibrational Circular Dichroism (VCD) spectra often differ strongly from one conformer to another, even within the same absolute configuration of a molecule. Simulated molecular VCD spectra typically require expensive quantum chemical calculations for all conformers to generate a Boltzmann averaged total spectrum. This paper reports whether machine learning (ML) can partly replace these quantum chemical calculations by capturing the intricate connection between a conformer geometry and its VCD spectrum. Three hypotheses concerning the added value of ML are tested. First, it is shown that for a single stereoisomer, ML can predict the VCD spectrum of a conformer from solely the conformer geometry. Second, it is found that the ML approach results in important time savings. Third, the ML model produced is unfortunately hardly transferable from one stereoisomer to another.
Introduction
Chiroptical spectroscopic methods measure the difference in interaction between an optically active compound and left- or right-circularly polarized radiation1,2,3,4. The best known chiroptical method is Electronic Circular Dichroism (ECD), where one measures the difference in absorption of left- and right-handed circularly polarized visible and ultraviolet radiation by an optically active molecule. Vibrational Circular Dichroism (VCD) is an infrared chiroptical method where vibrational transitions are responsible for the difference in absorption. The main advantage of VCD compared to ECD is the richer information obtained from the former due to the much larger number of vibrational transitions compared to the number of accessible electronic transitions. Chiroptical methods find their main area of application in establishing the absolute configuration (AC) of molecules4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25. However, it also reveals a significant amount of information on the conformational properties of a molecule26,27,28,29,30,31,32,33,34,35,36,37,38,39. The link between conformation in the sense of its molecular geometry and its VCD spectrum is not easily established on the basis of e.g., some rules of thumb and one usually relies on the quantum chemically computed spectrum. The usual approach to establishing the AC of a compound is to choose a specific AC of the molecule, find all its conformers on the potential energy hypersurface and their energies and then combine all computed spectra using Boltzmann weighting in a simulated molecular spectrum for the chosen AC1. By repeating all these steps for each possible AC and comparing all computed spectra to the experimental one, one concludes what AC the experimental sample corresponded to. Said computed spectra are usually generated using Density Functional Theory (DFT) calculations requiring sufficient expertize and computational resources. Experience shows that the VCD spectra of individual conformers of the same molecule may differ significantly even if they belong to the same AC (see Supplementary Discussion 1 and Supplementary Figs. 1–3), explaining why rules of thumb cannot be established26,27,28,29,30,31,32,33,34. The first hypothesis tested in this paper is that machine learning (ML) can be used to predict the VCD spectrum for a specific conformer using only its geometry, in this sense providing an alternative to the DFT procedure. The second hypothesis of this paper is that ML may help reduce significantly the total time cost needed to obtain a molecular spectrum. This entails that ML should allow skipping enough time normally spent in DFT calculations to more than compensate for the time it takes to establish the ML model. The third and final issue examined is the extent to which an ML model is transferable from one AC to another. Does it suffice to learn from one AC and use this for all other possible AC’s? For instance, in a molecule with two stereocenters, does it suffice to establish an ML model for RR and to then use it also for RS, SR, and SS?
ML methods are powerful methods for the extraction of complex patterns hidden in spectral data, speeding up conventional workflows, and accelerating computational methods40,41,42,43,44,45,46,47,48,49,50,51. We have recently shown that there is hope that ML can play a role in VCD spectroscopy52. More specifically, we have shown that for a large set of congeneric molecules adopting a single conformer, we can use ML to reveal to what AC a VCD spectrum of an unknown molecule corresponds. Now the ambitions are higher. We want to extract a VCD spectrum solely from a conformer geometry. Figure 1 contrasts the current work against our previous work52 and other recent works53,54,55 that address the link between an AC and an experimental spectrum or property. The present paper concentrates on the link between the structure of a conformer and its VCD spectrum within a given AC of a molecule. Success in establishing this link will then also strongly benefit the usual approach to VCD-based AC assignments as it will allow circumpassing the quantum chemical calculation of all conformer VCD spectra.
To prove or disprove the hypotheses set, a test bench of compounds must be established. Somewhat naively, one could think of any set of compounds for which VCD has been computed and/or measured but this is not useful. We namely wish to be able to control ourselves the degree of conformational flexibility of the molecules and the nature of their intramolecular interactions by changing a number of substituents. At the same time, both effects should not intercorrelate too much. This entails the use of admittedly somewhat peculiar molecules but the priority is given to stepwise understand and prove the hypotheses. This would not be possible using too diverse compounds while at the same time, error cancellation could play a much larger role there. We use a tetra-substituted naphthalene framework whose conformational flexibility and intramolecular interactions (such as hydrogen bonding) we can control by judicious selection of substituents. This allows us to test whether ML is sufficiently reliable over a range of chemical situations.
To be able to impact the conformational flexibility and the degree to which intramolecular interactions play a role without changing too many features simultaneously, we have chosen compounds that have the same backbone. A tetra-substituted naphthalene framework is chosen as backbone. To this substituents containing a chiral center in the R-configuration are added. Changing the substitution pattern enables to control the intramolecular interaction between the substituents. An overview of the compounds considered in this work is provided in Fig. 2. In compounds 1a and 2a, the sidechains and their conformational properties can be expected to be largely independent from each other. For example, steric hindrance is limited thanks to the large distance between the sidechains. Vibrational mode coupling between the sidechain vibrations may still impact the vibrational frequencies and corresponding VCD intensities though. By changing the substitution pattern we impact the conformational freedom through specific intramolecular interactions. Hence, we may introduce steric interactions between the side chains when going from 1a to 1b and hydrogen bonding in going from 2a to 2b. Differences in the performance of the ML procedure can then be attributed to the interactions introduced. The influence of a wider variety in the functional groups present in the side chains, yielding more feature-rich spectra, is examined using compounds 3 and 4. The absence of a C2 axis in addition reveals the impact of the associated symmetry operation.
The obtained excellent quality of the spectral prediction suggests that ML can link the geometry of a conformation to its VCD spectrum. As such, ML can strongly reduce the effort spent in quantum chemically obtaining all VCD spectra provided the ML step has a much lower computational cost. This is indeed shown to be the case. On the other hand, unfortunately, the ML models are not transferable, not even within the same molecule but with different AC.
ML as well as DFT-based prediction of VCD spectra are quite technical fields and every step needs to be very well thought of. Because of the highly technical nature, the precise methodology including all error checks and balances used are given in the methods section and supplementary material. The main lines of the approach taken are:
-
Generate minimum energy conformations using a force field for all compounds in Fig. 2 with chosen AC equal to RRRR
-
Compute DFT geometries and VCD spectra using the B3PW91 functional and 6-31G(d) basis set
-
Establish a training, validation, and test set per compound to train an ML model to extract from solely a conformational geometry the VCD spectrum and test hypothesis 1 (see above)
-
Repeat this for all conformations of a molecule in the chosen AC and establish the time gained by using ML (hypothesis 2)
-
Test the ML model for a different AC of the same molecule or even a different molecule in the same AC or different (hypothesis 3)
Admittedly, in this study, the entire usual approach involving elaborate DFT calculations is also still performed to serve as a comparison basis but the end goal is to strongly reduce the number of these calculations although some will always remain required to train the model.
Results and discussion
For each conformer of each compound in a single, chosen AC, the VCD spectrum is computed using DFT. This basis of spectra is then used in finding the ML model as described in detail in the methods section. For each compound, the ML method is trained to allow the prediction of conformer VCD spectra solely from the geometry of the conformers. In this section, emphasis is placed on the actual results which are discussed in terms of the extent to which they (dis)prove the hypotheses formulated above.
Hypothesis 1: machine learning can predict conformer spectra solely from molecular geometry
The first hypothesis is that ML can learn from a dataset of conformer geometries and their VCD spectra the intricate link between both. To this end, for every compound, a training set of conformers and spectra is established so that an ML model can be obtained. This does -admittedly- mean that it is impossible to completely bypass all DFT calculations but the aim is to be able to limit the number to just enough to train a proper model. For all compounds in Fig. 2, an ML model was trained using different ratios of training, test and validation set, and the hypothesis is examined by looking at the similarity between a DFT predicted conformer spectrum and one obtained using the trained ML model. The results are presented in Fig. 3. For all applications, the molecular geometry is represented using only the sidechain dihedral angles.
As a similarity measure we use the cosine similarity measure Spred which is the normalized overlap between the ML predicted and DFT computed spectrum (see Supplementary Methods 1 and 2 for details on the similarity measures). If it equals 1, the spectra are exactly the same. It can turn negative, meaning that the ML-predicted spectrum would rather agree with the enantiomer of the DFT computed spectrum. This would be detrimental for the use of ML in VCD-based AC assignment and it is gratifying that no conformers appear with negative similarities. Figure 3 are so-called violin plots. The width of the blob at every value of Spred reflects how many conformers are binned within a small interval around that value. How wider the blob the more conformers have an Spred in that bin.
Clearly, the procedure works very well in case of compound 1a. The far majority of conformers comes with values around 0.99 and only a very small tail descends towards circa 0.96. To put this in perspective, the loss in exact similarity is of the order of or even better than the variation in spectrum if one compared DFT spectra for the same conformer obtained using a different basis set or functional. This shows that the ML procedure works very well. Compound 1b is a structural isomer and there the results are somewhat less impressive. A vertically more spread out blob is obtained but the far majority of points still has an impressive similarity above 0.9. Two sets of conformers appear and one might be tempted to interpret this in terms of one collection of conformers with stronger steric hindrance and one with less, but no such connection is found (see Supplementary Discussion 2 and Supplementary Figs. 4 and 5). Compound 2a again shows that the majority of conformers exhibits very good agreement between the ML predicted and actual DFT computed spectrum although the similarities do go down to roughly 0.75. This is still more than sufficient in the context of AC determination56. One could suggest that conformers with higher energy lay lower in similarity, but this is not the case (see Supplementary Discussion 3 and Supplementary Figs. 6–11). For compound 2b, two plots are shown. The first is the result using ML training with only the sidechain dihedral angles as input. Hydrogen bonding is not well represented in this encapsulation of molecular geometry. When additional parameters are included (denoted as hbond, see Fig. 3), the violin plot shifts massively to higher similarities (see Supplementary Discussion 4 and Supplementary Figs. 12 and 13). This means that sufficient attention must be paid to what is a proper representation of a conformer geometry. Compounds 3 and 4 introduce a wider range of substituents and it is clear that the agreement between DFT and ML predicted spectra is very good.
These results reveal that ML does allow to partially replace DFT calculations. Still, for many conformations the VCD spectrum needs to be calculated using DFT as one needs a training set for each compound but once an ML model is available, the spectra of all conformations for which no DFT calculation of the VCD spectrum was performed can be computed from the ML model. A detailed study of what fraction of conformers is required for DFT VCD calculations is given in Supplementary Discussions 5 and 6, Supplementary Figs. 14 and 15, and Supplementary Table 1.
Hypothesis 2: machine learning can significantly reduce the computational cost for AC assignment
From a practical perspective, the scientifically already valuable results above, suggest that one could significantly reduce the effort to assign an AC to an experimental sample. In practice, assigning the AC of an experimental sample requires elaborate DFT calculations for all conformers in each possible AC, composing a Boltzmann averaged VCD spectrum and comparing it to an experimental measurement. Even if for the moment, we assume that a separate ML model needs to be trained for every assumed AC, there may be a significant time gain due to the use of ML. The most time consuming part in the usual approach lies in computing the VCD spectra, much less in the geometry optimization so for now we take for granted that the geometries and Boltzmann weights are DFT computed. One could envision to also skip the step of geometry optimization and use only geometries and energies from a force field calculation but this is subject of future work. At this exploratory stage, it is important not to reach too far in ambitions to avoid conclusions could be based on partial error cancellation.
Table 1 shows the total time cost for all compounds to compute a Boltzmann weighted VCD spectrum using the classical approach and using one where part of the DFT VCD calculations are replaced by the ML-based prediction. To allow for a fair comparison, the time spent to train the ML model is also reported. The data in Table 1 is obtained using a very large fraction of DFT conformer VCD spectra (DFT spectra computed for 80% of all conformers). As the ML training step can be done quite efficiently, the relative time savings are mostly limited by the time spent in computing DFT spectra to generate the ML model. Nonetheless, significant computer time is already being saved compared to the classical approach.
Computing DFT spectra for 80% of all conformers of course limits the possible time gain with ML. Hence, we also investigate the additional time savings possible if the ML model is generated using a smaller percentage of DFT computed spectra. Using fewer DFT spectra may adversely affect the similarity between the Boltzmann weighted spectrum composed with the DFT spectra of all conformers and one based on a combination of DFT spectra and ML predictions. Figure 4 shows the relative speedup for the Boltzmann weighted spectrum as a function of the percentage of DFT computed spectra, along with the similarity of the Boltzmann weighted spectrum and the one based on the DFT spectra of all conformers, for compound 4. It is found that one can strongly reduce the percentage of DFT computed spectra without significantly affecting the resulting spectrum in the sense that the similarity to the spectrum composed with all DFT conformer spectra remains very high. At a similarity of 0.95, all details of the spectrum are still reproduced. With roughly 15% of the conformer spectra computed with DFT and used to generate the ML model, the ML-aided approach allows to retain a similarity above the threshold of 0.95 while providing a speedup with a factor of 6.6.
Similar speedup values as reported for compound 4 are also found for the other model compounds. The Boltzmann weighted spectra obtained for each compound with this approach, along with the associated speedup and similarity, are given in Supplementary Discussions 7–8 and Supplementary Figures 16–18.
Hypothesis 3: machine learning can generate transferable models
All of the above is based on individually training an ML model for a specific AC of a specific molecule. The gratifying time savings reported above could be very strongly boosted if learning an ML model for a single AC would lead to a model that can also be used for a different stereoisomer. To test this we took compound 4 where ML works excellently for a single AC (see hypothesis 1 and Fig. 3). We then switched the AC of compound 4 to both an epimer and the enantiomer, and used the ML model generated for compound 4 to predict conformer spectra for both. The results are presented in Fig. 5a. The predictions for the new stereoisomers unfortunately do not resemble the DFT conformer spectra. The ML model is far from transferable to other AC’s, especially if the conformer spectra differ significantly from the original AC. In an attempt to remedy this, one could suggest to train for multiple stereoisomers in one run. With this approach the conformer spectra of each stereoisomer are obtained with the same accuracy as for a single AC (Fig. 5b). With the current approach, the ML model can only faithfully reproduce spectra for stereoisomers it has been trained on.
Figure 5 also reveals a particular feature. DFT spectra are always only computed for one enantiomer of the set of enantiomers as spectra of enantiomers are mirror images. It is clear that this was not picked up when training on only one of both enantiomers. When training on both enantiomers, the question is whether enough information was sourced such that for the two mirror images of the same conformation also a mirror image spectrum is obtained from the ML model. This is indeed the case as is shown in Fig. 6 where in panel a the spectra of an enantiomeric pair of conformers is compared when only one AC was used in training. In Fig. 6b, the result is shown when both AC are included in training.
Conclusions
The potential of ML in VCD spectroscopy to (partially) replace DFT calculations was examined. Three hypotheses have been put forward, leading to the following conclusions:
-
Hypothesis 1: machine learning can predict conformer spectra solely from molecular geometries. The similarity between the DFT computed spectrum of a conformer and the spectrum predicted with ML from its geometry is very high. ML can indeed learn the intricate and hidden connection between a conformer geometry and its VCD spectrum. Though, it is up to the user to make sure that the representation of the geometry in a practical form encapsulates all the necessary input to cover intramolecular interactions.
-
Hypothesis 2: machine learning can significantly reduce the computational cost of AC assignment. The present results show that the ML training step may be done quite efficiently and as a result significant time savings are possible. Obviously, it remains up to the user to determine whether the time savings compensate for the learning curve associated with proper training in ML methods.
-
Hypothesis 3: machine learning can generate transferable models. The current design architecture does not result in transferable ML models, neither between molecules nor among different AC’s of the same molecule.
The current ML approach already satisfies 2 out of 3 hypotheses. Clearly, more development on the ML methodology is still needed to satisfy hypothesis 3. Nonetheless, ML shows promise as a tool for extracting the link between conformations and VCD spectra.
Methods
Conformational analysis and VCD DFT calculations
VCD spectra are very conformation dependent and so a molecular VCD spectrum for a chosen AC is composed of a set of conformer VCD spectra each weighted with their Boltzmann weight. Hence, to compute a proper VCD spectrum that takes into account all conformers and their Boltzmann weights, it is necessary to thoroughly sample the conformer ensemble within the chosen AC. The conformer geometries and VCD spectra also constitute the input for an ML model. In order to provide the model with an as diverse input as possible both low-energy and a substantial number of higher energy conformers are generated using a force-field-based conformer generation algorithm. The geometry of the conformers is then optimized further using DFT and VCD spectra are calculated for each conformer. The details of each step are provided below.
-
Conformer generation: A set of conformers is generated using the GMMX routine57, which implements a stochastic search mechanism. Conformational energies are computed with the MMFF9458 force field as implemented in PcModel1059. During the stochastic search, a cut-off on the energy of the conformers equal to 40 kcal mol−1 is used. In practice, the generated conformers spread over a smaller range of force field energies. The high cut-off does not mean that we expect the high energy conformers to significantly impact the Boltzmann averaged spectrum but it may add to the diversity of the input for the ML stage. Second, experience shows that some interactions are not well handled at the force field stage and conformer energies may change significantly when moving to the DFT level.
-
Geometry optimization and VCD spectrum generation: For each conformer, the geometry is optimized further and the VCD line spectrum is computed with the B3PW9160 functional, the 6-31G(d) basis set and assuming the rigid rotor, ideal gas, and harmonic approximation. These calculations are performed using Gaussian1661.
-
Spectrum broadening and representation: The computed conformer spectra are broadened using a Lorentzian band shape with a full width at half maximum (FWHM) of 10 cm−1. The spectra were represented as vectors containing the molar absorbance difference \({{\Delta }}\epsilon (\tilde{\nu })={\epsilon }_{L}(\tilde{\nu })-{\epsilon }_{R}(\tilde{\nu })\) for wavenumbers \(\tilde{\nu }\) ranging from 800 cm−1 to 1800 cm−1 using a sampling interval equal to the FWHM (10 cm−1), so a 101-dimensional vector.
The distribution of the conformer DFT energies is discussed in Supplementary Methods 3 and Supplementary Fig. 20.
ML model architecture, training, and optimization
A fully connected Feedforward Neural Network (FNN) is used in this work to extract the link between conformer geometries and their corresponding VCD spectra. The input is a vector containing molecular features describing the geometry of the conformer (for full description see section ‘Molecular representation’) and the output is the 101-dimensional vector representing its VCD spectrum. Layers of artificial neurons, so-called hidden layers, are inserted between the input and output layer. During training, the connections between the neurons establish the link between the VCD spectrum and the conformer geometry. An illustration of an FNN with two hidden layers is shown in Fig. 7. Training a single FNN to predict VCD intensities for multiple \(\tilde{\nu }\) simultaneously, improves the generalizability62,63 of the connections between the layers. The set of conformers for a specific AC of a single molecule is split randomly into three sets: a training, validation and test set. As mentioned earlier, the connections between the neurons are extracted from the training set. The validation set is used to optimize the so-called hyperparameters of the FNN such as its size and the algorithm used for training. The test set provides a final test of how well the FNN can predict the spectra of new conformers. Initially a default 80%:10%:10% (training:validation:test) split is used. Results of the ML approach for different splits are reported in Supplementary Discussion 5.
More technical details of the model are:
-
Hyperparameter Optimization: for every application of the model the hyperparameters are optimized using Bayesian optimization. Here, a tree-structured parzen estimator optimizes the hyperparameters within the search space shown in Table 264. By reoptimizing the model for every application (such as compound, representation, or training set size) we prevent data leaking from previous applications to the current model. The Bayesian optimization was implemented with Hyperopt 0.2.565.
-
Dropout/Batch Normalization: during the Bayesian optimization the tree-structured parzen estimator can choose to introduce Dropout66 for the hidden layers or batch normalization67 layers to reduce overfitting.
-
Loss function: the model is trained with the mean squared error as loss function. The exact implementation of the metric is explained in Supplementary Methods 1-2.
All models are built and trained on a Xeon E5-2680v4 processor using Tensorflow 2.2.068.
Molecular representation
The ML model is trained to predict the VCD conformer spectra from the conformer geometries of each molecule in turn and for a chosen AC. Ideally, a minimal set of intramolecular coordinates that fully describes the conformation is chosen as input for the model. For each of the six compounds, the major differences between conformers of the same compound lie in the geometrical arrangement of the sidechains. Therefore, the conformer geometry is presented to the ML model as the set of dihedral angles in the sidechains shown in Fig. 8. Throughout this work we will refer to this set of dihedral angles as representation A. We expect this representation to capture most of the conformational flexibility. If the model cannot fully capture the link between conformer and spectrum with representation A, other geometric parameters are added to the representation and their influence is discussed.
Conformers of compounds 1a/1b/2a/2b lacking a C2 axis can arise in two different ways by rotating the sidechains internally, resulting in degenerate conformations which share the same VCD spectrum but are presented to the ML model as different conformations with this representation. Hence, we will teach the model that the predicted spectra need to be the same for both by explicitly including both members of such pairs.
Data availability
The conformer dataset that supports the findings of this study is openly available at https://doi.org/10.5281/zenodo.8009874. The technical approach and choices are described in full detail in the supplementary material.
References
Nafie, L. A. Vibrational Optical Activity: Principles and Applications (Wiley, 2011).
Kobayashi, N. & Muranaka, A. Circular Dichroism and Magnetic Circular Dichroism Spectroscopy for Organic Chemists (The Royal Society of Chemistry, 2012).
Stephens, P. & Devlin, F. Determination of the structure of chiral molecules using ab initio vibrational circular dichroism spectroscopy. Chirality 12, 172–179 (2000).
Batista Jr, J. M., Blanch, E. W. & Bolzani, Vd. S. Recent advances in the use of vibrational chiroptical spectroscopic methods for stereochemical characterization of natural products. Nat. Prod. Rep. 32, 1280–1302 (2015).
Merten, C., Golub, T. P. & Kreienborg, N. M. Absolute configurations of synthetic molecular scaffolds from vibrational cd spectroscopy. J. Org. Chem. 84, 8797–8814 (2019).
Sherer, E. C. et al. Systematic approach to conformational sampling for assigning absolute configuration using vibrational circular dichroism. J. Med. Chem. 57, 477–494 (2014).
Bogaerts, J. et al. A combined raman optical activity and vibrational circular dichroism study on artemisinin-type products. Phys. Chem. Chem. Phys. 22, 18014–18024 (2020).
Rossi, D. et al. The role of chirality in a set of key intermediates of pharmaceutical interest, 3-aryl-substituted-γ-butyrolactones, evidenced by chiral hplc separation and by chiroptical spectroscopies. J. Pharm. Biomed. 144, 41–51 (2017).
Zhang, Y. et al. Ir and vibrational circular dichroism spectroscopy of matrine- and artemisinin-type herbal products: Stereochemical characterization and solvent effects. J. Nat. Prod. 79, 1012–1023 (2016).
Górecki, M. A configurational and conformational study of (-)-oseltamivir using a multi-chiroptical approach. Org. Biomol. Chem. 13, 2999–3010 (2015).
Santoro, E. et al. Absolute configurations of phytotoxins seiricardine a and inuloxin a obtained by chiroptical studies. Phytochemistry 116, 359–366 (2015).
Qiu, S. et al. Stereochemistry of the tadalafil diastereoisomers: a critical assessment of vibrational circular dichroism, electronic circular dichroism, and optical rotatory dispersion. J. Med. Chem. 56, 8903–8914 (2013).
Pivonka, D. E. & Wesolowski, S. S. Vibrational circular dichroism (vcd) chiral assignment of atropisomers: Application to γ-aminobutyric acid (gaba) modulators designed as potential anxiolytic drugs. Appl. Spectrosc. 67, 365–370 (2013).
Wesolowski, S. S. & Pivonka, D. E. A rapid alternative to x-ray crystallography for chiral determination: Case studies of vibrational circular dichroism (vcd) to advance drug discovery projects. Bioorg. Med. Chem. Lett. 23, 4019–4025 (2013).
Shen, J. et al. Enantiomeric characterization and structure elucidation of otamixaban. J. Pharm. Anal. 4, 197–204 (2014).
Abbate, S., Longhi, G., Lebon, F. & Tommasini, M. Electronic and vibrational circular dichroism spectra of (r)-(-)-apomorphine. Chem. Phys. 405, 197–205 (2012).
Vanthuyne, N. et al. Determination of the absolute configuration of 1,3,5-triphenyl-4,5-dihydropyrazole enantiomers by a combination of vcd, ecd measurements, and theoretical calculations. Tetrahedron Asymmetry 22, 1120–1124 (2011).
Stephens, P. J., Pan, J. J., Devlin, F. J., Krohn, K. & Kurtán, T. Determination of the absolute configurations of natural products via density functional theory calculations of vibrational circular dichroism, electronic circular dichroism, and optical rotation: the iridoids plumericin and isoplumericin. J. Org. Chem. 72, 3521–3536 (2007).
Caldas, L. A. et al. Sesquiterpene lactones from calea pinnatifida: absolute configuration and structural requirements for antitumor activity. Molecules 25, 3005 (2020).
Knippen, K. et al. Cfa-18: a homochiral metal-organic framework (mof) constructed from rigid enantiopure bistriazolate linker molecules. Dalton Trans. 49, 15758–15768 (2020).
Wang, Z.-Q. et al. Determination of absolute configuration of an isopimarane-type diterpenoid by experimental and theoretical electronic circular dichroism and vibrational circular dichroism. J. Mol. Struct. 1146, 484–489 (2017).
Kong, J. et al. Absolute configuration assignment of (+)-fluralaner using vibrational circular dichroism. Chirality 29, 854–864 (2017).
Aparicio-Cuevas, M. A. et al. Dioxomorpholines and derivatives from a marine-facultative aspergillus species. J. Nat. Prod. 80, 2311–2318 (2017).
Mazzeo, G. et al. Absolute configurations of fungal and plant metabolites by chiroptical methods. ord, ecd, and vcd studies on phyllostin, scytolide, and oxysporone. J. Nat. Prod. 76, 588–599 (2013).
Pardo-Novoa, J. C. et al. Absolute configuration of menthene derivatives by vibrational circular dichroism. J. Nat. Prod. 79, 2570–2579 (2016).
Demarque, D. P. & Merten, C. Intra- versus intermolecular hydrogen bonding: solvent-dependent conformational preferences of a common supramolecular binding motif from 1h nmr and vibrational circular dichroism spectra. Chem. Eur. J. 23, 17915–17922 (2017).
Demarque, D. P., Heinrich, S., Schulz, F. & Merten, C. Sensitivity of vcd spectroscopy for small structural and stereochemical changes of macrolide antibiotics. Chem. Commun. 56, 10926–10929 (2020).
Demarque, D. P., Kemper, M. & Merten, C. Vcd spectroscopy reveals that a water molecule determines the conformation of azithromycin in solution. Chem. Commun. 57, 4031–4034 (2021).
Fagan, P. et al. Cocaine hydrochloride structure in solution revealed by three chiroptical methods. ChemPhysChem 18, 2258–2265 (2017).
Králík, F., Fagan, P., Kuchar, M. & Setnička, V. Structure of heroin in a solution revealed by chiroptical spectroscopy. Chirality 32, 854–865 (2020).
Vermeyen, T. & Merten, C. Solvation and the secondary structure of a proline-containing dipeptide: insights from VCD spectroscopy. Phys. Chem. Chem. Phys. 22, 15640–15648 (2020).
Poopari, M. R., Dezhahang, Z. & Xu, Y. Identifying dominant conformations of n-acetyl-l-cysteine methyl ester and n-acetyl-l-cysteine in water: Vcd signatures of the amide i and the co stretching bands. Spectrochim. Acta A Mol. Biomol. Spectrosc. 136, 131–140 (2015).
Eikås, K. D. R., Beerepoot, M. T. P. & Ruud, K. A computational protocol for vibrational circular dichroism spectra of cyclic oligopeptides. J. Phys. Chem. A 126, 5458–5471 (2022).
Légrády, B., Vass, E. & Tarczay, G. Matrix-isolation vibrational circular dichroism spectroscopy in structural studies of peptides: Conformational landscape of the ac(-ala)1-4-ome depsipeptide series. J. Mol. Spectrosc. 351, 29–38 (2018).
Ma, S. et al. Vibrational circular dichroism shows unusual sensitivity to protein fibril formation and development in solution. J. Am. Chem. Soc. 129, 12364–12365 (2007).
Keiderling, T. A. Structure of condensed phase peptides: insights from vibrational circular dichroism and raman optical activity techniques. Chem. Rev. 120, 3381–3419 (2020).
Hongen, T., Taniguchi, T., Nomura, S., Kadokawa, J.-I. & Monde, K. In depth study on solution-state structure of poly(lactic acid) by vibrational circular dichroism. Macromolecules 47, 5313–5319 (2014).
Ho, R.-M. et al. Transfer of chirality from molecule to phase in self-assembled chiral block copolymers. J. Am. Chem. Soc. 134, 10974–10986 (2012).
Kessler, J., Andrushchenko, V., Kapitán, J., & Bouř, P. Insight into vibrational circular dichroism of proteins by density functional modeling. Phys. Chem. Chem. Phys. 20, 4926–4935 (2018).
Zhao, L. et al. Accurate machine learning prediction of protein circular dichroism spectra with embedded density descriptors. JACS Au 1, 2377–2384 (2021).
Sun, C. et al. Machine learning allows calibration models to predict trace element concentration in soils with generalized LIBS spectra. Sci. Rep. 9, 11363 (2019).
Meiler, J., Meusinger, R. & Will, M. Fast determination of 13c nmr chemical shifts using artificial neural networks. J. Chem. Inf. Comput. Sci. 40, 1169–1176 (2000).
Ye, S. et al. A machine learning protocol for predicting protein infrared spectra. J. Am. Chem. Soc. 142, 19071–19077 (2020).
Mamede, R., Pereira, F. & de Sousa, J. A. Machine learning prediction of UV-VIS spectra features of organic compounds related to photoreactive potential. Sci. Rep. 11, 23720 (2021).
Jonas, E. & Kuhn, S. Rapid prediction of nmr spectral properties with quantified uncertainty. J. Cheminform. 11, 50 (2019).
Ghosh, K. et al. Deep learning spectroscopy: neural networks for molecular excitation spectra. Adv. Sci. 6, 1801367 (2019).
Fine, J. A., Rajasekar, A. A., Jethava, K. P. & Chopra, G. Spectral deep learning for prediction and prospective validation of functional groups. Chem. Sci. 11, 4618–4630 (2020).
Kovács, P., Zhu, X., Carrete, J., Madsen, G. & Wang, Z. Machine-learning prediction of infrared spectra of interstellar polycyclic aromatic hydrocarbons. Astrophys. J. 902, 100 (2020).
McCann, M. et al. Neural network analyses of infrared spectra for classifying cell wall architectures. Plant Physiol. 143, 1314–26 (2007).
da Silva, V. H., Murphy, F., Amigo, J. M., Stedmon, C. & Strand, J. Classification and quantification of microplastics ( < 100 μm) using a focal plane array-fourier transform infrared imaging system and machine learning. Anal. Chem. 92, 13724–13733 (2020).
Tanabe, K. et al. Identification of chemical structures from infrared spectra by using neural networks. Appl. Spectrosc. 55, 1394–1403 (2001).
Vermeyen, T. et al. Exploring machine learning methods for absolute configuration determination with vibrational circular dichroism. Phys. Chem. Chem. Phys. 23, 19781–19789 (2021).
Mamede, R., de Almeida, B. S. O., Chen, M., Zhang, Q. & Aires-de Sousa, J. Machine learning classification of one-chiral-center organic molecules according to optical rotation. J. Chem. Inf. Model. 61, 67–75 (2021).
Adams, K., Pattanaik, L. & Coley, C. W. Learning 3d representations of molecular chirality with invariance to bond rotations. arXiv https://doi.org/10.48550/arXiv.2110.04383 (2021).
Ganea, O.-E. et al. Geomol: torsional geometric generation of molecular 3d conformer ensembles. arXiv https://doi.org/10.48550/arXiv.2106.07802 (2021).
Debie, E. et al. A confidence level algorithm for the determination of absolute configuration using vibrational circular dichroism or raman optical activity. ChemPhysChem 12, 1542–1549 (2011).
Gilbert, K. E. Gmmx (version 1.5). Serena Software Bloomington IN (2011).
Halgren, T. A. Merck molecular force field. i. basis, form, scope, parameterization, and performance of mmff94. J. Comput. Chem. 17, 490–519 (1996).
Gilbert, K. E. Pcmodel (version 10.0). Serena Software Bloomington IN (2013).
Becke, A. D. Density functional thermochemistry. iii. the role of exact exchange. J. Chem. Phys. 98, 5648–5652 (1993).
Frisch, M. J. et al. Gaussian 16 Revision C.01 (Gaussian Inc., 2016).
Xu, D. et al. Survey on multi-output learning. IEEE Trans. Neural Netw. Learn. Syst. 31, 2409–2429 (2020).
Caruana, R. Multitask learning. Mach. Learn. 28, 41–75 (1997).
Bergstra, J., Bardenet, R., Bengio, Y. & Kégl, B. Algorithms for hyper-parameter optimization. In Proceedings of the 24th International Conference on Neural Information Processing Systems, NIPS’11, 2546-2554 (Curran Associates Inc., 2011).
Bergstra, J., Yamins, D. & Cox, D. Making a science of model search: Hyperparameter optimization in hundreds of dimensions for vision architectures. In Dasgupta, S. & McAllester, D. (eds.) Proceedings of the 30th International Conference on Machine Learning, vol. 28 of Proceedings of Machine Learning Research, 115–123 (PMLR, 2013).
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I. & Salakhutdinov, R. Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014).
Ioffe, S. & Szegedy, C. Batch normalization: accelerating deep network training by reducing internal covariate shift. In Bach, F. & Blei, D. (eds.) Proceedings of the 32nd International Conference on Machine Learning, vol. 37 of Proceedings of Machine Learning Research, 448–456 (PMLR, 2015).
Abadi, M. et al. TensorFlow: Large-scale Machine Learning On Heterogeneous Systems (2015). https://www.tensorflow.org/.
Acknowledgements
This work has been funded by the Fund for Scientific Research-Flanders (FWO-Vlaanderen; grant number 1160419N). The Flemish Supercomputer Centre (VSC) is acknowledged for providing computational resources and support. The University of Antwerp (BOF-NOI) is acknowledged for the pre-doctoral scholarship of TV. We thank Christian Johannessen for proofreading the paper.
Author information
Authors and Affiliations
Contributions
T.V. performed the DFT calculations, ML model training/optimization and analysis of the results. T.V., A.C., P.B., and W.H. reviewed the analysis and edited the manuscript. T.V. wrote the initial draft of the manuscript. P.B. and W.H. co-supervised the project.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Communications Chemistry thanks the anonymous reviewers for their contribution to the peer review of this work.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Vermeyen, T., Cunha, A., Bultinck, P. et al. Impact of conformation and intramolecular interactions on vibrational circular dichroism spectra identified with machine learning. Commun Chem 6, 148 (2023). https://doi.org/10.1038/s42004-023-00944-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s42004-023-00944-z
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.