Scholarly Works (8 results)

Thesis
Peer Reviewed

Geometric Learning for Quantum-Informed, Machine Learning and Analysis of Electrostatic Preorganization

Vargas, Santiago
Advisor(s): Alexandrova, Anastassia N

UCLA Electronic Theses and Dissertations (2024)

This thesis is organized in a slightly unconventional fashion: algorithms lead and applications fill out the content. I think this emphasizes my interests during graduate school: I built algorithms and tools to address issues that were otherwise inaccessible in different areas of computational chemistry (including applied machine learning) and enzymology. Two scientific thrusts underscore the bulk of my work: algorithms to analyze dynamic, heterogeneous fields in the context of enzymology, and flexible machine learning algorithms, including those that leverage quantum descriptors, for rigorous prediction of molecular and reaction-level properties. Each section also includes grounding on applications and broader impacts for the reader. Below, I discuss the main thrusts and briefly outline each chapter.

General ML and Quantum Theory of Atoms-in-Molecules (QTAIM): QTAIM serves as a mathematical decomposition algorithm for electronic basins within a molecule. The algorithm takes in molecular densities, typically computed by density functional theory (DFT), and uses the flux of the density to partition the scalar field into 3-dimensional regions [14, 16]. These regions are known as atomic basins and represent the quantum atom within a molecule. By constructing these structures, we compute a rich set of mathematical descriptors that map to many features, including energies, bonding, and electron delocalization. These features have been correlated, in the past, to activation energies, reactivity, and overall system energies, but these uses largely relied on human intervention and small datasets [44, 62, 65, 111, 142, 287]. By developing software centered around high-throughput QTAIM calculations and machine learning, I was able to bring these descriptors to larger datasets and a wide host of applications. In Chapter 2, I discuss an algorithm I implemented to predict Diels-Alder reaction barriers from QTAIM signatures alone. In this study, we showed that QTAIM features can be used to predict reaction barriers, and we used machine learning techniques to understand which signatures were most informative to our models. Here, QTAIM electrostatic potentials and delocalization indices alone yielded strong performance on withheld datasets. In addition, we demonstrated that QTAIM features can allow a machine learning model to generalize, to an extent, to much larger Diels-Alder reactions. This chapter was adapted from the following: Machine Learning to Predict Diels–Alder Reaction Barriers from the Reactant State Electron Density. S. Vargas*, M. Hannefarth, Z. Liu, A.N. Alexandrova. Journal of Chemical Theory and Computation 2021 17 (10), 6203-6213. DOI: 10.1021/acs.jctc.1c00623.
In Chapter 3, I discuss a package developed to perform high-throughput QTAIM calculations on datasets of molecules and reactions. This package is currently adapted to work with open-source packages such as ORCA and Multiwfn. These programs, respectively, compute DFT densities at a user-specified level of theory and then compute QTAIM descriptors from those densities. The package is built with high-performance computing (HPC) in mind, as it can operate on a single dataset with an arbitrary number of concurrent jobs. Here I also used the package to compute QTAIM values for a diverse set of important and difficult datasets and developed graph neural networks that predict molecular and reaction properties using QTAIM descriptors as inputs. This chapter was adapted from the following: High-throughput quantum theory of atoms in molecules (QTAIM) for geometric deep learning of molecular and reaction properties. Santiago Vargas, Winston Gee, and Anastassia N. Alexandrova. Digital Discovery 2024 3, 987-998.

Advancing Analysis of Electric Fields in Proteins: The later chapters follow our work in developing algorithms to ingest, interpret, and predict on electric fields in protein active sites. This work builds on the notion of electrostatic preorganization, a theory that posits that protein scaffolds arrange to electrostatically catalyze chemical reactions, thereby destabilizing reactants while lowering transition state energies [299, 301]. Chapter 4 describes an extensive effort to apply heterogeneous electric field analysis to understand a protoglobin directed evolution (DE) trajectory. Previous DE efforts optimized protoglobin to efficiently catalyze carbene transfer reactions. We show that traditional explanations for increased catalytic activity across the DE lineage, substrate access and binding, cannot account for the dramatic improvements in protein activity. By tracking the 3-D electric field and using clustering algorithms, we pinpoint representative structures for QM/MM calculations and show that changes in the electric field, along DE, improve carbene transfer reactivity. These findings highlight the role that electrostatic organization, notably its dynamic effect, has in determining protein function and point to its future importance in designing proteins for relevant chemical processes. This chapter is adapted from Directed Evolution of Protoglobin Optimizes the Enzyme Electric Field. Shobhit S. Chaturvedi, Santiago Vargas, Pujan Ajmera, and Anastassia N. Alexandrova. Journal of the American Chemical Society 2024 146 (24), 16670-16680. DOI: 10.1021/jacs.4c03914. In Chapter 5, I introduce a machine learning framework designed to predict enzyme functionality directly from the heterogeneous electric fields applied to protein active sites. We apply this method to a dataset of heme-iron oxidoreductases. Previous approaches here, which focused on simple point electric fields along the Fe-O bond, are insufficient for reasonable accuracy. On the other hand, our 3-D, heterogeneous model can accurately predict protein activity without relying on additional protein-specific information. In addition, feature selection elucidates which electric field components most inform our models and thus highlights the components important to reactivity and selectivity. Finally, we apply the previously mentioned electric field clustering algorithms and QM/MM calculations to reveal how dynamic complexities in protein structures can complicate predictions, which provides a path forward for improved models in this space. This chapter is adapted from Machine-learning prediction of protein function from the portrait of its intramolecular electric field. S. Vargas*, S. Chaturvedi, A.N. Alexandrova. (Accepted, Journal of the American Chemical Society)

Article
Peer Reviewed

Correction to Machine Learning to Predict Diels–Alder Reaction Barriers from the Reactant State Electron Density

UCLA Previously Published Works (2022)

Article
Peer Reviewed

Thermodynamic Equilibrium versus Kinetic Trapping: Thermalization of Cluster Catalyst Ensembles Can Extend Beyond Reaction Time Scales

UCLA Previously Published Works (2024)

Creative Commons 'BY' version 4.0 license

Article
Peer Reviewed

Directed Evolution of Protoglobin Optimizes the Enzyme Electric Field

UCLA Previously Published Works (2024)

To unravel why computational design fails in creating viable enzymes, while directed evolution (DE) succeeds, our research delves into the laboratory evolution of protoglobin. DE has adapted this protein to efficiently catalyze carbene transfer reactions. We show that the previously proposed enhanced substrate access and binding alone cannot account for increased yields during DE. The 3D electric field in the entire active site is tracked through protein dynamics, clustered using the affinity propagation algorithm, and subjected to principal component analysis. This analysis reveals notable changes in the electric field with DE, where distinct field topologies influence transition state energetics and mechanism. A chemically meaningful field component emerges and takes the lead during DE and facilitates crossing the barrier to carbene transfer. Our findings underscore the influence of intrinsic electric field dynamics on enzyme function, the ability of the field to switch mechanisms within the same protein, and the crucial role of the field in enzyme design.
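As a rough illustration of what "the 3D electric field in the active site" means computationally, the sketch below evaluates the classical Coulomb field of a set of point charges on a small cubic grid. It is a minimal sketch, not the authors' pipeline: the function names `electric_field` and `field_on_grid`, the grid layout, and the choice of units (elementary charges, Angstrom, V/Angstrom) are assumptions made for illustration, and the clustering and principal component analysis steps mentioned in the abstract are not reproduced here.

```python
import math

# Coulomb constant e / (4 * pi * eps0), in V * Angstrom per elementary charge.
COULOMB_V_ANGSTROM = 14.3996


def electric_field(point, charges):
    """Return the field (Ex, Ey, Ez) in V/Angstrom at `point`.

    `charges` is a list of (q, (x, y, z)) pairs, with q in elementary
    charges and positions in Angstrom (hypothetical input layout).
    """
    ex = ey = ez = 0.0
    for q, (x, y, z) in charges:
        dx, dy, dz = point[0] - x, point[1] - y, point[2] - z
        r = math.sqrt(dx * dx + dy * dy + dz * dz)
        if r == 0.0:
            raise ValueError("field is undefined at a charge position")
        scale = COULOMB_V_ANGSTROM * q / r**3
        ex += scale * dx
        ey += scale * dy
        ez += scale * dz
    return (ex, ey, ez)


def field_on_grid(charges, center, half_width, n):
    """Sample the field on an n x n x n cubic grid centered at `center`.

    Returns a list of (point, field) pairs; n must be at least 2.
    """
    if n < 2:
        raise ValueError("n must be at least 2")
    step = 2.0 * half_width / (n - 1)
    samples = []
    for i in range(n):
        for j in range(n):
            for k in range(n):
                point = (
                    center[0] - half_width + i * step,
                    center[1] - half_width + j * step,
                    center[2] - half_width + k * step,
                )
                samples.append((point, electric_field(point, charges)))
    return samples
```

In practice, one such grid of field vectors would be computed per simulation frame, and the flattened grids would then serve as the feature vectors passed to clustering or dimensionality reduction.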

Creative Commons 'BY-NC' version 4.0 license

Article
Peer Reviewed

Machine Learning to Predict Diels–Alder Reaction Barriers from the Reactant State Electron Density

UCLA Previously Published Works (2021)

Reaction barriers are key to our understanding of chemical reactivity and catalysis. Certain reactions are so seminal in chemistry that countless variants, with or without catalysts, have been studied, and their barriers have been computed or measured experimentally. This wealth of data represents a perfect opportunity to leverage machine learning models, which could quickly predict barriers without explicit calculations or measurement. Here, we show that the topological descriptors of the quantum mechanical charge density in the reactant state constitute a set that is both rigorous and continuous and can be used effectively for the prediction of reaction barrier energies to a high degree of accuracy. We demonstrate this on the Diels-Alder reaction, highly important in biology and medicinal chemistry, and as such, studied extensively. This reaction exhibits a range of barriers as large as 270 kJ/mol. While we trained our single-objective supervised (labeled) regression algorithms on simpler Diels-Alder reactions in solution, they also predict reaction barriers in significantly more complicated contexts, such as a Diels-Alder reaction catalyzed by an artificial enzyme and its evolved variants, in agreement with experimental changes in k_cat. We expect this tool to apply broadly to a variety of reactions in solution or in the presence of a catalyst, for screening and circumventing heavily involved computations or experiments.
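To make the link between a predicted barrier and a change in k_cat concrete, the sketch below uses the Eyring equation from transition state theory, k = (k_B T / h) exp(-ΔG‡ / RT). This is an assumption introduced for illustration (the abstract does not say how barriers were compared with k_cat), the transmission coefficient is taken as 1, and the function names `eyring_rate` and `rate_ratio` are hypothetical.

```python
import math

K_B = 1.380649e-23   # Boltzmann constant, J/K
H = 6.62607015e-34   # Planck constant, J*s
R = 8.314462618      # gas constant, J/(mol*K)


def eyring_rate(barrier_kj_mol, temperature_k=298.15):
    """Rate constant in 1/s for a free-energy barrier given in kJ/mol.

    Assumes the Eyring equation with a transmission coefficient of 1.
    """
    prefactor = K_B * temperature_k / H
    return prefactor * math.exp(-barrier_kj_mol * 1000.0 / (R * temperature_k))


def rate_ratio(barrier_a_kj_mol, barrier_b_kj_mol, temperature_k=298.15):
    """Ratio k_a / k_b of the rates implied by two barriers in kJ/mol."""
    delta_j_mol = (barrier_a_kj_mol - barrier_b_kj_mol) * 1000.0
    return math.exp(-delta_j_mol / (R * temperature_k))


if __name__ == "__main__":
    # A variant whose barrier is about 5.7 kJ/mol lower is roughly 10x faster
    # at room temperature under these assumptions.
    print(rate_ratio(60.0, 65.7))
```

Under this model, each reduction of RT ln(10), about 5.7 kJ/mol at 298.15 K, corresponds to a tenfold rate increase, which gives a sense of the barrier accuracy needed to track changes in k_cat across enzyme variants.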

Article
Peer Reviewed

Computational and Experimental Design of Quinones for Electrochemical CO2 Capture and Concentration

UC Irvine Previously Published Works (2022)

Current state-of-the-art thermal technologies for CO2 capture and concentration (CCC) from industrial emissions and air are energetically inefficient. In contrast, electrochemical CCC (eCCC) using redox carriers can theoretically approach 100% efficiency. However, there are currently few oxygen-stable redox carriers suitable for eCCC. Quinone derivatives have previously been studied as redox carriers as they have no affinity for CO2 in the fully oxidized state and an enhanced affinity for CO2 in their reduced states. Unfortunately, the quinones used in prior studies displayed an unfavorable tradeoff between their second reduction potential (E1/2) and CO2 binding constant (KCO2). As a result, reduced quinones that exhibit a sufficient KCO2 for flue gas or atmospheric CO2 capture have E1/2 values negative of the O2/O2•- reduction potential. To improve our understanding of the structural and electronic relationships that correlate KCO2 and E1/2, we report the largest set of quinones that have been experimentally evaluated for their KCO2 and E1/2 properties. The trends in the E1/2 and KCO2 properties were further investigated through extensive quantum chemical calculations to inform experimental carrier design. Notably, we identified structural handles to manipulate E1/2 and KCO2 properties of quinones; however, the altered steric and electronic effects did not disrupt their linear dependence.

Article

An Artificial Intelligence Framework for Optimal Drug Design

UCLA Previously Published Works (2022)

We introduce the concept of optimal drug design (ODD) as the use of an AI framework to optimize the exposure, safety, and efficacy of drugs. To exemplify the concept of ODD, we developed an artificial intelligence framework that integrates de novo molecular design, quantitative structure activity relationships, and pharmacokinetic-pharmacodynamic modeling. Specifically, our computational architecture has integrated a generative algorithm for small molecule design with a hybrid physiologically-based pharmacokinetic machine learning (PBPK-ML) model, which was applied to generate and optimize drug candidates for enhanced brain exposure. Publicly sourced data on the plasma and brain pharmacokinetics of 77 small molecule drugs in rats was used for model development. We have observed an approximate 30-fold and 120-fold increase on average in predicted brain exposure for AI generated molecules compared to known central nervous system drugs and randomly selected small organic molecules. We believe that with additional data and mechanistic modeling this in silico pipeline could facilitate the discovery of a new wave of optimally designed medicines for the treatment of CNS diseases.
Graphical Abstract: Artificial Intelligence Framework for the Optimization of Brain Pharmacokinetics. A genetic algorithm consisting of cross-breeding, mutating, scoring, and refining was used for de novo generation of a population of new molecular structures. SELFIES representations of molecules were used as input to a variational autoencoder for de novo generation/refinement of individual drug candidates. Molecular descriptors of individual drug candidates are generated and used as input into a trained neural network to generate drug-specific pharmacokinetic (PK) parameters. PK parameters are used as input into a physiologically-based pharmacokinetic (PBPK) model of the brain to predict brain PK of the drug candidate. Brain concentration-time profiles are integrated to obtain an area under the curve (AUC), a metric of brain exposure, which is used to score and inform the design of new generations of molecules. Iterations of this framework generate novel drug candidates optimized for greater brain exposure. Created with BioRender.
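The scoring step described above, integrating a brain concentration-time profile to obtain an AUC, can be sketched with the trapezoidal rule. This is a minimal sketch under stated assumptions: the abstract does not say which integration scheme was used, and the function name `auc_trapezoid` and the sample profile are hypothetical.

```python
def auc_trapezoid(times, concentrations):
    """Area under a concentration-time curve by the trapezoidal rule.

    `times` must be strictly increasing; the result has units of
    concentration multiplied by time.
    """
    if len(times) != len(concentrations):
        raise ValueError("times and concentrations must have equal length")
    if len(times) < 2:
        raise ValueError("at least two samples are required")
    total = 0.0
    for i in range(1, len(times)):
        dt = times[i] - times[i - 1]
        if dt <= 0:
            raise ValueError("times must be strictly increasing")
        # Area of the trapezoid between consecutive samples.
        total += 0.5 * dt * (concentrations[i] + concentrations[i - 1])
    return total


if __name__ == "__main__":
    # Hypothetical profile: time in hours, brain concentration in ng/mL.
    hours = [0.0, 0.5, 1.0, 2.0, 4.0]
    conc = [0.0, 8.0, 6.0, 3.0, 1.0]
    print(auc_trapezoid(hours, conc))
```

In a generate-and-score loop of the kind described, candidates with a larger predicted AUC would be ranked higher and carried into the next generation.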

Article
Peer Reviewed

Seasonal changes in diet and chemical defense in the Climbing Mantella frog (Mantella laevigata).

UC Davis Previously Published Works (2018)

Poison frogs acquire chemical defenses from the environment for protection against potential predators. These defensive chemicals are lipophilic alkaloids that are sequestered by poison frogs from dietary arthropods and stored in skin glands. Despite decades of research focusing on identifying poison frog alkaloids, we know relatively little about how environmental variation and subsequent arthropod availability impact alkaloid loads in poison frogs. We investigated how seasonal environmental variation influences poison frog chemical profiles through changes in the diet of the Climbing Mantella (Mantella laevigata). We collected M. laevigata females on the Nosy Mangabe island reserve in Madagascar during the wet and dry seasons and tested the hypothesis that seasonal differences in rainfall are associated with changes in diet composition and skin alkaloid profiles of M. laevigata. The arthropod diet of each frog was categorized into five groups (i.e. ants, termites, mites, insect larvae, or other) using visual identification and cytochrome oxidase 1 DNA barcoding. We found that frog diet differed between the wet and dry seasons, where frogs had a more diverse diet in the wet season and consumed a higher percentage of ants in the dry season. To determine if seasonality was associated with variation in frog defensive chemical composition, we used gas chromatography / mass spectrometry to quantify alkaloids from individual skin samples. Although the assortment of identified alkaloids was similar across seasons, we detected significant differences in the abundance of certain alkaloids, which we hypothesize reflects seasonal variation in the diet of M. laevigata. We suggest that these variations could originate from seasonal changes in either leaf litter arthropod composition or frog behavioral patterns. Although additional studies are needed to understand the consequences of long-term environmental shifts, this work suggests that alkaloid profiles are relatively robust against short-term environmental perturbations.
