Machine learned force fields typically require manual construction of training sets consisting of thousands of first principles calculations, which can result in low training efficiency and ...unpredictable errors when applied to structures not represented in the training set of the model. This severely limits the practical application of these models in systems with dynamics governed by important rare events, such as chemical reactions and diffusion. We present an adaptive Bayesian inference method for automating the training of interpretable, low-dimensional, and multi-element interatomic force fields using structures drawn on the fly from molecular dynamics simulations. Within an active learning framework, the internal uncertainty of a Gaussian process regression model is used to decide whether to accept the model prediction or to perform a first principles calculation to augment the training set of the model. The method is applied to a range of single- and multi-element systems and shown to achieve a favorable balance of accuracy and computational efficiency, while requiring a minimal amount of ab initio training data. We provide a fully open-source implementation of our method, as well as a procedure to map trained models to computationally efficient tabulated force fields.
Rational catalyst design is crucial toward achieving more energy-efficient and sustainable catalytic processes. Understanding and modeling catalytic reaction pathways and kinetics require atomic ...level knowledge of the active sites. These structures often change dynamically during reactions and are difficult to decipher. A prototypical example is the hydrogen-deuterium exchange reaction catalyzed by dilute Pd-in-Au alloy nanoparticles. From a combination of catalytic activity measurements, machine learning-enabled spectroscopic analysis, and first-principles based kinetic modeling, we demonstrate that the active species are surface Pd ensembles containing only a few (from 1 to 3) Pd atoms. These species simultaneously explain the observed X-ray spectra and equate the experimental and theoretical values of the apparent activation energy. Remarkably, we find that the catalytic activity can be tuned on demand by controlling the size of the Pd ensembles through catalyst pretreatment. Our data-driven multimodal approach enables decoding of reactive structures in complex and dynamic alloy catalysts.
X-ray absorption spectroscopy (XAS) produces a wealth of information about the local structure of materials, but interpretation of spectra often relies on easily accessible trends and prior ...assumptions about the structure. Recently, researchers have demonstrated that machine learning models can automate this process to predict the coordinating environments of absorbing atoms from their XAS spectra. However, machine learning models are often difficult to interpret, making it challenging to determine when they are valid and whether they are consistent with physical theories. In this work, we present three main advances to the data-driven analysis of XAS spectra: we demonstrate the efficacy of random forests in solving two new property determination tasks (predicting Bader charge and mean nearest neighbor distance), we address how choices in data representation affect model interpretability and accuracy, and we show that multiscale featurization can elucidate the regions and trends in spectra that encode various local properties. The multiscale featurization transforms the spectrum into a vector of polynomial-fit features, and is contrasted with the commonly-used “pointwise” featurization that directly uses the entire spectrum as input. We find that across thousands of transition metal oxide spectra, the relative importance of features describing the curvature of the spectrum can be localized to individual energy ranges, and we can separate the importance of constant, linear, quadratic, and cubic trends, as well as the white line energy. This work has the potential to assist rigorous theoretical interpretations, expedite experimental data collection, and automate analysis of XAS spectra, thus accelerating the discovery of new functional materials.
We use a data-driven approach to study the magnetic and thermodynamic properties of van der Waals (vdW) layered materials. We investigate monolayers of the form Formula: see text, based on the known ...material Formula: see text, using density functional theory (DFT) calculations and machine learning methods to determine their magnetic properties, such as magnetic order and magnetic moment. We also examine formation energies and use them as a proxy for chemical stability. We show that machine learning tools, combined with DFT calculations, can provide a computationally efficient means to predict properties of such two-dimensional (2D) magnetic materials. Our data analytics approach provides insights into the microscopic origins of magnetic ordering in these systems. For instance, we find that the X site strongly affects the magnetic coupling between neighboring A sites, which drives the magnetic ordering. Our approach opens new ways for rapid discovery of chemically stable vdW materials that exhibit magnetic behavior.
Abstract
Forming a hetero-interface is a materials-design strategy that can access an astronomically large phase space. However, the immense phase space necessitates a high-throughput approach for an ...optimal interface design. Here we introduce a high-throughput computational framework, InterMatch, for efficiently predicting charge transfer, strain, and superlattice structure of an interface by leveraging the databases of individual bulk materials. Specifically, the algorithm reads in the lattice vectors, density of states, and the stiffness tensors for each material in their isolated form from the Materials Project. From these bulk properties, InterMatch estimates the interfacial properties. We benchmark InterMatch predictions for the charge transfer against experimental measurements and supercell density-functional theory calculations. We then use InterMatch to predict promising interface candidates for doping transition metal dichalcogenide MoSe
2
. Finally, we explain experimental observation of factor of 10 variation in the supercell periodicity within a few microns in graphene/
α
-RuCl
3
by exploring low energy superlattice structures as a function of twist angle using InterMatch. We anticipate our open-source InterMatch algorithm accelerating and guiding ever-growing interfacial design efforts. Moreover, the interface database resulting from the InterMatch searches presented in this paper can be readily accessed online.
Sequential learning for materials discovery is a paradigm where a computational agent solicits new data to simultaneously update a model in service of exploration (finding the largest number of ...materials that meet some criteria) or exploitation (finding materials with an ideal figure of merit). In real-world discovery campaigns, new data acquisition may be costly and an optimal strategy may involve using and acquiring data with different levels of fidelity, such as first-principles calculation to supplement an experiment. In this work, we introduce agents which can operate on multiple data fidelities, and benchmark their performance on an emulated discovery campaign to find materials with desired band gap values. The fidelities of data come from the results of DFT calculations as low fidelity and experimental results as high fidelity. We demonstrate performance gains of agents which incorporate multi-fidelity data in two contexts: either using a large body of low fidelity data as a prior knowledge base or acquiring low fidelity data in-tandem with experimental data. This advance provides a tool that enables materials scientists to test various acquisition and model hyperparameters to maximize the discovery rate of their own multi-fidelity sequential learning campaigns for materials discovery. This may also serve as a reference point for those who are interested in practical strategies that can be used when multiple data sources are available for active or sequential learning campaigns.
In this perspective, we highlight results of a research consortium devoted to advancing understanding of oxygen reduction reaction (ORR) catalysis as a means to inform fuel cell science. We ...demonstrate how targeted collaborations between different institutions from academic, national lab, and industry backgrounds and different scientific disciplines like theory, experiment, and characterization can yield unique insights into fuel cell catalysts. We comment on such insights into material designs for platinum-group-metal alloys, transition metal oxides, and non-traditional materials including metal-organic frameworks; systems that have served as the foundational building blocks for our consortium. We also motivate a renewed focus on catalyst durability in light of emerging technological requirements and paths forward in understanding
in situ
and
operando
electrochemical stability. Finally, we describe new frontiers ORR research can take and how emerging artificial intelligence tools can assist researchers in capturing data, selecting new experiments, and guiding characterization to accelerate the design and discovery of fuel cell catalysts. A main goal of sharing this perspective is to discuss the rationale for our future research plans based on our consortium work. However, we also hope to illustrate both the potential impact of a collaborative strategy with the hopes of inspiring a higher degree of Industry-Academia-National Laboratory collaboration and encourage other centers and consortiums to distill and share their findings in a similar perspective-type article. Together we hope to enable the fuel cell research community to engage in a discussion of strategies for research and accelerated development of catalysts with improved activity and stability.
In this perspective, we highlight results of a research consortium devoted to advancing understanding of oxygen reduction reaction (ORR) catalysis as a means to inform fuel cell science.
Hundreds of genes interact with the yeast nuclear pore complex (NPC), localizing at the nuclear periphery and clustering with co-regulated genes. Dynamic tracking of peripheral genes shows that they ...cycle on and off the NPC and that interaction with the NPC slows their sub-diffusive movement. Furthermore, NPC-dependent inter-chromosomal clustering leads to coordinated movement of pairs of loci separated by hundreds of nanometers. We developed fractional Brownian motion simulations for chromosomal loci in the nucleoplasm and interacting with NPCs. These simulations predict the rate and nature of random sub-diffusion during repositioning from nucleoplasm to periphery and match measurements from two different experimental models, arguing that recruitment to the nuclear periphery is due to random sub-diffusion and transient capture by NPCs. Finally, the simulations do not lead to inter-chromosomal clustering or coordinated movement, suggesting that interaction with the NPC is necessary, but not sufficient, to cause clustering.
Quantum confinement endows two-dimensional (2D) layered materials with exceptional physics and novel properties compared to their bulk counterparts. Although certain two- and few-layer configurations ...of graphene have been realized and studied, a systematic investigation of the properties of arbitrarily layered graphene assemblies is still lacking. We introduce theoretical concepts and methods for the processing of materials information, and as a case study, apply them to investigate the electronic structure of multi-layer graphene-based assemblies in a high-throughput fashion. We provide a critical discussion of patterns and trends in tight binding band structures and we identify specific layered assemblies using low-dispersion electronic bands as indicators of potentially interesting physics like strongly correlated behavior. A combination of data-driven models for visualization and prediction is used to intelligently explore the materials space. This work more generally aims to increase confidence in the combined use of physics-based and data-driven modeling for the systematic refinement of knowledge about 2D layered materials, with implications for the development of novel quantum devices.
Exploratory synthesis has been the main generator of new inorganic materials for decades. However, our Edisonian and bias-prone processes of synthetic exploration alone are no longer sufficient in an ...age that demands rapid advances in materials development. In this work, we demonstrate an end-to-end attempt towards systematic, computer-aided discovery and laboratory synthesis of inorganic crystalline compounds as a modern alternative to purely exploratory synthesis. Our approach initializes materials discovery campaigns by autonomously mapping the synthetic feasibility of a chemical system using density functional theory with AI feedback. Following expert-driven down-selection of newly generated phases, we use solid-state synthesis and
in situ
characterization
via
hot-stage X-ray diffraction in order to realize new ternary oxide phases experimentally. We applied this strategy in six ternary transition-metal oxide chemistries previously considered well-explored, one of which culminated in the discovery of two novel phases of calcium ruthenates. Detailed characterization using room temperature X-ray powder diffraction, 4D-STEM and SQUID measurements identifies the structure and composition and confirms distinct properties, including distinct defect concentrations, of one of the new phases formed in our experimental campaigns. While the discovery of a new material guided by AI and DFT theory represents a milestone, our procedure and results also highlight a number of critical gaps in the process that can inform future efforts towards the improvement of AI-coupled methodologies.
Exploratory synthesis has been the main generator of new inorganic materials for decades. AI-assisted discovery is possible, but human-AI collaboration should be refined according to their respective strengths.