No Arabic abstract
Active learning - the field of machine learning (ML) dedicated to optimal experiment design, has played a part in science as far back as the 18th century when Laplace used it to guide his discovery of celestial mechanics [1]. In this work we focus a closed-loop, active learning-driven autonomous system on another major challenge, the discovery of advanced materials against the exceedingly complex synthesis-processes-structure-property landscape. We demonstrate autonomous research methodology (i.e. autonomous hypothesis definition and evaluation) that can place complex, advanced materials in reach, allowing scientists to fail smarter, learn faster, and spend less resources in their studies, while simultaneously improving trust in scientific results and machine learning tools. Additionally, this robot science enables science-over-the-network, reducing the economic impact of scientists being physically separated from their labs. We used the real-time closed-loop, autonomous system for materials exploration and optimization (CAMEO) at the synchrotron beamline to accelerate the fundamentally interconnected tasks of rapid phase mapping and property optimization, with each cycle taking seconds to minutes, resulting in the discovery of a novel epitaxial nanocomposite phase-change memory material.
Neutron scattering is a unique and versatile characterization technique for probing the magnetic structure and dynamics of materials. However, instruments at neutron scattering facilities in the world is limited, and instruments at such facilities are perennially oversubscribed. We demonstrate a significant reduction in experimental time required for neutron diffraction experiments by implementation of autonomous navigation of measurement parameter space through machine learning. Prior scientific knowledge and Bayesian active learning are used to dynamically steer the sequence of measurements. We developed the autonomous neutron diffraction explorer (ANDiE) and used it to determine the magnetic order of MnO and Fe1.09Te. ANDiE can determine the Neel temperature of the materials with 5-fold enhancement in efficiency and correctly identify the transition dynamics via physics-informed Bayesian inference. ANDiEs active learning approach is broadly applicable to a variety of neutron-based experiments and can open the door for neutron scattering as a tool of accelerated materials discovery.
Emergent functionalities of structural and topological defects in ferroelectric materials underpin an extremely broad spectrum of applications ranging from domain wall electronics to high dielectric and electromechanical responses. Many of these have been discovered and quantified via local scanning probe microscopy methods. However, the search for these functionalities has until now been based by either trial and error or using auxiliary information such as topography or domain wall structure to identify potential objects of interest based on the intuition of operator or preexisting hypotheses, with subsequent manual exploration. Here, we report the development and implementation of a machine learning framework that actively discovers relationships between local domain structure and polarization switching characteristics in ferroelectric materials encoded in the hysteresis loop. The latter and descriptors such as nucleation bias, coercive bias, hysteresis loop area, or more complex functionals of hysteresis loop shape and corresponding uncertainties are used to guide the discovery via automated piezoresponse force microscopy (PFM) and spectroscopy experiments. As such, this approach combines the power of machine learning methods to learn the correlative relationships between high dimensional data, and human-based physics insights encoded in the acquisition function. For ferroelectric, this automated workflow demonstrates that the discovery path and sampling points of on-field and off-field hysteresis loops are largely different, indicating the on-field and off-field hysteresis loops are dominated by different mechanisms. The proposed approach is universal and can be applied to a broad range of modern imaging and spectroscopy methods ranging from other scanning probe microscopy modalities to electron microscopy and chemical imaging.
Machine learning (ML) models utilizing structure-based features provide an efficient means for accurate property predictions across diverse chemical spaces. However, obtaining equilibrium crystal structures typically requires expensive density functional theory (DFT) calculations, which limits ML-based exploration to either known crystals or a small number of hypothetical crystals. Here, we demonstrate that the application of Bayesian optimization with symmetry constraints using a graph deep learning energy model can be used to perform DFT-free relaxations of crystal structures. Using this approach to significantly improve the accuracy of ML-predicted formation energies and elastic moduli of hypothetical crystals, two novel ultra-incompressible hard materials MoWC2 (P63/mmc) and ReWB (Pca21) were identified and successfully synthesized via in-situ reactive spark plasma sintering from a screening of 399,960 transition metal borides and carbides. This work addresses a critical bottleneck to accurate property predictions for hypothetical materials, paving the way to ML-accelerated discovery of new materials with exceptional properties.
Active learning has been increasingly applied to screening functional materials from existing materials databases with desired properties. However, the number of known materials deposited in the popular materials databases such as ICSD and Materials Project is extremely limited and consists of just a tiny portion of the vast chemical design space. Herein we present an active generative inverse design method that combines active learning with a deep variational autoencoder neural network and a generative adversarial deep neural network model to discover new materials with a target property in the whole chemical design space. The application of this method has allowed us to discover new thermodynamically stable materials with high band gap (SrYF$_5$) and semiconductors with specified band gap ranges (SrClF$_3$, CaClF$_5$, YCl$_3$, SrC$_2$F$_3$, AlSCl, As$_2$O$_3$), all of which are verified by the first principle DFT calculations. Our experiments show that while active learning itself may sample chemically infeasible candidates, these samples help to train effective screening models for filtering out materials with desired properties from the hypothetical materials created by the generative model. The experiments show the effectiveness of our active generative inverse design approach.
Deep learning models have demonstrated outstanding performance in several problems, but their training process tends to require immense amounts of computational and human resources for training and labeling, constraining the types of problems that can be tackled. Therefore, the design of effective training methods that require small labeled training sets is an important research direction that will allow a more effective use of resources.Among current approaches designed to address this issue, two are particularly interesting: data augmentation and active learning. Data augmentation achieves this goal by artificially generating new training points, while active learning relies on the selection of the most informative subset of unlabeled training samples to be labelled by an oracle. Although successful in practice, data augmentation can waste computational resources because it indiscriminately generates samples that are not guaranteed to be informative, and active learning selects a small subset of informative samples (from a large un-annotated set) that may be insufficient for the training process. In this paper, we propose a Bayesian generative active deep learning approach that combines active learning with data augmentation -- we provide theoretical and empirical evidence (MNIST, CIFAR-${10,100}$, and SVHN) that our approach has more efficient training and better classification results than data augmentation and active learning.