No Arabic abstract
Solar-energy plays an important role in solving serious environmental problems and meeting high-energy demand. However, the lack of suitable materials hinders further progress of this technology. Here, we present the largest inorganic solar-cell material search to date using density functional theory (DFT) and machine-learning approaches. We calculated the spectroscopic limited maximum efficiency (SLME) using Tran-Blaha modified Becke-Johnson potential for 5097 non-metallic materials and identified 1997 candidates with an SLME higher than 10%, including 934 candidates with suitable convex-hull stability and effective carrier mass. Screening for 2D-layered cases, we found 58 potential materials and performed G0W0 calculations on a subset to estimate the prediction-uncertainty. As the above DFT methods are still computationally expensive, we developed a high accuracy machine learning model to pre-screen efficient materials and applied it to over a million materials. Our results provide a general framework and universal strategy for the design of high-efficiency solar cell materials. The data and tools are publicly distributed at: https://www.ctcms.nist.gov/~knc6/JVASP.html, https://www.ctcms.nist.gov/jarvisml/, https://jarvis.nist.gov/ and https://github.com/usnistgov/jarvis .
Machine learning was utilized to efficiently boost the development of soft magnetic materials. The design process includes building a database composed of published experimental results, applying machine learning methods on the database, identifying the trends of magnetic properties in soft magnetic materials, and accelerating the design of next-generation soft magnetic nanocrystalline materials through the use of numerical optimization. Machine learning regression models were trained to predict magnetic saturation ($B_S$), coercivity ($H_C$) and magnetostriction ($lambda$), with a stochastic optimization framework being used to further optimize the corresponding magnetic properties. To verify the feasibility of the machine learning model, several optimized soft magnetic materials -- specified in terms of compositions and thermomechanical treatments -- have been predicted and then prepared and tested, showing good agreement between predictions and experiments, proving the reliability of the designed model. Two rounds of optimization-testing iterations were conducted to search for better properties.
Material scientists are increasingly adopting the use of machine learning (ML) for making potentially important decisions, such as, discovery, development, optimization, synthesis and characterization of materials. However, despite MLs impressive performance in commercial applications, several unique challenges exist when applying ML in materials science applications. In such a context, the contributions of this work are twofold. First, we identify common pitfalls of existing ML techniques when learning from underrepresented/imbalanced material data. Specifically, we show that with imbalanced data, standard methods for assessing quality of ML models break down and lead to misleading conclusions. Furthermore, we found that the models own confidence score cannot be trusted and model introspection methods (using simpler models) do not help as they result in loss of predictive performance (reliability-explainability trade-off). Second, to overcome these challenges, we propose a general-purpose explainable and reliable machine-learning framework. Specifically, we propose a novel pipeline that employs an ensemble of simpler models to reliably predict material properties. We also propose a transfer learning technique and show that the performance loss due to models simplicity can be overcome by exploiting correlations among different material properties. A new evaluation metric and a trust score to better quantify the confidence in the predictions are also proposed. To improve the interpretability, we add a rationale generator component to our framework which provides both model-level and decision-level explanations. Finally, we demonstrate the versatility of our technique on two applications: 1) predicting properties of crystalline compounds, and 2) identifying novel potentially stable solar cell materials.
The large-scale search for high-performing candidate 2D materials is limited to calculating a few simple descriptors, usually with first-principles density functional theory calculations. In this work, we alleviate this issue by extending and generalizing crystal graph convolutional neural networks to systems with planar periodicity, and train an ensemble of models to predict thermodynamic, mechanical, and electronic properties. To demonstrate the utility of this approach, we carry out a screening of nearly 45,000 structures for two largely disjoint applications: namely, mechanically robust composites and photovoltaics. An analysis of the uncertainty associated with our methods indicates the ensemble of neural networks is well-calibrated and has errors comparable with those from accurate first-principles density functional theory calculations. The ensemble of models allows us to gauge the confidence of our predictions, and to find the candidates most likely to exhibit effective performance in their applications. Since the datasets used in our screening were combinatorically generated, we are also able to investigate, using an innovative method, structural and compositional design principles that impact the properties of the structures surveyed and which can act as a generative model basis for future material discovery through reverse engineering. Our approach allowed us to recover some well-accepted design principles: for instance, we find that hybrid organic-inorganic perovskites with lead and tin tend to be good candidates for solar cell applications.
Lattice constants such as unit cell edge lengths and plane angles are important parameters of the periodic structures of crystal materials. Predicting crystal lattice constants has wide applications in crystal structure prediction and materials property prediction. Previous work has used machine learning models such as neural networks and support vector machines combined with composition features for lattice constant prediction and has achieved a maximum performance for cubic structures with an average $R^2$ of 0.82. Other models tailored for special materials family of a fixed form such as ABX3 perovskites can achieve much higher performance due to the homogeneity of the structures. However, these models trained with small datasets are usually not applicable to generic lattice parameter prediction of materials with diverse compositions. Herein, we report MLatticeABC, a random forest machine learning model with a new descriptor set for lattice unit cell edge length ($a,b,c$) prediction which achieves an R2 score of 0.979 for lattice parameter $a$ of cubic crystals and significant performance improvement for other crystal systems as well. Source code and trained models can be freely accessed at https://github.com/usccolumbia/MLatticeABC
Although the richness of spatial symmetries has led to a rapidly expanding inventory of possible topological crystalline (TC) phases of electrons, physical realizations have been slow to materialize due to the practical difficulty to ascertaining band topology in realistic calculations. Here, we integrate the recently established theory of symmetry indicators of band topology into first-principle band-structure calculations, and test it on a databases of previously synthesized crystals. The combined algorithm is found to efficiently unearth topological materials and predict topological properties like protected surface states. On applying our algorithm to just 8 out of the 230 space groups, we already discover numerous materials candidates displaying a diversity of topological phenomena, which are simultaneously captured in a single sweep. The list includes recently proposed classes of TC insulators that had no previous materials realization as well as other topological phases, including: (i) a screw-protected 3D TC insulator, b{eta}-MoTe2, with gapped surfaces except for 1D helical hinge states; (ii) a rotation-protected TC insulator BiBr with coexisting surface Dirac cones and hinge states; (iii) non-centrosymmetric Z2 topological insulators undetectable using the well-established parity criterion, AgXO (X=Na,K,Rb); (iv) a Dirac semimetal MgBi2O6; (v) a Dirac nodal-line semimetal AgF2; and (vi) a metal with three-fold degenerate band crossing near the Fermi energy, AuLiMgSn. Our work showcases how the recent theoretical insights on the fundamentals of band structures can aid in the practical goal of discovering new topological materials.