No Arabic abstract
Using a realistic molecular catalyst system, we conduct scaling studies of ab initio molecular dynamics simulations using the CP2K code on both Intel Xeon CPU and NVIDIA V100 GPU architectures. We explore using process placement and affinity to gain additional performance improvements. We also use statistical methods to understand performance changes in spite of the variability in runtime for each molecular dynamics timestep. We found ideal conditions for CPU runs included at least four MPI ranks per node, bound evenly across each socket, and fully utilizing processing cores with one OpenMP thread per core, no benefit was shown from reserving cores for the system. The CPU-only simulations scaled at 70% or more of the ideal scaling up to 10 compute nodes, after which the returns began to diminish more quickly. Simulations on a single 40-core node with two NVIDIA V100 GPUs for acceleration achieved over 3.7x speedup compared to the fastest single 36-core node CPU-only version, and showed 13% speedup over the fastest time we achieved across five CPU-only nodes.
This paper has been withdrawn by the author
We extend the ab initio molecular dynamics (AIMD) method based on density functional theory to the nonequilibrium situation where an electronic current is present in the electronic system. The dynamics is treated using the semi-classical generalized Langevin equation. We demonstrate how the full anharmonic description of the inter-atomic forces is important in order to understand the current-induced heating and the energy distribution both in frequency and in real space.
This work examines the performance of leading-edge systems designed for machine learning computing, including the NVIDIA DGX-2, Amazon Web Services (AWS) P3, IBM Power System Accelerated Compute Server AC922, and a consumer-grade Exxact TensorEX TS4 GPU server. Representative deep learning workloads from the fields of computer vision and natural language processing are the focus of the analysis. Performance analysis is performed along with a number of important dimensions. Performance of the communication interconnects and large and high-throughput deep learning models are considered. Different potential use models for the systems as standalone and in the cloud also are examined. The effect of various optimization of the deep learning models and system configurations is included in the analysis.
Ab initio molecular dynamics (AIMD) is a valuable technique for studying molecules and materials at finite temperatures where the nuclei evolve on potential energy surfaces obtained from accurate electronic structure calculations. In this work, a quantum computer-based AIMD method is presented. The electronic energies are calculated on a quantum computer using the variational quantum eigensolver (VQE) method. We compute the energy gradients numerically using the Hellmann-Feynman theorem, finite differences, and a correlated sampling technique. Our method only requires additional classical calculations of electron integrals for each degree of freedom, without any additional computations on a quantum computer beyond the initial VQE run. To achieve comparable accuracy, our gradient calculation method requires three to five orders of magnitude fewer measurements than other brute force methods without correlated sampling. As a proof of concept, AIMD dynamics simulations are demonstrated for the H2 molecule on IBM quantum devices. To the best of our knowledge, it is the first successful attempt to run AIMD on quantum devices for a chemical system. In addition, we demonstrate the validity of the method for larger molecules using full configuration interaction (FCI) wave functions. As quantum hardware and noise mitigation techniques continue to improve, the method can be utilized for studying larger molecular and material systems.
Cloud computing has become increasingly popular. Many options of cloud deployments are available. Testing cloud performance would enable us to choose a cloud deployment based on the requirements. In this paper, we present an innovative process, implemented in software, to allow us to assess the quality of the cloud performance data. The process combines performance data from multiple machines, spanning across user experience data, workload performance metrics, and readily available system performance data. Furthermore, we discuss the major challenges of bringing raw data into tidy data formats in order to enable subsequent analysis, and describe how our process has several layers of assessment to validate the quality of the data processing procedure. We present a case study to demonstrate the effectiveness of our proposed process, and conclude our paper with several future research directions worth investigating.