No Arabic abstract
The GooFit Framework is designed to perform maximum-likelihood fits for arbitrary functions on various parallel back ends, for example a GPU. We present an extension to GooFit which adds the functionality to perform time-dependent amplitude analyses of pseudoscalar mesons decaying into four pseudoscalar final states. Benchmarks of this functionality show a significant performance increase when utilizing a GPU compared to a CPU. Furthermore, this extension is employed to study the sensitivity on the $D^0 - bar{D}^0$ mixing parameters $x$ and $y$ in a time-dependent amplitude analysis of the decay $D^0 rightarrow K^+pi^-pi^+pi^-$. Studying a sample of 50 000 events and setting the central values to the world average of $x = (0.49 pm0.15) %$ and $y = (0.61 pm0.08) %$, the statistical sensitivities of $x$ and $y$ are determined to be $sigma(x) = 0.019 %$ and $sigma(y) = 0.019 %$.
MCBooster is a header-only, C++11-compliant library that provides routines to generate and perform calculations on large samples of phase space Monte Carlo events. To achieve superior performance, MCBooster is capable to perform most of its calculations in parallel using CUDA- and OpenMP-enabled devices. MCBooster is built on top of the Thrust library and runs on Linux systems. This contribution summarizes the main features of MCBooster. A basic description of the user interface and some examples of applications are provided, along with measurements of performance in a variety of environments
Machine-intelligence has become a driving factor in modern society. However, its demand outpaces the underlying electronic technology due to limitations given by fundamental physics such as capacitive charging of wires, but also by system architecture of storing and handling data, both driving recent trends towards processor heterogeneity. Here we introduce a novel amplitude-only Fourier-optical processor paradigm capable of processing large-scale ~(1,000 x 1,000) matrices in a single time-step and 100 microsecond-short latency. Conceptually, the information-flow direction is orthogonal to the two-dimensional programmable-network, which leverages 10^6-parallel channels of display technology, and enables a prototype demonstration performing convolutions as pixel-wise multiplications in the Fourier domain reaching peta operations per second throughputs. The required real-to-Fourier domain transformations are performed passively by optical lenses at zero-static power. We exemplary realize a convolutional neural network (CNN) performing classification tasks on 2-Megapixel large matrices at 10 kHz rates, which latency-outperforms current GPU and phase-based display technology by one and two orders of magnitude, respectively. Training this optical convolutional layer on image classification tasks and utilizing it in a hybrid optical-electronic CNN, shows classification accuracy of 98% (MNIST) and 54% (CIFAR-10). Interestingly, the amplitude-only CNN is inherently robust against coherence noise in contrast to phase-based paradigms and features an over 2 orders of magnitude lower delay than liquid crystal-based systems. Beyond contributing to novel accelerator technology, scientifically this amplitude-only massively-parallel optical compute-paradigm can be far-reaching as it de-validates the assumption that phase-information outweighs amplitude in optical processors for machine-intelligence.
Hydra is a header-only, templated and C++11-compliant framework designed to perform the typical bottleneck calculations found in common HEP data analyses on massively parallel platforms. The framework is implemented on top of the C++11 Standard Library and a variadic version of the Thrust library and is designed to run on Linux systems, using OpenMP, CUDA and TBB enabled devices. This contribution summarizes the main features of Hydra. A basic description of the overall design, functionality and user interface is provided, along with some code examples and measurements of performance.
A massively parallel simulation code, called textit{dHybrid}, has been developed to perform global scale studies of space plasma interactions. This code is based on an explicit hybrid model; the numerical stability and parallel scalability of the code are studied. A stabilization method for the explicit algorithm, for regions of near zero density, is proposed. Three-dimensional hybrid simulations of the interaction of the solar wind with unmagnetized artificial objects are presented, with a focus on the expansion of a plasma cloud into the solar wind, which creates a diamagnetic cavity and drives the Interplanetary Magnetic Field out of the expansion region. The dynamics of this system can provide insights into other similar scenarios, such as the interaction of the solar wind with unmagnetized planets.
The Dalitz plot analysis technique is used to study the resonant substructures of $B^{-} to D^{+} pi^{-} pi^{-}$ decays in a data sample corresponding to 3.0 ${rm fb}^{-1}$ of $pp$ collision data recorded by the LHCb experiment during 2011 and 2012. A model-independent analysis of the angular moments demonstrates the presence of resonances with spins 1, 2 and 3 at high $D^{+}pi^{-}$ mass. The data are fitted with an amplitude model composed of a quasi-model-independent function to describe the $D^{+}pi^{-}$ S-wave together with virtual contributions from the $D^{*}(2007)^{0}$ and $B^{*0}$ states, and components corresponding to the $D^{*}_{2}(2460)^{0}$, $D^{*}_{1}(2680)^{0}$, $D^{*}_{3}(2760)^{0}$ and $D^{*}_{2}(3000)^{0}$ resonances. The masses and widths of these resonances are determined together with the branching fractions for their production in $B^{-} to D^{+} pi^{-} pi^{-}$ decays. The $D^{+}pi^{-}$ S-wave has phase motion consistent with that expected due to the presence of the $D^{*}_{0}(2400)^{0}$ state. These results constitute the first observations of the $D^{*}_{3}(2760)^{0}$ and $D^{*}_{2}(3000)^{0}$ resonances.