No Arabic abstract
We studied the performance of the Convolutional Neural Network (CNN) for energy regression in a finely 3D-segmented calorimeter simulated by GEANT4. A CNN trained solely on a pure sample of pions achieved substantial improvement in the energy resolution for both single pions and jets over the conventional approaches. It maintained good performance for electron and photon reconstruction. We also used the Graph Neural Network (GNN) with edge convolution to assess the importance of timing information in the shower development for improved energy reconstruction. In this paper, we present the comparison of several reconstruction techniques: a simple energy sum, a dual-readout analog, a CNN, and a GNN with timing information.
Pattern recognition problems in high energy physics are notably different from traditional machine learning applications in computer vision. Reconstruction algorithms identify and measure the kinematic properties of particles produced in high energy collisions and recorded with complex detector systems. Two critical applications are the reconstruction of charged particle trajectories in tracking detectors and the reconstruction of particle showers in calorimeters. These two problems have unique challenges and characteristics, but both have high dimensionality, high degree of sparsity, and complex geometric layouts. Graph Neural Networks (GNNs) are a relatively new class of deep learning architectures which can deal with such data effectively, allowing scientists to incorporate domain knowledge in a graph structure and learn powerful representations leveraging that structure to identify patterns of interest. In this work we demonstrate the applicability of GNNs to these two diverse particle reconstruction problems.
We present a study which shows encouraging stability of the response linearity for a simulated high granularity calorimeter module reconstructed by a CNN model to miscalibration, bias, and noise effects. Our results also show an intuitive, quantifiable relationship between these factors and the calibration parameters. We trained a CNN model to reconstruct energy in the calorimeter module using simulated single-pion events; we then observed the response of the model under various miscalibration, bias, and noise conditions that affected the model input. From these data, we estimated linear response models to calibrate the CNN. We also quantified the relationship between these factors and the calibration parameters by regression analysis.
We apply deep neural networks (DNN) to data from the EXO-200 experiment. In the studied cases, the DNN is able to reconstruct the relevant parameters - total energy and position - directly from raw digitized waveforms, with minimal exceptions. For the first time, the developed algorithms are evaluated on real detector calibration data. The accuracy of reconstruction either reaches or exceeds what was achieved by the conventional approaches developed by EXO-200 over the course of the experiment. Most existing DNN approaches to event reconstruction and classification in particle physics are trained on Monte Carlo simulated events. Such algorithms are inherently limited by the accuracy of the simulation. We describe a unique approach that, in an experiment such as EXO-200, allows to successfully perform certain reconstruction and analysis tasks by training the network on waveforms from experimental data, either reducing or eliminating the reliance on the Monte Carlo.
The liquid argon ionization current in a sampling calorimeter cell can be analyzed to determine the energy of detected particles. In practice, experimental artifacts such as pileup and electronic noise make the inference of energy from current a difficult process. The beam intensity of the Large Hadron Collider will be significantly increased during the Phase-II long shut down of 2024-2026. Signal processing techniques that are used to extract the energy of detected particles in the ATLAS detector will suffer a significant loss in performance under these conditions. This paper compares the presently used optimal filter technique to convolutional neural networks for energy reconstruction in the ATLAS liquid argon hadronic end cap calorimeter. In particular, it is shown that convolutional neural networks trained with an appropriately tuned and novel loss function are able to outperform the optimal filter technique.
We present the 3DGAN for the simulation of a future high granularity calorimeter output as three-dimensional images. We prove the efficacy of Generative Adversarial Networks (GANs) for generating scientific data while retaining a high level of accuracy for diverse metrics across a large range of input variables. We demonstrate a successful application of the transfer learning concept: we train the network to simulate showers for electrons from a reduced range of primary energies, we then train further for a five times larger range (the model could not train for the larger range directly). The same concept is extended to generate showers for other particles (photons and neutral pions) depositing most of their energies in electromagnetic interactions. In addition, the generation of charged pion showers is also explored, a more accurate effort would require additional data from other detectors not included in the scope of the current work. Our further contribution is a demonstration of using GAN-generated data for a practical application. We train a third-party network using GAN-generated data and prove that the response is similar to a network trained with data from the Monte Carlo simulation. The showers generated by GAN present accuracy within $10%$ of Monte Carlo for a diverse range of physics features, with three orders of magnitude speedup. The speedup for both the training and inference can be further enhanced by distributed training.