We present a load balancing strategy for hybrid particle-mesh methods that is based on domain decomposition and element-local time measurement. This new strategy is compared to our previous approach, which assumes a constant weighting factor for each particle to determine the computational load. The timer-based load balancing is applied to a plasma expansion simulation. The performance of the new algorithm is compared to results presented in the past and a significant improvement in terms of computational efficiency is shown.
In the wake of the intense effort made for the experimental CILEX project, numerical simulation cam- paigns have been carried out in order to finalize the design of the facility and to identify optimal laser and plasma parameters. These simulations bring, of course, important insight into the fundamental physics at play. As a by-product, they also characterize the quality of our theoretical and numerical models. In this paper, we compare the results given by different codes and point out algorithmic lim- itations both in terms of physical accuracy and computational performances. These limitations are illu- strated in the context of electron laser wakefield acceleration (LWFA). The main limitation we identify in state-of-the-art Particle-In-Cell (PIC) codes is computational load imbalance. We propose an innovative algorithm to deal with this specific issue as well as milestones towards a modern, accurate high-per- formance PIC code for high energy particle acceleration.
A novel parallel technique for Fourier-Galerkin pseudo-spectral methods with applications to two-dimensional Navier-Stokes equations and inviscid Boussinesq approximation equations is presented. It takes the advantage of the programming structure of the phase-shift de-aliased scheme for pseudo-spectral codes, and combines the task-distribution strategy [Yin, Clercx and Montgomery, Comput. Fluids, 33, 509 (2004)] and parallelized Fast Fourier Transform scheme. The performances of the resulting MPI Fortran90 codes with the new procedure on SGI 3800 are reported. For fixed resolution of the same problem, the peak speed of the new scheme can be twice as fast as the old parallel methods. The parallelized codes are used to solve some challenging numerical problems governed by the Navier-Stokes equations and the Boussinesq equations. Two interesting physical problems, namely, the double-valued $omega$-$psi $ structure in two-dimensional decaying turbulence and the collapse of the bubble cap in the Boussinesq simulation, are solved by using the proposed parallel algorithms.
Furthering our understanding of many of todays interesting problems in plasma physics---including plasma based acceleration and magnetic reconnection with pair production due to quantum electrodynamic effects---requires large-scale kinetic simulations using particle-in-cell (PIC) codes. However, these simulations are extremely demanding, requiring that contemporary PIC codes be designed to efficiently use a new fleet of exascale computing architectures. To this end, the key issue of parallel load balance across computational nodes must be addressed. We discuss the implementation of dynamic load balancing by dividing the simulation space into many small, self-contained regions or tiles, along with shared-memory (e.g., OpenMP) parallelism both over many tiles and within single tiles. The load balancing algorithm can be used with three different topologies, including two space-filling curves. We tested this implementation in the code OSIRIS and show low overhead and improved scalability with OpenMP thread number on simulations with both uniform load and severe load imbalance. Compared to other load-balancing techniques, our algorithm gives order-of-magnitude improvement in parallel scalability for simulations with severe load imbalance issues.
A R&D project has been launched in 2009 to address fundamental methods in radiation transport simulation and revisit Geant4 kernel design to cope with new experimental requirements. The project focuses on simulation at different scales in the same experimental environment: this set of problems requires new methods across the current boundaries of condensed-random-walk and discrete transport schemes. An exploration is also foreseen about exploiting and extending already existing Geant4 features to apply Monte Carlo and deterministic transport methods in the same simulation environment. An overview of this new R&D associated with Geant4 is presented, together with the first developments in progress.
We present a particle method for estimating the curvature of interfaces in volume-of-fluid simulations of multiphase flows. The method is well suited for under-resolved interfaces, and it is shown to be more accurate than the parabolic fitting that is employed in such cases. The curvature is computed from the equilibrium positions of particles constrained to circular arcs and attracted to the interface. The proposed particle method is combined with the method of height functions at higher resolutions, and it is shown to outperform the current combinations of height functions and parabolic fitting. The algorithm is conceptually simple and straightforward to implement on new and existing software frameworks for multiphase flow simulations thus enhancing their capabilities in challenging flow problems. We evaluate the proposed hybrid method on a number of two- and three-dimensional benchmark flow problems and illustrate its capabilities on simulations of flows involving bubble coalescence and turbulent multiphase flows.