We present the Lattice QCD application CL2QCD, which is based on OpenCL and can be utilized to run on Graphic Processing Units as well as on common CPUs. We focus on implementation details as well as performance results of selected features. CL2QCD has been successfully applied in LQCD studies at finite temperature and density and is available at http://code.compeng.uni-frankfurt.de/projects/clhmc .
QPACE is a novel massively parallel architecture optimized for lattice QCD simulations. A single QPACE node is based on the IBM PowerXCell 8i processor. The nodes are interconnected by a custom 3-dimensional torus network implemented on an FPGA. The compute power of the processor is provided by 8 Synergistic Processing Units. Making efficient use of these accelerator cores in scientific applications is challenging. In this paper we describe our strategies for porting applications to the QPACE architecture and report on performance numbers.
We report on coding and performance of our polynomial hybrid Monte Carlo program on the Earth Simulator. At present the entire program achieves 25--40% efficiency. An analysis of overheads shows that a tuning of inter-node communications is required for further improvement.
We present preliminary lattice results for the nonperturbative tensor structure of the vector and axial-vector quark-antiquark vertices in QCD. Our lattice calculations are for $N_f=2$ mass-degenerate Wilson fermion flavors whose quark mass values include an almost physical one. We compare our lattice results with the corresponding continuum solutions of the inhomogeneous Bethe-Salpeter equations in a rainbow-ladder truncation. We find similarities in the momentum dependencies of the form factors but also clear deviations at low momentum.
This talk gives an overview, aimed at non-experts, of the recent progress on the studies of technicolor models on the lattice. Phenomenologically successful technicolor models require walking coupling; thus, an emphasis is put on the determination of the beta-function of various models. As a case study we consider SU(2) gauge field theory with two adjoint representation fermions, so-called minimal walking technicolor theory.
Recently Arm introduced a new instruction set called Scalable Vector Extension (SVE), which supports vector lengths up to 2048 bits. While SVE hardware will not be generally available until about 2021, we believe that future SVE-based architectures will have great potential for Lattice QCD. In this contribution we discuss key aspects of SVE and describe how we implemented SVE in the Grid Lattice QCD framework.