QPACE -- a QCD parallel computer based on Cell processors

Added by Dirk Pleiter

Publication date 2009

Field Informatics Engineering

Language English

Authors H. Baier - H. Boettiger - M. Drochner

High Energy Physics - Lattice; Hardware Architecture

QPACE is a novel parallel computer which has been developed primarily for lattice QCD simulations. The compute power is provided by the IBM PowerXCell 8i processor, an enhanced version of the Cell processor that is used in the PlayStation 3. The QPACE nodes are interconnected by a custom, application-optimized 3-dimensional torus network implemented on an FPGA. To achieve the very high packaging density of 26 TFlops per rack, a new water cooling concept has been developed and successfully realized. In this paper we give an overview of the architecture and highlight some important technical details of the system. Furthermore, we provide initial performance results and report on the installation of 8 QPACE racks providing an aggregate peak performance of 200 TFlops.
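
As a rough illustration of what a 3-dimensional torus topology implies for nearest-neighbour communication, the following minimal Python sketch computes the six neighbours of a node on a periodic 3D grid. It is not taken from the paper: the function name `torus_neighbors`, the coordinate convention, and the grid size in the usage note are assumptions chosen for illustration only, and the actual QPACE network logic lives in FPGA hardware.

```python
from typing import List, Tuple

Coord = Tuple[int, int, int]


def torus_neighbors(node: Coord, dims: Coord) -> List[Coord]:
    """Return the six nearest neighbours of `node` on a periodic 3D grid.

    `dims` gives the (hypothetical) number of nodes along each axis.
    Wrap-around at the edges is what turns a plain 3D mesh into a torus.
    """
    neighbors = []
    for axis in range(3):
        for step in (+1, -1):
            coord = list(node)
            # Modulo arithmetic wraps the coordinate around the ring.
            coord[axis] = (coord[axis] + step) % dims[axis]
            neighbors.append(tuple(coord))
    return neighbors
```

For a hypothetical 4 x 4 x 4 partition, `torus_neighbors((0, 0, 0), (4, 4, 4))` includes `(3, 0, 0)`, the wrap-around neighbour in the negative x direction. Every node has exactly six links, which matches the nearest-neighbour communication pattern of lattice QCD stencils.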

Lattice QCD Applications on QPACE

Y. Nakamura, A. Nobile, D. Pleiter - 2011

QPACE is a novel massively parallel architecture optimized for lattice QCD simulations. A single QPACE node is based on the IBM PowerXCell 8i processor. The nodes are interconnected by a custom 3-dimensional torus network implemented on an FPGA. The compute power of the processor is provided by 8 Synergistic Processing Units. Making efficient use of these accelerator cores in scientific applications is challenging. In this paper we describe our strategies for porting applications to the QPACE architecture and report on performance numbers.

High Energy Physics - Lattice

Parallel Computing for QCD on a Pentium Cluster

X. Q. Luo, E. B. Gregory, J. C. Yang - 2000

Motivated by the computational demands of our research and budgetary constraints which are common to many research institutions, we built a "poor man's supercomputer", a cluster of PC nodes which together can perform parallel calculations at a fraction of the price of a commercial supercomputer. We describe the construction, cost, and performance of our cluster.

High Energy Physics - Lattice

QCD on the Cell Broadband Engine

F. Belletti, G. Bilardi, M. Drochner - 2007

We evaluate IBM's Enhanced Cell Broadband Engine (BE) as a possible building block of a new generation of lattice QCD machines. The Enhanced Cell BE will provide full support of double-precision floating-point arithmetic, including IEEE-compliant rounding. We have developed a performance model and applied it to relevant lattice QCD kernels. The performance estimates are supported by micro- and application-benchmarks that have been obtained on currently available Cell BE-based computers, such as IBM QS20 blades and PlayStation 3. The results are encouraging and show that this processor is an interesting option for lattice QCD applications. For a massively parallel machine on the basis of the Cell BE, an application-optimized network needs to be developed.

High Energy Physics - Lattice

Solving the Dirac equation on QPACE

Andrea Nobile - 2011

We discuss the implementation and optimization challenges for a Wilson-Dirac solver with Clover term on QPACE, a parallel machine based on Cell processors and a torus network. We choose the mixed-precision Schwarz preconditioned FGCR algorithm in order to circumvent network bandwidth and latency constraints, to make efficient use of the multicore parallelism and on-chip memory, and to achieve flexibility in the choice of lattice sizes. We present benchmarks on up to 256 QPACE nodes showing an aggregate sustained performance of about 10 TFlops for the complete solver and very good scaling.

High Energy Physics - Lattice

DD-$\alpha$AMG on QPACE 3

Peter Georg, Daniel Richtmann, Tilo Wettig - 2017

We describe our experience porting the Regensburg implementation of the DD-$\alpha$AMG solver from QPACE 2 to QPACE 3. We first review how the code was ported from the first generation Intel Xeon Phi processor (Knights Corner) to its successor (Knights Landing). We then describe the modifications in the communication library necessitated by the switch from InfiniBand to Omni-Path. Finally, we present the performance of the code on a single processor as well as the scaling on many nodes, where in both cases the speedup factor is close to the theoretical expectations.

High Energy Physics - Lattice; Distributed, Parallel, and Cluster Computing; Computational Physics
