Toward exascale whole-device modeling of fusion devices: Porting the GENE gyrokinetic microturbulence code to GPU

1 June 2021

journal article
research article
Published by AIP Publishing in Physics of Plasmas

Vol. 28 (6), 062501
https://doi.org/10.1063/5.0046327

Abstract

GENE solves the five-dimensional gyrokinetic equations to simulate the development and evolution of plasma microturbulence in magnetic fusion devices. The plasma model used is close to first principles and computationally very expensive to solve in the relevant physical regimes. In order to use the emerging computational capabilities to gain new physics insights, several new numerical and computational developments are required. Here, we focus on the fact that it is crucial to efficiently utilize GPUs (graphics processing units) that provide the vast majority of the computational power on such systems. In this paper, we describe the various porting approaches considered and given the constraints of the GENE code and its development model, justify the decisions made, and describe the path taken in porting GENE to GPUs. We introduce a novel library called gtensor that was developed along the way to support the process. Performance results are presented for the ported code, which in a single node of the Summit supercomputer achieves a speed-up of almost

15 \times

compared to running on central processing unit (CPU) only. Typical GPU kernels are memory-bound, achieving about 90% of peak. Our analysis shows that there is still room for improvement if we can refactor/fuse kernels to achieve higher arithmetic intensity. We also performed a weak parallel scalability study, which shows that the code runs well on a massively parallel system, but communication costs start becoming a significant bottleneck.

Keywords

Funding Information

U.S. Department of Energy (17-SC-20-SC)
U.S. Department of Energy (DE-AC05-00OR22725)

This publication has 31 references indexed in Scilit:

Numerical reproducibility for the parallel reduction on multi- and many-core architectures
Parallel Computing, 2015
Multiscale Nature of the Dissipation Range in Gyrokinetic Simulations of Alfvénic Turbulence
Physical Review Letters, 2015
A flux-matched gyrokinetic analysis of DIII-D L-mode turbulence
Physics of Plasmas, 2014
Kokkos: Enabling manycore performance portability through polymorphic memory access patterns
Journal of Parallel and Distributed Computing, 2014
MAGNETIC RECONNECTION TURBULENCE IN STRONG GUIDE FIELDS: BASIC PROPERTIES AND APPLICATION TO CORONAL HEATING
The Astrophysical Journal Supplement Series, 2014
The global version of the gyrokinetic turbulence code GENE
Journal of Computational Physics, 2011
Nonlinear quasisteady state benchmark of global gyrokinetic codes
Physics of Plasmas, 2010
Foundations of nonlinear gyrokinetic theory
Reviews of Modern Physics, 2007
Electron temperature gradient driven turbulence
Physics of Plasmas, 2000
Computational design for long-term numerical integration of the equations of fluid motion: Two-dimensional incompressible flow. Part I
Journal of Computational Physics, 1966

Cited by 8 articles