High performance PIC plasma simulation with modern GPUs

Abstract
With the recent Nvidia Tesla V100 a performance of 0.5 TFLOPS was achieved for 3D Particle-In-Cell simulation. The paper includes brief description of the simulation algorithm, the detail of GPU implementation and the performance analysis with different GPUs.