Progress towards accelerating HOMME on hybrid multi-core systems
- 16 November 2012
- journal article
- research article
- Published by SAGE Publications in The International Journal of High Performance Computing Applications
- Vol. 27 (3), 335-347
- https://doi.org/10.1177/1094342012462751
Abstract
The suitability of a spectral element based dynamical core (HOMME) within the Community Atmospheric Model (CAM) for GPU-based architectures is examined and initial performance results are reported. This work was done within a project to enable CAM to run at high resolution on next-generation, multi-petaflop systems. The dynamical core is the present focus because it dominates the performance profile of our target problem. HOMME enjoys good scalability due to its underlying cubed-sphere mesh with full two-dimensional decomposition and the localization of all computational work within each element. The thread blocking and code changes that allow HOMME to effectively use GPUs are described along with a rewritten vertical remapping scheme, which improves performance on both CPUs and GPUs. Validation of results in the full HOMME model is also described. We demonstrate that the most expensive kernel in the model executes more than three times faster on the GPU than the CPU. These improvements are expected to provide improved efficiency when incorporated into the full model that has been configured for the target problem. Remaining issues affecting performance include optimizing the boundary exchanges for the case of multiple spectral elements being computed on the GPU.Keywords
This publication has 14 references indexed in Scilit:
- The Community Climate System Model Version 4Journal of Climate, 2011
- GPU Computing for Atmospheric ModelingComputing in Science & Engineering, 2010
- GPU ACCELERATION OF NUMERICAL WEATHER PREDICTIONParallel Processing Letters, 2008
- Petascale atmospheric models for the Community Climate System Model: new developments and evaluation of scalable dynamical coresJournal of Physics: Conference Series, 2008
- Scalable Parallel Programming with CUDAQueue, 2008
- The Parabolic Spline Method (PSM) for conservative transport problemsInternational Journal for Numerical Methods in Fluids, 2005
- A monotonic and positive–definite filter for a Semi‐Lagrangian Inherently Conserving and Efficient (SLICE) schemeQuarterly Journal of the Royal Meteorological Society, 2005
- High-Resolution Mesh Convergence Properties and Parallel Efficiency of a Spectral Element Atmospheric Dynamical CoreThe International Journal of High Performance Computing Applications, 2005
- The Accumulation of Rounding Errors and Port Validation for Global Atmospheric ModelsSIAM Journal on Scientific Computing, 1997
- An Energy and Angular-Momentum Conserving Vertical Finite-Difference Scheme and Hybrid Vertical CoordinatesMonthly Weather Review, 1981