Accelerating seminumerical Fock-exchange calculations using mixed single- and double-precision arithmethic
Open Access
- 7 June 2021
- journal article
- research article
- Published by AIP Publishing in The Journal of Chemical Physics
- Vol. 154 (21), 214116
- https://doi.org/10.1063/5.0045084
Abstract
We investigate the applicability of single-precision (fp32) floating point operations within our linear-scaling, seminumerical exchange method sn-LinK [Laqua et al., J. Chem. Theory Comput. 16, 1456 (2020)] and find that the vast majority of the three-center-one-electron (3c1e) integrals can be computed with reduced numerical precision with virtually no loss in overall accuracy. This leads to a near doubling in performance on central processing units (CPUs) compared to pure fp64 evaluation. Since the cost of evaluating the 3c1e integrals is less significant on graphic processing units (GPUs) compared to CPU, the performance gains from accelerating 3c1e integrals alone is less impressive on GPUs. Therefore, we also investigate the possibility of employing only fp32 operations to evaluate the exchange matrix within the self-consistent-field (SCF) followed by an accurate one-shot evaluation of the exchange energy using mixed fp32/fp64 precision. This still provides very accurate (1.8 µEh maximal error) results while providing a sevenfold speedup on a typical “gaming” GPU (GTX 1080Ti). We also propose the use of incremental exchange-builds to further reduce these errors. The proposed SCF scheme (i-sn-LinK) requires only one mixed-precision exchange matrix calculation, while all other exchange-matrix builds are performed with only fp32 operations. Compared to pure fp64 evaluation, this leads to 4–7× speedups for the whole SCF procedure without any significant deterioration of the results or the convergence behavior.Keywords
Funding Information
- Deutsche Forschungsgemeinschaft (SFB 1309-32587107, EXC2111-390814868)
This publication has 53 references indexed in Scilit:
- Integral partition bounds for fast and effective screening of general one-, two-, and many-electron integralsThe Journal of Chemical Physics, 2019
- Hybrid CPU/GPU Integral Engine for Strong-Scaling Ab Initio MethodsJournal of Chemical Theory and Computation, 2017
- Preselective Screening for Linear-Scaling Exact Exchange-Gradient Calculations for Graphics Processing Units and General Strong-Scaling Massively Parallel CalculationsJournal of Chemical Theory and Computation, 2015
- Pre-selective screening for matrix elements in linear-scaling exact exchange calculationsThe Journal of Chemical Physics, 2013
- Linear scaling exchange gradients for Hartree–Fock and hybrid density functional theoryChemical Physics Letters, 2000
- Linear and sublinear scaling formation of Hartree–Fock-type exchange matricesThe Journal of Chemical Physics, 1998
- Linear scaling computation of the Fock matrix. II. Rigorous bounds on exchange integrals and incremental Fock buildThe Journal of Chemical Physics, 1997
- Linear scaling computation of the Fock matrixThe Journal of Chemical Physics, 1997
- A linear scaling method for Hartree–Fock exchange calculations of large moleculesThe Journal of Chemical Physics, 1996
- Linear scaling computation of the Hartree–Fock exchange matrixThe Journal of Chemical Physics, 1996