Parsimonious Potential Energy Surface Expansions Using Dictionary Learning with Multipass Greedy Selection
- 16 September 2021
- journal article
- research article
- Published by American Chemical Society (ACS) in The Journal of Physical Chemistry Letters
- Vol. 12 (37), 9169-9174
- https://doi.org/10.1021/acs.jpclett.1c02721
Abstract
Potential energy surfaces fit with basis set expansions have been shown to provide accurate representations of electronic energies and have enabled a variety of high-accuracy dynamics, kinetics, and spectroscopy applications. The number of terms in these expansions scales poorly with system size, a drawback that challenges their use for systems with more than ∼10 atoms. A solution is presented here using dictionary learning. Subsets of the full set of conventional basis functions are optimized using a newly developed multipass greedy regression method inspired by forward and backward selection methods from the statistics, signal processing, and machine learning literatures. The optimized representations have accuracies comparable to the full set but are 1 or more orders of magnitude smaller, and notably, the number of terms in the optimized multipass greedy expansions scales approximately linearly with the number of atoms.Funding Information
- Basic Energy Sciences (ANL FWP 59044, DE-AC02-06CH11357)
This publication has 45 references indexed in Scilit:
- Communication: A benchmark-quality, full-dimensional ab initio potential energy surface for Ar-HOCOThe Journal of Chemical Physics, 2014
- Development of a “First Principles” Water Potential with Flexible Monomers: Dimer Potential Energy Surface, VRT Spectrum, and Second Virial CoefficientJournal of Chemical Theory and Computation, 2013
- Global ab initio ground-state potential energy surface of N4The Journal of Chemical Physics, 2013
- Regression Shrinkage and Selection via The Lasso: A RetrospectiveJournal of the Royal Statistical Society Series B: Statistical Methodology, 2011
- Permutationally invariant potential energy surfaces in high dimensionalityInternational Reviews in Physical Chemistry, 2009
- CUR matrix decompositions for improved data analysisProceedings of the National Academy of Sciences of the United States of America, 2009
- An Improved Approximation Algorithm for the Column Subset Selection ProblemPublished by Society for Industrial & Applied Mathematics (SIAM) ,2009
- On the Optimality of the Backward Greedy Algorithm for the Subset Selection ProblemSIAM Journal on Matrix Analysis and Applications, 2000
- Updating the Inverse of a MatrixSIAM Review, 1989
- The development of a new Morse-oscillator based rotation-vibration Hamiltonian for H3+Journal of Molecular Spectroscopy, 1985