ML-Based Analysis of Particle Distributions in High-Intensity Laser Experiments: Role of Binning Strategy

Open Access

25 December 2020

Research article
Published by MDPI AG in Entropy

Vol. 23 (1), 21
https://doi.org/10.3390/e23010021

Abstract

As experimental physics enters the phase of big data processing and statistical inference, the efficient use of machine learning methods may require optimal data preprocessing and, in particular, an optimal balance between detail and noise. In experimental studies of strong-field quantum electrodynamics with intense lasers, this balance concerns the binning of the observed distributions of particles and photons. Here we analyze how binning affects different machine learning methods (Support Vector Machine (SVM), Gradient Boosting Trees (GBT), Fully-Connected Neural Network (FCNN), and Convolutional Neural Network (CNN)) using numerical simulations that mimic the expected properties of upcoming experiments. We find that binning can crucially affect the performance of SVM and GBT and, to a lesser extent, that of FCNN and CNN. This can be interpreted as the latter methods being able to effectively learn the optimal binning and discard unnecessary information. Nevertheless, given limited training sets, the results indicate that efficiency can be increased by optimizing the binning scale along with other hyperparameters. We present specific accuracy measurements that can be useful for planning experiments in this research area.
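
The idea of treating the binning scale as a hyperparameter can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the article: the "shots" are synthetic Gaussian samples rather than laser-plasma simulations, a simple nearest-centroid classifier stands in for SVM, GBT, FCNN, and CNN, and all function names (`histogram`, `simulate_shot`, `accuracy_for_bins`) are hypothetical.

```python
import random

def histogram(values, n_bins, lo=-4.0, hi=4.0):
    """Count values into n_bins equal-width bins on [lo, hi]; outliers go to the edge bins."""
    counts = [0] * n_bins
    width = (hi - lo) / n_bins
    for v in values:
        idx = int((v - lo) / width)
        idx = min(max(idx, 0), n_bins - 1)
        counts[idx] += 1
    return counts

def simulate_shot(label, n_particles, rng):
    """Hypothetical 'experiment': samples from a Gaussian whose mean depends on the class."""
    mean = 0.0 if label == 0 else 0.3  # assumed class separation, not from the article
    return [rng.gauss(mean, 1.0) for _ in range(n_particles)]

def make_dataset(n_shots, n_particles, n_bins, rng):
    """Build (binned distribution, label) pairs with alternating class labels."""
    data = []
    for i in range(n_shots):
        label = i % 2
        shot = simulate_shot(label, n_particles, rng)
        data.append((histogram(shot, n_bins), label))
    return data

def centroids(train):
    """Mean histogram per class (nearest-centroid stand-in for the article's models)."""
    sums, counts = {}, {}
    for x, y in train:
        if y not in sums:
            sums[y] = [0.0] * len(x)
            counts[y] = 0
        sums[y] = [s + xi for s, xi in zip(sums[y], x)]
        counts[y] += 1
    return {y: [s / counts[y] for s in sums[y]] for y in sums}

def predict(cents, x):
    """Return the class whose centroid is closest in squared Euclidean distance."""
    return min(cents, key=lambda y: sum((a - b) ** 2 for a, b in zip(cents[y], x)))

def accuracy_for_bins(n_bins, seed=0, n_train=40, n_test=200, n_particles=200):
    """Train on a small set, test on a larger one, and return the test accuracy."""
    rng = random.Random(seed)
    train = make_dataset(n_train, n_particles, n_bins, rng)
    test = make_dataset(n_test, n_particles, n_bins, rng)
    cents = centroids(train)
    hits = sum(predict(cents, x) == y for x, y in test)
    return hits / len(test)

# Scan the binning scale like any other hyperparameter.
results = {n: accuracy_for_bins(n) for n in (1, 2, 4, 8, 16, 32, 64, 128)}
best_bins = max(results, key=results.get)
for n, acc in results.items():
    print(f"{n:4d} bins: accuracy {acc:.3f}")
print("best number of bins in this toy scan:", best_bins)
```

With a single bin every histogram is identical, so the classifier can do no better than chance; very fine binning leaves few counts per bin and adds noise. Where the best value falls between these extremes depends on the assumed data and training-set size, which is why the scan is done empirically here.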

Funding Information

Ministry of Science and Higher Education of the Russian Federation (075-15-2020-808)
