Choosing the Sample Size of a Computer Experiment: A Practical Guide
Top Cited Papers
- 1 November 2009
- journal article
- Published by Informa UK Limited in Technometrics
- Vol. 51 (4), 366-376
- https://doi.org/10.1198/tech.2009.08040
Abstract
We provide reasons and evidence supporting the informal rule that the number of runs for an effective initial computer experiment should be about 10 times the input dimension. Our arguments quantify two key characteristics of computer codes that affect the sample size required for a desired level of accuracy when approximating the code via a Gaussian process (GP). The first characteristic is the total sensitivity of a code output variable to all input variables; the second corresponds to the way this total sensitivity is distributed across the input variables, specifically the possible presence of a few prominent input factors and many impotent ones (i.e., effect sparsity). Both measures relate directly to the correlation structure in the GP approximation of the code. In this way, the article moves toward a more formal treatment of sample size for a computer experiment. The evidence supporting these arguments stems primarily from a simulation study and via specific codes modeling climate and ligand activation of G-protein.Keywords
This publication has 16 references indexed in Scilit:
- Bayesian Treed Gaussian Process Models With an Application to Computer ModelingJournal of the American Statistical Association, 2008
- Computer Model Calibration Using High-Dimensional OutputJournal of the American Statistical Association, 2008
- Computer model validation with functional outputThe Annals of Statistics, 2007
- Variable Selection for Gaussian Process Models in Computer ExperimentsTechnometrics, 2006
- Combining Field Data and Computer Simulations for Calibration and PredictionSIAM Journal on Scientific Computing, 2004
- Efficient Global Optimization of Expensive Black-Box FunctionsJournal of Global Optimization, 1998
- Parameter space exploration of an ocean general circulation model using an isopycnal mixing parameterizationJournal of Marine Research, 1994
- Arctic sea ice variability: Model sensitivities and a multidecadal simulationJournal of Geophysical Research: Oceans, 1994
- Bayesian Prediction of Deterministic Functions, with Applications to the Design and Analysis of Computer ExperimentsJournal of the American Statistical Association, 1991
- A Comparison of Three Methods for Selecting Values of Input Variables in the Analysis of Output from a Computer CodeTechnometrics, 1979