Choosing the Sample Size of a Computer Experiment: A Practical Guide

1 November 2009

journal article
Published by Informa UK Limited in Technometrics

Vol. 51 (4), 366-376
https://doi.org/10.1198/tech.2009.08040

Abstract

We provide reasons and evidence supporting the informal rule that the number of runs for an effective initial computer experiment should be about 10 times the input dimension. Our arguments quantify two key characteristics of computer codes that affect the sample size required for a desired level of accuracy when approximating the code via a Gaussian process (GP). The first characteristic is the total sensitivity of a code output variable to all input variables; the second corresponds to the way this total sensitivity is distributed across the input variables, specifically the possible presence of a few prominent input factors and many impotent ones (i.e., effect sparsity). Both measures relate directly to the correlation structure in the GP approximation of the code. In this way, the article moves toward a more formal treatment of sample size for a computer experiment. The evidence supporting these arguments comes primarily from a simulation study and from specific codes modeling climate and ligand activation of G-protein.
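
As a rough illustration of the rule of thumb stated in the abstract, the following Python sketch computes the suggested number of runs, n = 10d, for an input dimension d and lays out that many points in the unit hypercube. The choice of a random Latin hypercube design is an assumption made here for illustration only; the abstract does not say which design the authors use. The function names `initial_sample_size` and `latin_hypercube`, the parameter `runs_per_input`, and the example value d = 8 are likewise hypothetical.

```python
import numpy as np


def initial_sample_size(d, runs_per_input=10):
    """Informal rule from the abstract: about 10 runs per input dimension."""
    if d < 1:
        raise ValueError("d must be a positive integer")
    return runs_per_input * d


def latin_hypercube(n, d, seed=None):
    """Random Latin hypercube design of n points in [0, 1)^d.

    Assumption: this design is an illustrative choice, not necessarily
    the one used in the article.
    """
    rng = np.random.default_rng(seed)
    # One independent random permutation of the n strata per input column.
    strata = np.column_stack([rng.permutation(n) for _ in range(d)])
    # Place each point uniformly at random inside its stratum.
    return (strata + rng.random((n, d))) / n


d = 8                                   # hypothetical input dimension
n = initial_sample_size(d)              # 10 * 8 = 80 runs
design = latin_hypercube(n, d, seed=0)  # each row is one run of the code
print(n, design.shape)
```

Each row of `design` would then be rescaled to the actual input ranges and passed to the computer code. Per the abstract, whether 10d runs give the desired accuracy depends on the total sensitivity of the output and on how that sensitivity is spread across the inputs, so the count is best read as a starting point for an initial experiment.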

Cited by 464 articles