Hybrid pooled–unpooled design for cost‐efficient measurement of biomarkers

9 February 2010

journal article
research article
Published by Wiley in Statistics in Medicine

Vol. 29 (5), 597-613
https://doi.org/10.1002/sim.3823

Abstract

Evaluating biomarkers in epidemiological studies can be expensive and time consuming. Many investigators use techniques such as random sampling or pooling biospecimens in order to cut costs and save time on experiments. Commonly, analyses based on pooled data are strongly restricted by distributional assumptions that are challenging to validate because of the pooled biospecimens. Random sampling provides data that can be easily analyzed. However, random sampling methods are not optimal cost‐efficient designs for estimating means. We propose and examine a cost‐efficient hybrid design that involves taking a sample of both pooled and unpooled data in an optimal proportion in order to efficiently estimate the unknown parameters of the biomarker distribution. In addition, we find that this design can be used to estimate and account for different types of measurement and pooling error, without the need to collect validation data or repeated measurements. We show an example where application of the hybrid design leads to minimization of a given loss function based on variances of the estimators of the unknown parameters. Monte Carlo simulation and biomarker data from a study on coronary heart disease are used to demonstrate the proposed methodology. Published in 2010 by John Wiley & Sons, Ltd.

Keywords

Funding Information

National Institutes of Health, Eunice Kennedy Shriver National Institute of Child Health and Human Development

This publication has 29 references indexed in Scilit:

To pool or not to pool, from whether to when: applications of pooling to biospecimens subject to a limit of detection
Paediatric and Perinatal Epidemiology, 2008
Estimation of ROC curves based on stably distributed biomarkers subject to measurement error and pooling mixtures
Statistics in Medicine, 2007
Pooling biospecimens and limits of detection: effects on ROC curve analysis
Biostatistics, 2006
Effect of pooling samples on the efficiency of comparative studies using microarrays
Bioinformatics, 2005
A New Method for Dealing with Measurement Error in Explanatory Variables of Regression Models
Biometrics, 2004
ROC curve analysis for biomarkers based on pooled assessments
Statistics in Medicine, 2003
The effect of random measurement error on receiver operating characteristic (ROC) curves
Statistics in Medicine, 2000
Statistical methods to assess and minimize the role of intra-individual variability in obscuring the relationship between dietary lipids and serum cholesterol
Journal of Chronic Diseases, 1978
Two-stage sampling with exchangeable prior distributions
Biometrika, 1977

Cited by 30 articles