A General Software Defect-Proneness Prediction Framework
- 28 October 2010
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Software Engineering
- Vol. 37 (3), 356-370
- https://doi.org/10.1109/tse.2010.90
Abstract
BACKGROUND - Predicting defect-prone software components is an economically important activity and so has received a good deal of attention. However, making sense of the many, and sometimes seemingly inconsistent, results is difficult. OBJECTIVE - We propose and evaluate a general framework for software defect prediction that supports 1) unbiased and 2) comprehensive comparison between competing prediction systems. METHOD - The framework comprises 1) scheme evaluation and 2) defect prediction components. The scheme evaluation analyzes the prediction performance of competing learning schemes for given historical data sets. The defect predictor builds models according to the evaluated learning scheme and predicts software defects with new data according to the constructed model. In order to demonstrate the performance of the proposed framework, we use both simulation and publicly available software defect data sets. RESULTS - The results show that we should choose different learning schemes for different data sets (i.e., no scheme dominates), that small details in how evaluations are conducted can completely reverse findings, and last, that our proposed framework is more effective and less prone to bias than previous approaches. CONCLUSIONS - Failure to properly or fully evaluate a learning scheme can be misleading; however, these problems may be overcome by our proposed framework.
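The two components named in the METHOD part can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`evaluate_scheme`, `build_predictor`), the two toy learning schemes, the single size-like metric, the synthetic data, and the use of plain accuracy under k-fold cross-validation are all assumptions made here for illustration. The abstract does not specify the learners, metrics, or evaluation protocol.

```python
from statistics import mean

# Assumption: a "learning scheme" is any function that takes training rows
# [(features, label), ...] and returns a predict(features) -> label function.

def majority_scheme(train):
    """Baseline scheme: always predict the most common training label."""
    labels = [y for _, y in train]
    majority = max(set(labels), key=labels.count)
    return lambda x: majority

def threshold_scheme(train):
    """Toy scheme: flag a module as defect-prone (1) when its first metric
    exceeds the midpoint between the two class means of that metric."""
    pos = [x[0] for x, y in train if y == 1]
    neg = [x[0] for x, y in train if y == 0]
    if not pos or not neg:
        return majority_scheme(train)
    cut = (mean(pos) + mean(neg)) / 2
    return lambda x: 1 if x[0] > cut else 0

def evaluate_scheme(scheme, data, k=3):
    """Scheme evaluation: k-fold cross-validation on historical data only.
    The model is rebuilt inside each fold, so held-out rows never
    influence training."""
    scores = []
    for i in range(k):
        test = data[i::k]
        train = [row for j, row in enumerate(data) if j % k != i]
        predict = scheme(train)
        scores.append(mean(1 if predict(x) == y else 0 for x, y in test))
    return mean(scores)

def build_predictor(schemes, historical, k=3):
    """Defect prediction: select the best-evaluated scheme, then train it
    on all historical data to obtain the final model."""
    best = max(schemes, key=lambda name: evaluate_scheme(schemes[name], historical, k))
    return best, schemes[best](historical)

# Synthetic historical data (hypothetical): one size-like metric per module,
# labeled defect-prone when the metric is above 50.
historical = [((loc,), 1 if loc > 50 else 0) for loc in range(10, 100, 5)]
schemes = {"majority": majority_scheme, "threshold": threshold_scheme}

best_name, predictor = build_predictor(schemes, historical)
print(best_name, predictor((80,)), predictor((20,)))
```

The point of the separation is that scheme selection uses only held-out estimates from historical data, and the final model is built afterwards, so data for new modules plays no part in choosing or fitting the scheme.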