A Systematic Literature Review on Fault Prediction Performance in Software Engineering
Top Cited Papers
- 6 October 2011
- journal article
- review article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Software Engineering
- Vol. 38 (6), 1276-1304
- https://doi.org/10.1109/tse.2011.103
Abstract
Background: The accurate prediction of where faults are likely to occur in code can help direct test effort, reduce costs, and improve the quality of software. Objective: We investigate how the context of models, the independent variables used, and the modeling techniques applied influence the performance of fault prediction models. Method: We used a systematic literature review to identify 208 fault prediction studies published from January 2000 to December 2010. We synthesize the quantitative and qualitative results of 36 studies which report sufficient contextual and methodological information according to the criteria we develop and apply. Results: The models that perform well tend to be based on simple modeling techniques such as Naive Bayes or Logistic Regression. Combinations of independent variables have been used by models that perform well. Feature selection has been applied to these combinations when models are performing particularly well. Conclusion: The methodology used to build models seems to be influential to predictive performance. Although there are a set of fault prediction studies in which confidence is possible, more studies are needed that use a reliable methodology and which report their context, methodology, and performance comprehensively.Keywords
This publication has 100 references indexed in Scilit:
- On the ability of complexity metrics to predict fault-prone classes in object-oriented systemsJournal of Systems and Software, 2010
- What’s up with software metrics? – A preliminary mapping studyJournal of Systems and Software, 2010
- A systematic and comprehensive investigation of methods to build and evaluate fault prediction modelsJournal of Systems and Software, 2010
- Increasing diversity: Natural language measures for software fault predictionJournal of Systems and Software, 2009
- The effectiveness of software metrics in identifying error-prone classes in post-release software evolution processJournal of Systems and Software, 2008
- Predicting defect-prone software modules using support vector machinesJournal of Systems and Software, 2008
- Mining software repositories for comprehensible software fault prediction modelsJournal of Systems and Software, 2008
- Applying machine learning to software fault-proneness predictionJournal of Systems and Software, 2008
- Practical assessment of the models for identification of defect-prone classes in object-oriented commercial systems using design metricsJournal of Systems and Software, 2003
- The prediction of faulty classes using object-oriented design metricsJournal of Systems and Software, 2001