An Exploration of Crime Prediction Using Data Mining on Open Data
- 19 September 2017
- journal article
- research article
- Published by World Scientific Pub Co Pte Ltd in International Journal of Information Technology & Decision Making
- Vol. 16 (05), 1155-1181
- https://doi.org/10.1142/s0219622017500250
Abstract
The increase in crime data recording coupled with data analytics resulted in the growth of research approaches aimed at extracting knowledge from crime records to better understand criminal behavior and ultimately prevent future crimes. While many of these approaches make use of clustering and association rule mining techniques, there are fewer approaches focusing on predictive models of crime. In this paper, we explore models for predicting the frequency of several types of crimes by LSOA code (Lower Layer Super Output Areas — an administrative system of areas used by the UK police) and the frequency of anti-social behavior crimes. Three algorithms are used from different categories of approaches: instance-based learning, regression and decision trees. The data are from the UK police and contain over 600,000 records before preprocessing. The results, looking at predictive performance as well as processing time, indicate that decision trees (M5P algorithm) can be used to reliably predict crime frequency in general as well as anti-social behavior frequency.Keywords
Funding Information
- University of Portsmouth (GB), Research Development Initiative
This publication has 44 references indexed in Scilit:
- Science, politics, and crime prevention: Toward a new crime policyJournal of Criminal Justice, 2012
- FAMCDM: A fusion approach of MCDM methods to rank multiclass classification algorithmsOmega, 2011
- An Enhanced Algorithm to Predict a Future Crime using Data MiningInternational Journal of Computer Applications, 2011
- Self-Exciting Point Process Modeling of CrimeJournal of the American Statistical Association, 2011
- A brief history of the analysis of crime concentrationEuropean Journal of Applied Mathematics, 2010
- Evidence-Based Public Policy Options to Reduce Crime and Criminal Justice Costs: Implications in Washington StateVictims & Offenders, 2009
- Early Risk Factors for Violence in Colombian AdolescentsAmerican Journal of Psychiatry, 2003
- A comparative analysis of methods for pruning decision treesIEEE Transactions on Pattern Analysis and Machine Intelligence, 1997
- Local Algorithms for Pattern Recognition and Dependencies EstimationNeural Computation, 1993
- Locally Weighted Regression: An Approach to Regression Analysis by Local FittingJournal of the American Statistical Association, 1988