An architecture for the deployment of statistical models for the big data era
- 1 December 2016
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 1377-1384
- https://doi.org/10.1109/bigdata.2016.7840745
Abstract
Statistical models are commonly fit to bulk datasets, and they are applied in quasi real-time to previously unseen data. Challenges lie not only in fitting these models to data, but also in keeping track of their development and deployment process. It is common practice to re-engineer data pre-processing functions that were created during model development in order to build a version for deployment that works on streams of data. This approach is error-prone and inefficient. In this paper, we present our Model Deployment and Execution Framework (MDEF), to tackle these challenges in response to the volume, velocity, and variety of big data.
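The re-engineering problem described in the abstract can be illustrated with a minimal sketch. It is not MDEF's API, which this record does not describe. It only shows the general idea of defining a pre-processing step once, per record, and reusing it for both bulk data and a stream. The function names and the record fields (`amount`, `channel`) are hypothetical.

```python
import math
from typing import Dict, Iterable, Iterator, List


def preprocess(record: Dict[str, object]) -> List[float]:
    """Turn one raw record into a feature vector.

    The fields "amount" and "channel" are hypothetical, chosen for illustration.
    """
    amount = math.log1p(float(record["amount"]))
    is_web = 1.0 if record["channel"] == "web" else 0.0
    return [amount, is_web]


def preprocess_batch(records: Iterable[Dict[str, object]]) -> List[List[float]]:
    """Development path: apply the shared function to a bulk dataset."""
    return [preprocess(r) for r in records]


def preprocess_stream(records: Iterable[Dict[str, object]]) -> Iterator[List[float]]:
    """Deployment path: apply the same function lazily, one record at a time."""
    for r in records:
        yield preprocess(r)
```

Because both paths call the same `preprocess` function, the features seen at deployment cannot drift from those used during model development, which is the kind of error a separate re-implementation risks.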