Big Data Application in Education: Dropout Prediction in Edx MOOCs
- 18 August 2016
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 440-443
- https://doi.org/10.1109/bigmm.2016.70
Abstract
Educational Data Mining and Learning Analytics are two growing fields of study, trying to make sense of education data and to improve teaching and learning experience. We study dropout prediction in Massively Open Online Courses (MOOCS), where the goal is given student's learning behavior log data in one month, to predict whether students would drop out in next ten days. We collect 39 courses data from XuetangX platform, which is based on the open source Edx platform. We describe our complete approach to cope with drop out prediction task, including data extraction from Edx platform, data preprocessing, feature engineering and performance test on several supervised classification model such as SVM, Logistics Regression, Random Forest and Gradient Boosting Decision Tree. We achieve 88% accuracy in dropout prediction task with GBDT model.Keywords
This publication has 7 references indexed in Scilit:
- Understanding in-video dropouts and interaction peaks inonline lecture videosPublished by Association for Computing Machinery (ACM) ,2014
- Towards Identifying the Resolvability of Threads in MOOCsPublished by Association for Computational Linguistics (ACL) ,2014
- Identifying Student Leaders from MOOC Discussion Forums through Language InfluencePublished by Association for Computational Linguistics (ACL) ,2014
- Deconstructing disengagementPublished by Association for Computing Machinery (ACM) ,2013
- Predicting student failure at school using genetic programming and different data mining approaches with high dimensional and imbalanced dataApplied Intelligence, 2012
- Sensitivity Analysis of k-Fold Cross Validation in Prediction Error EstimationIEEE Transactions on Pattern Analysis and Machine Intelligence, 2009
- Stochastic gradient boosted distributed decision treesPublished by Association for Computing Machinery (ACM) ,2009