Published February 14, 2026 | Version v1
Publication Open

Predicción del abandono en la plataforma de educación digital ProFuturo utilizando técnicas de Machine Learning y análisis de datos

Description

Predicting dropout rates accurately on digital education platforms such as ProFuturo will enable preventive measures to be taken, significantly reducing dropout rates and optimizing the use of resources. Several authors have addressed this problem using machine learning and artificial intelligence techniques with encouraging results. However, most of the approaches are based only on demographic variables or the course completion certificate discarding relevant information available in Moodle platform.
Furthermore, they obtain moderate success rates and are not easily interpretable in terms of the indicators considered.
In this paper we propose a novel methodology for accurate dropout prediction that takes into account all informative variables from
Moodle. The approach is based on simple machine learning models and maintains high interpretability in terms of input variables.
The experimental results show that a methodology based on Random Forest can achieve high detection probability (91%) without compromising specificity with a value of 88%. Moreover, the application of SHAP algorithm has provided high interpretability to understand the role of different variables.

Files

CameraReady_24.pdf

Files (564.1 kB)

Name Size Download all
md5:11d3fc98431fb89e3fc3e5272efec988
564.1 kB Preview Download