Published December 13, 2021 | Version 1.0
Dataset Restricted

Predict students' dropout and academic success

  • 1. VALORIZA - Research Center for Endogenous Resource Valorization, Portalegre, Portugal
  • 2. Polythecnic Institute of Portalegre


A dataset created from a higher education institution (acquired from several disjoint databases) related to students enrolled in different undergraduate degrees, such as agronomy, design, education, nursing, journalism, management, social service, and technologies.

The dataset includes information known at the time of student enrollment (academic path, demographics, and social-economic factors) and the students' academic performance at the end of the first and second semesters.

The data is used to build classification models to predict students' dropout and academic success. The problem is formulated as a three category classification task (dropout, enrolled, and graduate) at the end of the normal duration of the course.

We acknowledge support of this work by the program "SATDAP - Capacitação da Administração Pública under grant POCI-05-5762-FSE-000191, Portugal"




The record is publicly accessible, but files are restricted to users with access.