Published July 3, 2023 | Version 1.0
Journal article Open

Early prediction of student performance in CS1 programming courses

  • 1. Universidad del Valle
  • 2. Universidad Nacional de Colombia

Description

There is high failure and low academic performance in programming courses. To mitigate these problems it is necessary to predict student performance at an early stage. This will allow teachers to support students with interventions that allow them to achieve the defined learning objectives. Student performance prediction is gaining more and more attention, researchers are working on machine learning features and algorithms that can be used to achieve better results in predictions. This paper proposes a model that predicts student performance in weeks 3, 5 and 7 of a 16-week CS1 programming course.  Starting from the grade, delivery time and the number of attempts generated by the student in programming labs and an exam. Eight classification algorithms were used to train and evaluate the model. The metrics used were: accuracy, recall and F1 score. The algorithm that presented the best results in week 3 was Gradient Boosting Classifier (GBC) with an F1 score of 86%, followed by Random Forest Classifier (RFC) with 83%.

Files

classification_data.csv

Files (348.3 kB)

Name Size Download all
md5:5ec48f4b9a5fae45af41d6e33b1cc680
10.4 kB Preview Download
md5:9c216f29dbe2295582ba6ed0d0124822
337.9 kB Preview Download