Published August 2, 2023 | Version 1.0
Journal article Open

Predicting the final grade using a machine learning regression model: insights from fifty percent of total course grades in CS1 courses

  • 1. Universidad de San Buenaventura
  • 2. Universidad del Valle

Description

This paper introduces a model for accurately predicting students' final grades in the CS1 course by utilizing their grades from the first half of the course. The methodology includes three phases: training, testing, and validation, employing four regression algorithms: AdaBoost, Random Forest, Support Vector Regression, and XGBoost. Notably, the SVR algorithm outperformed the others, achieving an impressive R-squared (R2) value ranging from 72% to 91%. The discussion section focuses on four crucial aspects: the selection of data features and the percentage of course grades used for training, the comparison between predicted and actual values to demonstrate reliability, and the model's performance compared to existing literature models, highlighting its effectiveness.

Notes

.

Files

data_stutent_predict_EN.csv

Files (343.8 kB)

Name Size Download all
md5:a42a538d37896c2713f23166171c644b
26.5 kB Preview Download
md5:f17675105306f266f0cb05d8e013ca93
317.4 kB Preview Download