Comparing Random Forest and Logistic Regression for Predicting Student Completion in Online University Courses Using Behavioral Data

Muhamad Irfan; Abdul Sattar; Ahmad Sher; Muhamad Ijaz

doi:10.63913/ail.v1i1.2

Authors

Muhamad Irfan Bahauddin Zakariya University
Abdul Sattar Bahauddin Zakariya University
Ahmad Sher Bahauddin Zakariya University
Muhamad Ijaz Bahauddin Zakariya University

DOI:

https://doi.org/10.63913/ail.v1i1.2

Keywords:

predictive analytics, random forest, logistic regression, online education, student retention

Abstract

This paper compares the performance of two machine learning algorithms, Random Forest and Logistic Regression, in predicting student course completion in online university courses using behavioral data. Behavioral data, including interaction logs and submission records, has proven to be crucial in identifying students at risk of non-completion. The study evaluates the models using standard classification metrics such as accuracy, precision, recall, and F1-score, based on real-world data from online courses. Both models demonstrate exceptionally high predictive accuracy, with Logistic Regression achieving perfect classification and Random Forest closely following. While Logistic Regression is favored for its simplicity and interpretability, Random Forest excels in handling complex, non-linear relationships within the data. The analysis of feature importance reveals that student engagement, particularly through viewing and passing course materials, is a strong predictor of course completion. These findings offer significant practical implications for online education, supporting early interventions to enhance student retention. However, limitations such as the absence of certain behavioral data and the linear assumption in Logistic Regression suggest areas for future research. Expanding the dataset to include discussion forums, peer interactions, or additional machine learning models may provide deeper insights into improving student success in online courses.

ISSN 3089-3690 (Online)
Organizer / Collaboration	:	Fakultas Sains dan Teknologi UIN Syarif Hidayatullah Jakarta
Published by	:	Bright Publisher
Website	:	ail.mbicore.com
Mailing Address	:	Graha Permata Estate, Jl. HM Bahrun Blok H9, Sokayasa, Berkoh, Kec. Purwokerto Tim., Kabupaten Banyumas, Jawa Tengah 53146
Email	:	arif@amikompurwokerto.ac.id (principal contact)
		editor@ail.mbicore.com (managing editor)

Comparing Random Forest and Logistic Regression for Predicting Student Completion in Online University Courses Using Behavioral Data

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Published By

Make a Submission

ISSN

Quick Menu

Recommended Tools

Visitor Stats