Dropping out of school is a problem in most countries worldwide and leads to many adverse effects on the family and society. Therefore, early predicting the risk of dropping out of high school can help educators have interventions and efficient solutions in reducing the high school dropout rate. In this work, we present a machine learning model to predict the risk of school dropout. This model was built from the dataset, including 10,219 student records with 807 dropouts (7.89%) in high schools in Ca Mau province. The results show that the models Naïve Bayes, Decision Tree with Bagging, Random Forest with Bagging give the best results with Area Under the Curve at 83.01%, 80.95%, 83.16%, and accuracy, precision, recall, f1-score are all over 80%. In addition, we also extracted important features playing a decisive contribution in predicting school dropout, including Grade Point Average, school code, Conduct, Age, and Class.
Tạp chí khoa học Trường Đại học Cần Thơ
Lầu 4, Nhà Điều Hành, Khu II, đường 3/2, P. Xuân Khánh, Q. Ninh Kiều, TP. Cần Thơ
Điện thoại: (0292) 3 872 157; Email: tapchidhct@ctu.edu.vn
Chương trình chạy tốt nhất trên trình duyệt IE 9+ & FF 16+, độ phân giải màn hình 1024x768 trở lên