Classification for Student Performance
Authors
Wang Ziyi
Share
Annotation
Student performance can determine the future path of success of a student . It is essential for us to recognize the factors that influence student performance and predict the student grade level using the existing information. We split the data into the training set and testing set, building up a binary logistic regression to predict the results in the testing set. The results show that our model has good predictability and reaches the AUC value of 0.68. Also, by looking at the coefficient, we figured out several important factors that affect student performance grades.
Keywords
Authors
Wang Ziyi
Share
References:
[1] Wiggins, G. P. (1993). Assessing student performance: Exploring the purpose and limits of testing. Jossey-Bass.
[2] Hilbe, J. M. (2009). Logistic regression models. Chapman and hall/CRC.
[3] Wang, H., Xu, Q., & Zhou, L. (2015). Large unbalanced credit scoring using lasso-logistic regression ensemble. PloS one, 10(2), e0117844.
[4] Narkhede, S. (2018). Understanding auc-roc curve. Towards Data Science, 26, 220-227.
[5] Malhi, A., & Gao, R. X. (2004). PCA-based feature selection scheme for machine defect classification. IEEE transactions on instrumentation and measurement, 53(6), 1517-1525.
[6] Polak, J., & Cook, D. (2021). A Study on Student Performance, Engagement, and Experience With Kaggle InClass data Challenges. Journal of Statistics and Data Science Education, 29(1), 63-70.
[7] Kirasich, K., Smith, T., & Sadler, B. (2018). Random forest vs logistic regression: binary classification for heterogeneous datasets. SMU Data Science Review, 1(3), 9.