Detecting Unusual Student Academic Grades Using the Isolation Forest Method
Keywords:
Anomalies, Detection, Isolation Forest, Student Grades, educationAbstract
This study aims to detect anomalies in the final grades of students in the Computer Networks course using the Isolation Forest algorithm. The data used comes from student grades for the current semester, including variables such as Discipline, Practice, Final Semester Exam (UAS), and Final Grades. The Isolation Forest algorithm was applied to identify students with grades that are inconsistent compared to their peers. The analysis results show that 10 students were detected to have final grades considered anomalous. These anomalies can be caused by various factors, including personal factors, differences in study effort, external influences, assessment methodology, and possible data errors. The distribution of grades was analyzed using histograms and boxplots, which indicated the presence of outliers in several variables. Correlations between variables were identified through a heatmap, showing that UAS has the highest correlation with Final Grades. Anomaly identification through scatter plot visualization helps in understanding the distribution and detection of anomalies in students' final grades. The conclusions of this study highlight the importance of further investigation to understand the causes of anomalies and provide appropriate interventions. This study also emphasizes the benefits of using data mining techniques in education to improve the quality of assessment and monitoring of students’ academic performance.
References
Anandakrishnan, A., Kumar, S., & Statnikov, A. (2018). Anomaly detection in finance: Editors’ introduction. KDD Workshop on Anomaly Detection in Finance, 1-7. Retrieved from Google Scholar.
Breunig, M. M., Kriegel, H. P., Ng, R. T., & Sander, J. (2000). LOF: Identifying density-based local outliers. In Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, 93-104. Retrieved from Google Scholar.
Campos, G. O., et al. (2016). On the evaluation of unsupervised outlier detection: Measures, datasets, and an empirical study. Data Mining and Knowledge Discovery, 30(4), 891–927. https://doi.org/10.1007/s10618-015-0444-8.
Feng, Y., et al. (2019). Anti-money laundering (AML) research: A system for identification and multi-classification. In WISA 2019, Lecture Notes in Computer Science, 11817, 169–175. https://doi.org/10.1007/978-3-030-30952-7_19.
Hariri, S., Kind, M. C., & Brunner, R. J. (2019). Extended isolation forest. IEEE Transactions on Knowledge and Data Engineering, 33(4), 1479–1489. https://doi.org/10.1109/TKDE.2019.2947676.
Ji, S., Pan, S., Cambria, E., Marttinen, P., & Philip, S. Y. (2021). A survey on knowledge graphs: Representation, acquisition, and applications. IEEE Transactions on Neural Networks and Learning Systems, 33(2), 494–514. https://doi.org/10.1109/TNNLS.2020.3015724.
Khraisat, A., Gondal, I., Vamplew, P., & Kamruzzaman, J. (2019). Survey of intrusion detection systems: Techniques, datasets, and challenges. Cybersecurity, 2(1), 1–22. https://doi.org/10.1186/s42400-019-0038-7.
Liu, F. T., Ting, K. M., & Zhou, Z. H. (2008). Isolation forest. In 2008 Eighth IEEE International Conference on Data Mining, 413–422. IEEE. Retrieved from Google Scholar.
Liu, F. T., Ting, K. M., & Zhou, Z.-H. (2010). On detecting clustered anomalies using SCiForest. In ECML PKDD 2010, Lecture Notes in Artificial Intelligence, 6322, 274–290. https://doi.org/10.1007/978-3-642-15883-4_18.
Marx, M., Krötzsch, M., & Thost, V. (2017). Logic on mars: Ontologies for generalized property graphs. In IJCAI 2017, 1188–1194. Retrieved from Google Scholar.
Ruff, L., Kauffmann, J. R., et al. (2021). A unifying review of deep and shallow anomaly detection. Proceedings of the IEEE, 109(5), 756–795. Retrieved from Google Scholar.
Xu, J., & Zhou, J. (2022). A Fine-Grained Anomaly Detection Method Fusing Isolation Forest and Knowledge Graph Reasoning. In Web Information Systems and Applications, Lecture Notes in Computer Science, 13579, 126-141. Springer, Cham. https://doi.org/10.1007/978-3-031-20309-1_12.
Xu, H., Pang, G., Wang, Y., & Wang, Y. (2022). Deep Isolation Forest for Anomaly Detection. Retrieved from arXiv: https://doi.org/10.48550/arXiv.2206.06602.
Zhang, Y., Zha, D., You, S., Wang, S., Hu, X., & Guo, M. (2023). OptIForest: Optimal Isolation Forest for Anomaly Detection. IJCAI, 2380-2387. Retrieved from https://www.ijcai.org/proceedings/2023/326.
Zimek, A., Schubert, E., & Kriegel, H. P. (2013). A survey on unsupervised outlier detection in high-dimensional numerical data. Statistical Analysis and Data Mining, 5(5), 363–387. https://doi.org/10.1002/sam.11280.
Downloads
Published
Issue
Section
License
Copyright (c) 2024 Hendra Nusa Putra, Nanda Tommy Wirawan (Author)

This work is licensed under a Creative Commons Attribution 4.0 International License.