School exercises for predicting university students' academic performance using combined machine learning techniques
Abstract
The purpose of this study is to propose a methodology for developing predictive models of student academic performance using academic exercises completed in class and combined machine learning techniques known as majority voting and stacking. Data were collected from 250 university students in Mexico regarding their assessments of school exercises to develop the models, and performance metrics were obtained through cross-validation. Subsequently, the constructed models were applied to 108 students in a later semester of the same course, and their metrics were calculated. The results obtained through cross-validation show that the stacking technique with the k-nearest neighbors’ method in the second phase has the highest accuracy (69.2%). When predicting the academic performance of 108 students using the developed models, the highest accuracy is obtained with the stacking technique that includes the k-nearest neighbors’ method in the second phase, with a value of 74.1%. The information obtained was collected 17% of the way through the course, facilitating the early detection of students with academic difficulties so that teachers can intervene promptly and improve their performance. It is common for teachers to collect assessments of academic exercises without needing to use more complex data collection tools, which favors the use of this type of methodology for building predictive models.
Downloads
References
Contreras, L. E., Fuentes, H. J. y Rivas, E. (2021). Análisis del rendimiento académico mediante técnicas de aprendizaje automático con métodos de ensamble. Revista Boletín Redipe, 10(13), 171-190. https://doi.org/10.36260/rbr.v10i13.1737
Cruz, L. D. (2024). Ensamble híbrido de métodos para la clasificación morfotaxonómica explicable de macroinvertebrados de agua dulce. Encuentro Internacional De Educación En Ingeniería. https://doi.org/10.26507/paper.3565
Daza, J., Castro, J. E. y Ávila, H. (2024). Optimizando el aprendizaje de los lenguajes de programación. Un enfoque basado en la analítica de datos para los estudiantes de Ingeniería de Sistemas en la Fundación Universitaria Los Libertadores. Perspectivas, 9(24), 234-256. https://doi.org/10.26620/uniminuto.perspectivas.9.24.2024.234-256
Del Carpio, R., (2024). Predicción del rendimiento académico utilizando modelos de aprendizaje automático: Una revisión sistemática de la literatura. 593 Digital Publisher CEIT, 9(6), 1038-1054, https://dialnet.unirioja.es/servlet/articulo?codigo=9842504
García, F. A., Escobar, J. L., Gallegos, C. M. y Hernández, E. S. (2023). El enfoque de aprendizaje conjunto en la detección de fallas en cajas de engranajes. Revista Universidad y Sociedad, 15(3), 325-333. http://scielo.sld.cu/pdf/rus/v15n3/2218-3620-rus-15-03-325.pdf
Gil, V. D. y Quintero, C. (2023). Análisis de variables asociadas al rendimiento académico en cursos universitarios virtuales. Formación universitaria, 16(4), 33-42. https://dx.doi.org/10.4067/s0718-50062023000400033
Nizar, N., Zainudin, A. D., Albada, A. y Shan, C. M. (2024). Forecasting Short-Term FTSE Bursa Malaysia Using WEKA. Information Management and Business Review, 16(2), 104-114. https://doi.org/10.22610/imbr.v16i2(I)S.3773
Parraga, D. (2024). Modelos Predictivos de Rendimiento Académico Universitario Mediante Aprendizaje Automático. Vitalia Revista Científica y Académica, 5(2), 974–991. https://doi.org/10.61368/r.s.d.h.v5i2.204
Salas, R. A., Salas, E. P., Salas, R. D. y Vargas Y. D. (2019). Análisis de la Aplicación Web Para la Estimación Puntual por medio de la Ciencia de Datos. Revista Dilemas Contemporáneos: Educación, Política y Valores, 6(2), Artículo 33. https://dilemascontemporaneoseducacionpoliticayvalores.com/index.php/dilemas/article/view/705
Sarmiento, J. D., Tibaduiza, D. A., Anaya, M., Gómez, J. D., Pérez, M. P., Sanchez, C. E. y Eslava, J. S. (2024). Uso de Machine Learning en la identificación y clasificación de microplásticos enfocado en el poliestireno expandido. En Ingeniería: una transición hacia el futuro. Asociación Colombiana de
Facultades de Ingeniería – ACOFI. https://doi.org/10.26507/paper.4046
Sierra, M. A., Quintana, K. P., Hernández, J. A., Enríquez, L. B., Pérez, M. D. y Arzate, C. (2024). Validación de un modelo de inteligencia artificial para la predicción de la mortalidad del paciente con sepsis. Medicina Interna de México, 40(3), 171-178. https://doi.org/10.24245/mim.v40i3.9023
Timarán, R., Chaves, A. y Ordoñez-Erazo, H. (2023). Decision Tree Algorithm Moderately Coupled to PostgreSQL DBMS. Revista Facultad De Ingeniería, 32(66), e16777. https://revistas.uptc.edu.co/index.php/ingenieria/article/view/16777
Vargas, A. A. y Prieto, J. C. (2024). Predicción del rendimiento académico estudiantil usando algoritmos de aprendizaje supervisado en una universidad de la selva peruana. Revista Amazonía Digital, 3(1), e292. https://revistas.unamad.edu.pe/index.php/rad/article/view/292
Villarrasa, I., García, X., Liébana, E. y Monfort, G. (2024). Predicción del rendimiento académico en educación secundaria mediante el análisis de árboles de decisión. Educación XX1, 27(1), 253-279. https://doi.org/10.5944/educxx1.33351
Yajure, C. A. (2023). Selección del modelo óptimo de predicción de la relación de desempeño de una planta solar fotovoltaica. Un enfoque multicriterio basado en algoritmos de aprendizaje automático. Ciencia, Ingeniería y Aplicaciones, 6(2), 7-29. https://doi.org/10.22206/cyap.2023.v6i2.2935
Zambrano, J. M., Arroyo, M. V., Jalil, N. J. y Castro, I. E. (2024). Perfiles de educación universitaria: Un análisis comparativo. Revista De Ciencias Sociales, 30(2), 291-304. https://dialnet.unirioja.es/servlet/articulo?codigo=9603967

This work is licensed under a Creative Commons Attribution 4.0 International License.
In order to promote the development and dissemination of research in education in Latin America, the Ibero-American Journal for Educational Research and Development (RIDE) adhered to the Budapest Open Access Initiative, which is why it is identified as a Open access publication. This means that any user can read the complete text of the articles, print them, download them, copy them, link them, distribute them and use the contents for other purposes. Creative Cummons licenses allow users to specify the rights to use an open access journal available on the Internet in such a way that users know the rules of publication. Authors who publish in this journal accept the following conditions: Authors they keep the author's rights and give the magazine the right of the first publication, with the work registered with the attribution license of Creative Commons, which allows third parties to use the published material whenever they mention the authorship of the work and the first publication in this The authors can make other independent and additional contractual agreements for the non-exclusive distribution of the version of the article published in this journal (eg, include it in an institutional repository or publish it in a book) as long as they clearly indicate that The work was published for the first time in this magazine. Authors are allowed and recommended to publish their work. low on the Internet (for example on institutional or personal pages) before and during the review and publication process, as it can lead to productive exchanges and to a greater and faster dissemination of the published work
