Calibrated Probabilistic Stacking with Linear Meta-Learning for Admission Outcome Prediction

Khrystyna Zub; Oksana Mulesa

doi:10.31449/inf.v50i13.14817

Abstract

This paper presents a two-stage probabilistic stacking ensemble for admission outcome prediction based on calibrated heterogeneous classifiers and linear decision fusion in probability space. The proposed approach combines HistGradientBoosting, ExtraTrees, and RandomForest models to generate posterior class probabilities, while probability calibration is applied to improve the consistency and comparability of probabilistic estimates. To prevent information leakage, the meta-level training space is formed exclusively from calibrated out-of-fold probability representations. At the second stage, linear meta-models are used to aggregate probabilistic outputs and produce the final decision. The method was evaluated on a real-world dataset collected from the admission campaign of Lviv Polytechnic National University. Experimental studies using holdout validation and stratified cross-validation demonstrate that the proposed ensemble achieves high and stable predictive performance while preserving the quality of probabilistic estimates. In particular, the method reached F1-scores up to 0.990 and MCC values up to 0.979 on the holdout test set, together with low LogLoss values. Comparative analysis with baseline classifiers and standard stacking approaches confirms that calibrated probabilistic fusion improves both classification quality and the reliability of posterior probability estimates in practical decision-support tasks.

Author Biography

Oksana Mulesa, University of Prešov

Department of Physics, Mathematics, and Technologies, Faculty of Humanities and Natural Sciences, University of Presov, Presov, Slovakia

References

[1] B. G. Banik and A. B. Syed, ‘Predicting University Admission Chances Using Machine Learning’, Next-Generation Computing Systems and Technologies, vol. 2, no. 1, pp. 1–9, Mar. 2026, doi: 10.62762/NGCST.2026.766610.

[2] K. K. Reddy, ‘AI-Based University Admission Prediction System Using Random Forest Regression’, IJRASET, vol. 13, no. 12, pp. 3269–3272, Dec. 2025, doi: 10.22214/ijraset.2025.76688.

[3] K. Zub and M. Gregus, ‘Machine Learning-based Classification of Higher Education Admission Success for Informed Decision-Making’, Procedia Computer Science, vol. 272, pp. 534–539, 2025, doi: 10.1016/j.procs.2025.10.243.

[4] P. Golden, K. Mojesh, L. M. Devarapalli, P. N. S. Reddy, S. Rajesh, and A. Chawla, ‘A Comparative Study on University Admission Predictions Using Machine Learning Techniques’, IJSRCSEIT, pp. 537–548, Apr. 2021, doi: 10.32628/CSEIT2172107.

[5] J.-P. Wu, M.-S. Lin, and C.-L. Tsai, ‘A Predictive Model That Aligns Admission Offers with Student Enrollment Probability’, Education Sciences, vol. 13, no. 5, p. 440, Apr. 2023, doi: 10.3390/educsci13050440.

[6] I. Izonin, R. Muzyka, R. Tkachenko, M. Gregus, R. Korzh, and K. Yemets, ‘An enhanced cascade ensemble method for big data analysis’, IJ-AI, vol. 14, no. 2, p. 963, Apr. 2025, doi: 10.11591/ijai.v14.i2.pp963-974.

[7] C. Rokde, J. Chakole, and A. Ukey, ‘Financial Forecasting with Deep Learning Models Based Ensemble Technique in Stock Market Analysis’, IJIEEB, vol. 17, no. 4, pp. 1–13, Aug. 2025, doi: 10.5815/ijieeb.2025.04.01.

[8] S. A. Hamim, R. S. Aftab, M. Ahmed, F. Faiza, and M. F. Mridha, ‘Advanced Heart Attack Prediction Using a Stacked Ensemble Machine Learning Model and Diverse Data Integration’, IJISA, vol. 17, no. 5, pp. 49–67, Oct. 2025, doi: 10.5815/ijisa.2025.05.04.

[9] K. V. Zub, ‘The evaluation of the hei`s entrants admission chances based on the stacking model of the support vectors machine’, Sci. Pap. UAP, vol. 2, no. 63, pp. 168–176, 2021, doi: 10.32403/1998-6912-2021-2-63-168-176.

[10] D. R. Patil, T. M. Pattewar, T. S. Shinde, K. S. Kumavat, and S. N. Deshpande, ‘Stacking and Voting-Based Boosting Ensembles for Robust Malicious URL Classification’, IJCAI, vol. 49, no. 35, Dec. 2025, doi: 10.31449/inf.v49i35.7762.

[11] B. Zadrozny and C. Elkan, ‘Obtaining calibrated probability estimates from decision trees and naive Bayesian classifiers’, in Proceedings of the Eighteenth International Conference on Machine Learning, in ICML ’01. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., Jun. 2001, pp. 609–616.

[12] M. A. Voronenko et al., ‘Using Bayesian methods in the task of modeling the patients’ pharmacoresistance development’, IAPGOS, vol. 12, no. 2, pp. 77–82, Jun. 2022, doi: 10.35784/iapgos.2968.

[13] K. Zub, P. Zhezhnych, and C. Strauss, ‘Two-Stage PNN–SVM Ensemble for Higher Education Admission Prediction’, BDCC, vol. 7, no. 2, p. 83, Apr. 2023, doi: 10.3390/bdcc7020083.

[14] I. Izonin, R. Tkachenko, M. Gregus, Z. Duriagina, and N. Shakhovska, ‘PNN-SVM Approach of Ti-Based Powder’s Properties Evaluation for Biomedical Implants Production’, Computers, Materials & Continua, vol. 71, no. 3, pp. 5933–5947, 2022, doi: 10.32604/cmc.2022.022582.

[15] I. Izonin, A. Trostianchyn, Z. Duriagina, R. Tkachenko, T. Tepla, and N. Lotoshynska, ‘The Combined Use of the Wiener Polynomial and SVM for Material Classification Task in Medical Implants Production’, International Journal of Intelligent Systems and Applications, vol. 10, no. 9, pp. 40–47, Sep. 2018, doi: 10.5815/ijisa.2018.09.05.

[16] A. Kowshir Bitto, Md. H. Imam Bijoy, A. Das, J. Ferdousi, A. Begum, and I. Mahmud, ‘A Novel CatML Stacking Classifier Based Intelligent System for Predicting Postgraduate Admission Chances: A Study on Bangladesh’, IJMECS, vol. 17, no. 4, pp. 82–100, Aug. 2025, doi: 10.5815/ijmecs.2025.04.06.

[17] N. S. K. M. K. Tirumanadham and T. S., ‘Enhancing Student Performance Prediction in ELearning Environments: Advanced EnsembleTechniques and Robust Feature Selection’, IJMECS, vol. 17, no. 2, pp. 67–86, Apr. 2025, doi: 10.5815/ijmecs.2025.02.03.

[18] A. Trostianchyn, Z. Duriagina, I. Izonin, R. Tkachenko, V. Kulyk, and O. Pavliuk, ‘Sm-Co alloys coercivity prediction using stacking heterogeneous ensemble model’, Acta Metall Slovaca, vol. 27, no. 4, pp. 195–202, Dec. 2021, doi: 10.36547/ams.27.4.1173.

[19] I. N. Switrayana and N. Sulistianingsih, ‘Leveraging Convolutional Neural Network to Enhance the Performance of Ensemble Learning in Scientific Article Classification’, IJMECS, vol. 17, no. 6, pp. 146–159, Dec. 2025, doi: 10.5815/ijmecs.2025.06.10.

[20] T. Mortier, V. Bengs, E. Hüllermeier, S. Luca, and W. Waegeman, ‘On the Calibration of Probabilistic Classifier Sets’, in Proceedings of The 26th International Conference on Artificial Intelligence and Statistics, PMLR, Apr. 2023, pp. 8857–8870. Accessed: Apr. 07, 2026. [Online]. Available: https://proceedings.mlr.press/v206/mortier23a.html

[21] D. H. Wolpert, ‘Stacked generalization’, Neural Networks, vol. 5, no. 2, pp. 241–259, Jan. 1992, doi: 10.1016/S0893-6080(05)80023-1.

[22] M. J. Van Der Laan, E. C. Polley, and A. E. Hubbard, ‘Super Learner’, Statistical Applications in Genetics and Molecular Biology, vol. 6, no. 1, Sep. 2007, doi: 10.2202/1544-6115.1309.

[23] J. Platt, ‘Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods’, 1999. Accessed: Apr. 07, 2026. [Online]. Available: https://www.semanticscholar.org/paper/Probabilistic-Outputs-for-Support-vector-Machines-Platt/42e5ed832d4310ce4378c44d05570439df28a393

[24] H.-T. Lin, C.-J. Lin, and R. C. Weng, ‘A note on Platt’s probabilistic outputs for support vector machines’, Mach Learn, vol. 68, no. 3, pp. 267–276, Aug. 2007, doi: 10.1007/s10994-007-5018-6.

[25] A. Niculescu-Mizil and R. Caruana, ‘Predicting good probabilities with supervised learning’, in Proceedings of the 22nd international conference on Machine learning - ICML ’05, Bonn, Germany: ACM Press, 2005, pp. 625–632. doi: 10.1145/1102351.1102430.

[26] B. Zadrozny and C. Elkan, ‘Transforming classifier scores into accurate multiclass probability estimates’, in Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, Edmonton Alberta Canada: ACM, Jul. 2002, pp. 694–699. doi: 10.1145/775047.775151.

[27] V. Jensen, F. M. Bianchi, and S. N. Anfinsen, ‘Ensemble Conformalized Quantile Regression for Probabilistic Time Series Forecasting’, IEEE Trans. Neural Netw. Learning Syst., vol. 35, no. 7, pp. 9014–9025, Jul. 2024, doi: 10.1109/TNNLS.2022.3217694.

[28] F. Ren, Y. Li, and M. Hu, ‘Multi-classifier ensemble based on dynamic weights’, Multimed Tools Appl, vol. 77, no. 16, pp. 21083–21107, Aug. 2018, doi: 10.1007/s11042-017-5480-5.

Calibrated Probabilistic Stacking with Linear Meta-Learning for Admission Outcome Prediction

Abstract

Author Biography

References

Authors

DOI:

Keywords:

Downloads

Published

Issue

Section

License

How to Cite

Developed By

Information