Hybrid Optimized Dual-Function Logistic Regression-based Ensemble Architecture for Robust Password Strength Prediction
Abstract
The increasing frequency of cyberattacks and data breaches has made password strength a critical problem to predict. While various Machine Learning (ML) methods have been applied to password strength classification, their performance is often compromised by class imbalance and the inability to tap into complementary model behaviors. In this paper, based on the Password Security Sber Dataset, this study introduces a novel heterogeneous ensemble method with a weighted soft-voting strategy leveraging K-Nearest Neighbors, XGB, CatBoost, and Logistic Regression. Among the most valuable new contributions is the dual utilization of Logistic Regression as a base classifier and a surrogate optimizer in optimizing ensemble weights dynamically to ensure the system's reliability and stability. The methodological pipeline consists of SMOTE-based oversampling for class imbalance handling, feature selection to preserve discriminative password features, and structured hyperparameter tuning for each base learner. LR optimization is incorporated in the ensemble system for regulating weight assignment during soft voting application for optimal predictive performance. Experimental results show that LR alone achieved 98.45% accuracy and 97.53% F1-score, while the optimized ensemble achieved 98.24% accuracy, 97.21% F1-score, and 98.82% precision. Compared to baseline ensembles and traditional models, the new approach demonstrates improved accuracy, generalizability, and computational complexity and marks its pragmatic significance in providing more robust password strength estimation for modern digital systems.DOI:
https://doi.org/10.31449/inf.v49i23.9976Downloads
Additional Files
Published
How to Cite
Issue
Section
License
I assign to Informatica, An International Journal of Computing and Informatics ("Journal") the copyright in the manuscript identified above and any additional material (figures, tables, illustrations, software or other information intended for publication) submitted as part of or as a supplement to the manuscript ("Paper") in all forms and media throughout the world, in all languages, for the full term of copyright, effective when and if the article is accepted for publication. This transfer includes the right to reproduce and/or to distribute the Paper to other journals or digital libraries in electronic and online forms and systems.
I understand that I retain the rights to use the pre-prints, off-prints, accepted manuscript and published journal Paper for personal use, scholarly purposes and internal institutional use.
In certain cases, I can ask for retaining the publishing rights of the Paper. The Journal can permit or deny the request for publishing rights, to which I fully agree.
I declare that the submitted Paper is original, has been written by the stated authors and has not been published elsewhere nor is currently being considered for publication by any other journal and will not be submitted for such review while under review by this Journal. The Paper contains no material that violates proprietary rights of any other person or entity. I have obtained written permission from copyright owners for any excerpts from copyrighted works that are included and have credited the sources in my article. I have informed the co-author(s) of the terms of this publishing agreement.
Copyright © Slovenian Society Informatika







