Robust Text Classification via Improved CNN, Unbalanced BiLSTM, and Multi-Head Attention

Abstract

As one of the core tasks of natural language processing, text classification generally faces the problems of insufficient global semantic capture and limited feature-focusing ability when processing long texts or complex semantics. To address this issue, a deep learning model is proposed that integrates an improved convolutional neural network, an unbalanced bidirectional long short-term memory network, and a multi-head attention mechanism. The unbalanced bidirectional long short-term memory network captures global semantic information, while the multi-head attention mechanism dynamically focuses on key features to enhance the model's adaptability to classification tasks. The model's performance is validated through experiments on the AG News (short text) and IMDb (long text) datasets. The results show that in short text classification, the proposed method achieves an accuracy of 96% with a classification error rate of only 1.46%. In long text classification, it achieves an area under the curve (AUC) of 0.98. In adversarial attack testing, the model attains accuracy rates of 92.85% and 90.63% on adversarial samples generated by different methods, with the lowest robustness degradation rates of 3.72% and 5.49%, respectively. In cross-domain generalization testing, it produces the fewest classification errors and exhibits superior cross-domain adaptability. These results confirm the high performance, robustness, and wide applicability of the method. The research indicates that this approach can effectively improve text classification performance and provide new solutions for natural language processing tasks in long text and multi-category scenarios.
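For readers who want a concrete picture of how the three components named in the abstract fit together, the sketch below is a minimal, hypothetical PyTorch arrangement of a convolutional feature extractor, a bidirectional LSTM, and multi-head attention over its states. It is not the authors' implementation: the class name `CNNBiLSTMAttentionClassifier`, all layer sizes, and the mean-pooling readout are illustrative assumptions.

```python
import torch
import torch.nn as nn


class CNNBiLSTMAttentionClassifier(nn.Module):
    """Illustrative sketch: CNN -> BiLSTM -> multi-head attention -> classifier."""

    def __init__(self, vocab_size, embed_dim=128, conv_channels=128,
                 lstm_hidden=96, num_heads=4, num_classes=2):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        # 1-D convolution over token positions extracts local n-gram features.
        self.conv = nn.Conv1d(embed_dim, conv_channels, kernel_size=3, padding=1)
        # Bidirectional LSTM captures global forward and backward context.
        self.bilstm = nn.LSTM(conv_channels, lstm_hidden,
                              batch_first=True, bidirectional=True)
        # Multi-head self-attention re-weights the BiLSTM states so the
        # classifier can focus on the most informative positions.
        self.attention = nn.MultiheadAttention(embed_dim=2 * lstm_hidden,
                                               num_heads=num_heads,
                                               batch_first=True)
        self.classifier = nn.Linear(2 * lstm_hidden, num_classes)

    def forward(self, token_ids):
        x = self.embedding(token_ids)                 # (batch, seq, embed_dim)
        x = torch.relu(self.conv(x.transpose(1, 2)))  # (batch, channels, seq)
        x = x.transpose(1, 2)                         # (batch, seq, channels)
        x, _ = self.bilstm(x)                         # (batch, seq, 2*hidden)
        x, _ = self.attention(x, x, x)                # self-attention over positions
        pooled = x.mean(dim=1)                        # average-pool attended states
        return self.classifier(pooled)                # (batch, num_classes)


if __name__ == "__main__":
    # Toy usage: 4-way classification (e.g., AG News has four categories).
    model = CNNBiLSTMAttentionClassifier(vocab_size=20000, num_classes=4)
    dummy_batch = torch.randint(1, 20000, (8, 64))    # 8 sequences of 64 token ids
    print(model(dummy_batch).shape)                   # torch.Size([8, 4])
```

The division of labor mirrors the abstract's description: convolution for local features, the BiLSTM for global semantics, and attention for dynamic feature focusing; the specific "improved" and "unbalanced" modifications described in the paper are not reproduced here.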

Authors

  • Yan Liang
  • Jiabin Liu

DOI:

https://doi.org/10.31449/inf.v49i35.11100

Published

12/16/2025

How to Cite

Liang, Y., & Liu, J. (2025). Robust Text Classification via Improved CNN, Unbalanced BiLSTM, and Multi-Head Attention. Informatica, 49(35). https://doi.org/10.31449/inf.v49i35.11100