Early Warning of Financial Crises in Manufacturing Using SMOTE-Tomek Random Forest and Sentiment-Enhanced Indicators
Abstract
In order to improve the accuracy of financial crisis warning in the manufacturing industry and solve the problems of single indicators and insufficient ability to handle imbalanced data in traditional models, a warning system integrating traditional financial indicators and text big data indicators has been studied and constructed. The synthetic minority oversampling technique Tomek link random forest (SMOTE-Tomek-RF) model for early warning is adopted. Moreover, using 21 manufacturing enterprises listed on the Shanghai Stock Exchange A-shares as samples and based on 22 warning indicators, core variables are selected through random forest (RF) feature selection to compare the warning performance of RF, SMOTE-RF, single decision tree (DT), and the proposed SMOTE-Tomek-RF model. The results showed that the importance scores of emotional inclination and popularity were 0.052 and 0.047, respectively. Both scores were higher than the threshold and were ranked high, effectively supplementing the information. The predictive model proposed by the research had a subject area under the working curve (AUC) of 0.968, an F1 score of 84.97%, and a G-Mean of 90.11%. The AUC of the traditional RF model, SMOTE-RF model, and DT model were only 0.934, 0.953, and 0.943, respectively. In addition, the prediction accuracy for healthy and crisis firms after combining text big data amounted to 100% and 92.86%, respectively. In summary, the prediction model can effectively deal with the data imbalance problem and improve the precision of early warning. This method provides a reliable method for financial crisis early warning in manufacturing industry, which is of great significance for enterprise risk control and investor decision-making.DOI:
https://doi.org/10.31449/inf.v49i32.11034Downloads
Published
How to Cite
Issue
Section
License
I assign to Informatica, An International Journal of Computing and Informatics ("Journal") the copyright in the manuscript identified above and any additional material (figures, tables, illustrations, software or other information intended for publication) submitted as part of or as a supplement to the manuscript ("Paper") in all forms and media throughout the world, in all languages, for the full term of copyright, effective when and if the article is accepted for publication. This transfer includes the right to reproduce and/or to distribute the Paper to other journals or digital libraries in electronic and online forms and systems.
I understand that I retain the rights to use the pre-prints, off-prints, accepted manuscript and published journal Paper for personal use, scholarly purposes and internal institutional use.
In certain cases, I can ask for retaining the publishing rights of the Paper. The Journal can permit or deny the request for publishing rights, to which I fully agree.
I declare that the submitted Paper is original, has been written by the stated authors and has not been published elsewhere nor is currently being considered for publication by any other journal and will not be submitted for such review while under review by this Journal. The Paper contains no material that violates proprietary rights of any other person or entity. I have obtained written permission from copyright owners for any excerpts from copyrighted works that are included and have credited the sources in my article. I have informed the co-author(s) of the terms of this publishing agreement.
Copyright © Slovenian Society Informatika







