Enterprise Financial Fraud Detection using GA-BPNN, Random Forest, and SVM with Multi-Modal Features

Abstract

This paper selected 22 indicators related to financial fraud, taking into account both financial and non-financial indicators from a machine learning perspective, and screened them through the IV value and Spearman correlation coefficient. Then, three algorithms, random forest (RF), support vector machine (SVM), and genetic algorithm-back-propagation neural network (GA-BPNN), were introduced, and experiments were carried out with the data of 784 fraudulent enterprises and 784 non-fraudulent enterprises sourced from the China Stock Market & Accounting Research Database as samples. The results indicated that using both financial and non-financial indicators yielded better identification results compared to using only financial indicators: the recall rate of the RF algorithm increased from 0.5762 to 0.6295, and the area under the receiver operating characteristic curve (AUC) increased from 0.6331 to 0.6874. The GA-BPNN algorithm exhibited the best performance in recognizing financial fraud behaviors, achieving a recall rate of 0.8256 and an AUC of 0.8537, which were the highest among the different algorithms. The shareholding ratio of the largest shareholder was the most important factor in identifying financial fraud, followed by the inventory turnover rate, which requires particular attention. The results demonstrate the usability of the GA-BPNN algorithm and the established indicator system, which can be applied in actual financial scenarios.

Authors

  • Jie Zhang School of Accounting, Guangzhou College of Commerce

DOI:

https://doi.org/10.31449/inf.v50i9.12142

Downloads

Published

03/12/2026

How to Cite

Zhang, J. (2026). Enterprise Financial Fraud Detection using GA-BPNN, Random Forest, and SVM with Multi-Modal Features. Informatica, 50(9). https://doi.org/10.31449/inf.v50i9.12142