CNN-SVM-based Human-Computer Interaction Model for Automotive Systems in Complex Driving Environments

Mei Gao, Dan Ye, Junjie Zhang

Abstract


In the complex driving environment, with the increase of task difficulty, the change of diversity and relevance, the phenomenon of perceptual mode conflict, strong cognition or increased difficulty of operation appears when drivers deal with tasks, which affects the execution of primary and secondary tasks. The information expressed and transmitted by multimedia technology is real-time, and only with real-time can we interact and transmit information with users. This paper discusses the design of automobile man-machine interaction based on multimedia information acquisition technology in complex driving environment. Based on users' situational awareness, this paper studies users' interactive needs and experiences in different driving situations, and proposes CNN-SVM (Convolutional Neural Networksupport vector machine) emotional perception model. After automatically extracting spectral features through CNN, support vector machine (SVM) is used instead of traditional Softmax classifier to achieve accurate classification of multi class emotions. The experiment focuses on identifying the following core emotion categories: Anger, Neutral, Joy, and Anxiety, and verifies the model's generalization ability in driving scenarios through cross validation. In the spectrum segmentation experiment, the same network structure as CNN Net was used, and SVM was used instead of softmax classifier. CNN Net was used for training each time. After training, use the test sample set to calculate the input features of the softmax classifier, and input the new features into SVM to calculate the classification result of CNN-SVM. Select 500 test set images from the CityScapes dataset for testing, with MIoU (Mean Intersection Point on Consortium) used as the testing metric. The experimental results show that the model has improved the segmentation accuracy of roads, sidewalks, buildings, etc. Compared with several mainstream segmentation algorithms currently available, this algorithm has relatively small improvements for smaller target objects such as lights, signs, vegetables, and riders, with improvements of 1%, 0.5%, and 0.9%, respectively. In driving scenarios, users' judgment of "safety" mainly depends on whether secondary tasks occupy the cognitive and interactive channels of the primary driving task. This article explores how avoiding these two factors can provide users with a sense of security and improve their interaction experience.


Full Text:

PDF

References


AlZu’bi S, Hawashin B, Mujahed M, (2019), An efficient employment of internet of multimedia things in smart and future agriculture. Multimedia Tools and Applications, 78(20): 29581-29605.

Dhokrat, J. G., & Pulgam, N. (2024). A framework for privacy-preserving multiparty computation with homomorphic encryption and zero-knowledge proofs. Informatica, 48(21).

Karadere, G, Y Düzcan, & Yldz, A. R. (2020). Light-weight design of automobile suspension components using topology and shape optimization techniques. Materials Testing, 62(5), 454-458.

Krishan, R, Verma, A, Mishra, S. (2019). Design of multi-machine power system stabilizers with forecast uncertainties in load/generation. IETE Journal of Research, 65(1), 44-57.

Gao, D., Wang, J., & Chai, R. (2024). Intelligent car autonomous driving tracking technology based on fuzzy information and multi-sensor fusion. Informatica, 48(21), 37-50.

Yang, C, Yue, Y, Zhang, J, Wen, M, Wang, D. (2019). Probabilistic reasoning for unique role recognition based on the fusion of semantic-interaction and spatio-temporal features. IEEE Transactions on Multimedia, 21(5), 1195-1208.

Ji, M, Peng, G, He, J, Liu, S, Chen, Z, Li, S. (2021). A two-stage, intelligent bearing-fault-diagnosis method using order-tracking and a one-dimensional convolutional neural network with variable speeds. Sensors, 21(3), 675.

Chen, J, Jia, X. (2020). An approach for assembly process case discovery using multimedia information source. Computers in Industry, 115(1), 103176.

Fang, Y., & Wang, Y. (2024). Cross modal sentiment analysis of image text fusion based on Bi LSTM and B-CNN. Informatica, 48(21), 95-111.

Huang, J, Yan, W, Li, T. H, Liu, S, Li, G. (2020). Learning the global descriptor for 3d object recognition based on multiple views decomposition. IEEE Transactions on Multimedia, 2020(99), 1-1.

Wang, J, Jiang, C, Zhu, H, Yong, R, Hanzo, L. (2018). Internet of vehicles: sensing-aided transportation information collection and diffusion. IEEE Transactions on Vehicular Technology, 67(5), 3813-3825.

Xin, X, Liu, X, Li, K, Xiao, B, Qi, H. (2017). Minimal perfect hashing-based information collection protocol for rfid systems. IEEE Transactions on Mobile Computing, 2017(10), 1-1.

Shiraishi, M, Ashiya, H, Konno, A, Morita, K, Kataoka, S. (2019). Development of real-time collection, integration, and sharing technology for infrastructure damage information. Journal of Disaster Research, 14(2), 333-347.

Ferro, N, Kim, Y, Sanderson, M. (2019). Using collection shards to study retrieval performance effect sizes. ACM Transactions on Information Systems, 37(3), 1-40.

Mori, E, Kelkar, S. (2020). Introduction to the special issue on interface architects: the evolution of human–computer interaction. IEEE Annals of the History of Computing, 42(4), 6-7.

Souza, A, Filho, M. R, Soares, C. (2020). Production and evaluation of an educational process for human-computer interaction (hci) courses. IEEE Transactions on Education, 2020(99), 1-8.

Zhou, Z. Q, Sun, L. (2019). Metamorphic testing of driverless cars. Communications of the ACM, 62(3), 61-67.

Gao, H, Liu, C. H, Wang, W. (2018). Hybrid vehicular crowdsourcing with driverless cars: challenges and a solution. Computer, 51(12), 24-31.

Guo, M, Yu, Z, Xu, Y, Huang, Y, Li, C, Javier García-Haro. (2021). Me-net: a deep convolutional neural network for extracting mangrove using sentinel-2a data. Remote Sensing, 13(7), 1-24.

Yan, S, Jing, L, Wang, H. (2021). A new individual tree species recognition method based on a convolutional neural network and high-spatial resolution remote sensing imagery. Remote Sensing, 13(3), 479.

Al-Hashimy, H. N. H., Hussein, W. N., Al Jubair, A. S., & Yao, J. (2024). Enhancing data integrity in computerized accounting information systems using supervised and unsupervised machine learning algorithms implement A SEM-PLS analysis. Informatica, 48(20), 107-118.




DOI: https://doi.org/10.31449/inf.v49i25.8340

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.