Facial Expression Localization and Recognition Using MDMO and Transformer for Mental Health Diagnosis Support

Abstract

With the work and study pressure on people increasing and the importance of mental health problems on the rise, facial expression analysis plays an important role in mental health auxiliary diagnosis and treatment (MHADT). This study proposes a facial expression localization and recognition model that integrates Main Directional Mean Optical Flow (MDMO) with the Transformer architecture. It addresses the problems of insufficient generalization ability and limited temporal modeling ability of traditional methods in facial expression recognition. The study is based on the authoritative Audio/Visual Emotion Challenge 2019 Depression Detection Sub-challenge (AVEC2019 DDS) dataset in the mental health field, which contains 163 training samples, 56 validation samples, and 56 test samples. With Explicit Shape Regression, Local Binary Patterns, Mnemonic Descent Method, Convolutional Neural Network, and benchmark models as comparison objects, it systematically evaluates the model's performance in error, accuracy, and processing speed. The results show that the model achieves the best performance in both facial expression localization and recognition tasks. The validation set errors are 7.26 and 6.85, the localization accuracy reaches 89.6%, and the recognition accuracy reaches 88.9%, which is significantly better than other methods. At the same time, its image processing time is 55 milliseconds (ms) and 44ms, balancing high precision and real-time performance. The study indicates that the fusion of MDMO and Transformer can effectively capture the spatial and temporal features of facial expressions. Thus, this fusion method provides an efficient, stable, and scalable technical solution for emotion recognition in MHADT. This improves both the FER and localization accuracy and effect. Besides, it provides a novel reference in both approaches and application level for the intelligent development of mental health evaluation.

Authors

  • Yan Yan Guangdong Polytechnic of Industry and Commerce, Guangzhou 510510, China
  • Hanjun Li Guangzhou College of Commerce, Guangzhou 511363, Guangdong, China

DOI:

https://doi.org/10.31449/inf.v50i6.12194

Downloads

Published

02/21/2026

How to Cite

Yan, Y., & Li, H. (2026). Facial Expression Localization and Recognition Using MDMO and Transformer for Mental Health Diagnosis Support. Informatica, 50(6). https://doi.org/10.31449/inf.v50i6.12194