[1] X. Li, “Cross-Modal Transformer with Dynamic Attention Fusion for Emotion Recognition in Music via Audio-Lyrics Alignment,” IJCAI, vol. 49, no. 28, Dec. 2025.