[1]
J. Lin and T. Ni, “CMMF and STAM-FNet: Multimodal Fusion Architectures for Complex Scene Understanding in Dynamic Environments”, IJCAI, vol. 49, no. 9, Oct. 2025.