[1]
J. Di, “Optimizing Long-Term User Engagement in Short-Video Recommendation via Reinforcement Learning: A Markov Decision Process Framework with Composite Rewards”, IJCAI, vol. 50, no. 13, May 2026.