(1) Di, J. Optimizing Long-Term User Engagement in Short-Video Recommendation via Reinforcement Learning: A Markov Decision Process Framework With Composite Rewards. IJCAI 2026, 50.