Di, Juan. “Optimizing Long-Term User Engagement in Short-Video Recommendation via Reinforcement Learning: A Markov Decision Process Framework With Composite Rewards”. Informatica 50, no. 13 (May 18, 2026). Accessed May 19, 2026. https://www.informatica.si/index.php/informatica/article/view/13064.