1.
Di J. Optimizing Long-Term User Engagement in Short-Video Recommendation via Reinforcement Learning: A Markov Decision Process Framework with Composite Rewards. IJCAI [Internet]. 2026 May 18 [cited 2026 May 19];50(13). Available from: https://www.informatica.si/index.php/informatica/article/view/13064