DI, Juan. Optimizing Long-Term User Engagement in Short-Video Recommendation via Reinforcement Learning: A Markov Decision Process Framework with Composite Rewards. Informatica, [S. l.], v. 50, n. 13, 2026. DOI: 10.31449/inf.v50i13.13064. Available at: https://www.informatica.si/index.php/informatica/article/view/13064. Accessed: 19 May 2026.