Di, J. (2026). Optimizing Long-Term User Engagement in Short-Video Recommendation via Reinforcement Learning: A Markov Decision Process Framework with Composite Rewards. Informatica, 50(13). https://doi.org/10.31449/inf.v50i13.13064