Di, Juan. 2026. “Optimizing Long-Term User Engagement in Short-Video Recommendation via Reinforcement Learning: A Markov Decision Process Framework With Composite Rewards”. Informatica 50 (13). https://doi.org/10.31449/inf.v50i13.13064.