TANG, Qimeng; ZHANG, Yuhang; GAO, Yanbin. Q-learning and Policy Gradient-Based Reinforcement Learning Method to Decision Making of Phased Array Radar Jamming. Informatica, [S. l.], v. 49, n. 27, 2025. DOI: 10.31449/inf.v49i27.7369. Available at: https://www.informatica.si/index.php/informatica/article/view/7369. Accessed: 24 Jan. 2026.