Tang, Qimeng, Yuhang Zhang, and Yanbin Gao. “Q-Learning and Policy Gradient-Based Reinforcement Learning Method to Decision Making of Phased Array Radar Jamming”. Informatica 49, no. 27 (December 20, 2025). Accessed January 24, 2026. https://www.informatica.si/index.php/informatica/article/view/7369.