Tang, Q., Zhang, Y., & Gao, Y. (2025). Q-learning and Policy Gradient-Based Reinforcement Learning Method to Decision Making of Phased Array Radar Jamming. Informatica, 49(27). https://doi.org/10.31449/inf.v49i27.7369