Double Deep Q-Network with Experience Replay for Time Dependent Vehicle Routing Problem with Time Windows Under Historical Congestion Constraints. (2026). Informatica, 50(9). https://doi.org/10.31449/inf.v50i9.12122