The control of traffic signals is crucial for improving transportation
efficiency. Recently, learning-based methods, especially Deep Reinforcement
Learning (DRL), garnered substantial success in the quest for more efficient
traffic signal control strategies. However, the design of rewa