Distributed Multi-Agent RL for Routing Optimization