Skip to content

Latest commit

 

History

History
63 lines (50 loc) · 3.38 KB

README.md

File metadata and controls

63 lines (50 loc) · 3.38 KB

Deep RL for traffic signal control

License: MIT

This repo implements start-of-the-art mutli-agent (decentralized) deep RL algorithms for large-scale traffic signal control in SUMO-simulated environments.

Available cooperation levels:

  • Centralized: a global agent that makes global control w/ global observation, reward.
  • Decentralized: multiple local agents that make local control independently w/ neighborhood information sharing.

Available NN layers: Fully-connected, LSTM.

Available algorithms: IQL, IA2C, IA2C with stabilization (called MA2C).

Available environments:

Requirements

Usages

First define all hyperparameters in a config file under [config_dir], and create the base directory of experiements [base_dir]. Before the training, SUMO xml files have to be generated by calling build_file.py under [environment_dir]/data/.

To train a new agent, run

python3 main.py --base-dir [base_dir] train --config-dir [config_dir] --test-mode no_test

no_test is suggested, since it may slow down the training speed.

To access tensorboard during training, run

tensorboard --logdir=[base_dir]/log

To evaluate and compare trained agents, run

python3 main.py --base-dir [base_dir] evaluate --agents [agent names] --evaluate-seeds [seeds]

Evaluation data will be output to [base_dir]/eva_data, and make sure evaluation seeds are different from those used in training.

Citation

If you find this useful in your research, please cite our paper "Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control" (early access version):

@ARTICLE{8667868, 
author={T. {Chu} and J. {Wang} and L. {Codecà} and Z. {Li}}, 
journal={IEEE Transactions on Intelligent Transportation Systems}, 
title={Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control}, 
year={2019}, 
volume={}, 
number={}, 
pages={1-10}, 
keywords={Reinforcement learning;Scalability;Heuristic algorithms;Mathematical model;Codecs;Neural networks;Convergence;Adaptive traffic signal control;reinforcement learning;multi-agent reinforcement learning;deep reinforcement learning;actor-critic.}, 
doi={10.1109/TITS.2019.2901791},
ISSN={1524-9050}, 
month={},}