Reward Shaping using Difference Rewards (D++)

The underlying architecture is MADDPG. I am modifying the Multiagent Particle Environment so that it returns the shaped reward; see the dpprs branch of the "MPE_custom" repository.

This is a work in progress.
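
For readers unfamiliar with difference rewards, here is a minimal sketch of the idea. It is not code from this repository or from MPE_custom. The toy `global_reward` function, the 1-D positions, and the names `difference_rewards` and `dpp_reward` are all hypothetical. It assumes the classic difference reward compares the global reward with and without an agent, and that a D++-style term compares the global reward after adding counterfactual copies of an agent, divided by the number of copies.

```python
from typing import Callable, List, Sequence


def global_reward(positions: Sequence[float], target: float = 0.0,
                  needed: int = 2, radius: float = 1.0) -> float:
    """Hypothetical tightly coupled reward: 1.0 only if at least
    `needed` agents are within `radius` of `target`, else 0.0."""
    close = sum(1 for p in positions if abs(p - target) <= radius)
    return 1.0 if close >= needed else 0.0


def difference_rewards(positions: Sequence[float],
                       G: Callable[[Sequence[float]], float] = global_reward
                       ) -> List[float]:
    """Classic difference reward: D_i = G(z) - G(z without agent i)."""
    g = G(positions)
    rewards = []
    for i in range(len(positions)):
        without_i = list(positions[:i]) + list(positions[i + 1:])
        rewards.append(g - G(without_i))
    return rewards


def dpp_reward(positions: Sequence[float], i: int, n_copies: int,
               G: Callable[[Sequence[float]], float] = global_reward
               ) -> float:
    """Assumed D++-style term: (G(z plus n copies of agent i) - G(z)) / n."""
    if n_copies < 1:
        raise ValueError("n_copies must be at least 1")
    augmented = list(positions) + [positions[i]] * n_copies
    return (G(augmented) - G(positions)) / n_copies


if __name__ == "__main__":
    z = [0.5, 5.0]  # only one agent is near the target
    print(difference_rewards(z))   # no agent gets credit on its own
    print(dpp_reward(z, 0, 1))     # a second copy of agent 0 would succeed
```

In this toy setup the plain difference reward gives no signal when a single agent is near the target, while the counterfactual-add term rewards the agent that is already in a useful position. That is the motivation for shaping the reward in tightly coupled tasks.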

MADDPG-PyTorch

The PyTorch implementation of MADDPG is taken from Shariq Iqbal.

Requirements

The versions are just what I used and not necessarily strict requirements.

How to Run

All training code is contained within main.py. To view options simply run:

python main.py --help