
Reward Shaping Using Counterfactuals (D, D++). See "MPE_custom" repository.

yathartha3/DPP

Reward Shaping using Difference Rewards (D++)

The underlying architecture is MADDPG. I am modifying the Multi-Agent Particle Environment so that it returns the shaped reward, so take a look at the dpprs branch of the "MPE_custom" repository.
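To make the idea of a shaped reward concrete, here is a minimal sketch of the difference reward D and of D++ on a toy problem. It is not the code from this repository or from "MPE_custom". The toy global reward, the function names (`global_reward`, `difference_reward`, `dpp_reward`) and the parameters (`coupling`, `radius`, `n_copies`) are all assumptions made for illustration. The formulas follow the usual definitions as I understand them: D for agent i is G with everyone minus G with agent i removed, and D++ is the gain in G from adding n counterfactual copies of agent i, divided by n.

```python
from typing import Sequence, Tuple

Point = Tuple[float, float]


def global_reward(agents: Sequence[Point], pois: Sequence[Point],
                  coupling: int = 2, radius: float = 1.0) -> float:
    """Toy G (hypothetical): number of POIs with at least `coupling` agents within `radius`."""
    total = 0.0
    for px, py in pois:
        near = sum(1 for ax, ay in agents
                   if (ax - px) ** 2 + (ay - py) ** 2 <= radius ** 2)
        if near >= coupling:
            total += 1.0
    return total


def difference_reward(i: int, agents: Sequence[Point], pois: Sequence[Point],
                      **kw) -> float:
    """D_i = G(all agents) - G(all agents except agent i)."""
    without_i = [a for j, a in enumerate(agents) if j != i]
    return global_reward(agents, pois, **kw) - global_reward(without_i, pois, **kw)


def dpp_reward(i: int, agents: Sequence[Point], pois: Sequence[Point],
               n_copies: int, **kw) -> float:
    """D++ with n counterfactual copies of agent i: (G(with copies) - G(all)) / n."""
    with_copies = list(agents) + [agents[i]] * n_copies
    gain = global_reward(with_copies, pois, **kw) - global_reward(agents, pois, **kw)
    return gain / n_copies


if __name__ == "__main__":
    agents = [(0.0, 0.0), (0.5, 0.0), (5.0, 5.0)]
    pois = [(0.0, 0.0), (5.0, 5.0)]
    print("G    =", global_reward(agents, pois))
    print("D_2  =", difference_reward(2, agents, pois))
    print("D++_2 =", dpp_reward(2, agents, pois, n_copies=1))
```

In this toy setup the third agent sits alone at a POI that needs two agents, so removing it does not change G and D gives it no signal, while D++ with one counterfactual copy does. That is the tightly coupled case that motivates adding counterfactual agents.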

This is a work in progress.

MADDPG-PyTorch

The PyTorch implementation of MADDPG is taken from Shariq Iqbal.

Requirements

The versions are just the ones I used and are not necessarily strict requirements.

How to Run

All training code is contained within main.py. To view the available options, run:

python main.py --help
