Introduction

Based on the original Repository at https://github.com/fxia22/gn.pytorch/

The project learns the dynamics of the agent by using the Graph Neural Networks Graph networks as learnable physics engines for inference and control.
The learned GNN Model is integrated into the Deep Mind Control Suite to learn a policy for reaching the goal, please replace your original dm_control package with the files from dm_control_mod
The code for explanation is contained in the explanations folder the files contain instructions for generating the decision trees for policy. Please look into "explainPolicy.ipynb" and "explainDynamics.ipynb"
Models and files are named _swimmer3 for all swimmer3 trained policies and gnn models and the data sets

Dependencies

Please install the following Dependencies

DeepMind control suite
Mujoco
networkx
pytorch 0.4.1 (other versions untested)
stable_baselines3

Generate data

Generate data with gen_data_new.py script, for the 3-link swimmers states. A pregenerated data file is given here "swimmer3.npy" and "swimmer3_eval.npy"

Normalise the data using "normalizer.py" a pre-generated file is given here "normalize3.pth"

Train GN

Change line 12 in train_gn.py to be from dataset import SwimmerDataset

python train_gn.py to train the model. The learning rate schedule corresponds to "fast training" in original paper.

Evaluate GN

Change line 14 in evaluate_gn.py to be from dataset import SwimmerDataset

python evaluate_gn.py <model path>

Predicted and Actual dynamics of swimmer3

GNN loss for swimmer3
GNN predicted vs actual state

Swimmer3 learned policy

TD3 has worked the best for us with an mean reward of 218

Demo of swimmer3 moving

Explanations

Using a set of observations we predict the angle the angle of the head using the decision trees predictions.

Change line 12 in train_gn.py to be from testdataset import SwimmerDataset Change line 14 in evaluate_gn.py to be from testdataset import SwimmerDataset

The policy explanation is attempted by building a decision tree of optimal policy and model policy. The details are given in "explainDynamics.ipynb" and "explainPolicy.ipynb"

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.idea		.idea
.ipynb_checkpoints		.ipynb_checkpoints
__pycache__		__pycache__
dm_control_new		dm_control_new
logs		logs
misc		misc
models		models
swimmer3_td3_c/TD3_1		swimmer3_td3_c/TD3_1
.gitattributes		.gitattributes
CustomPPO.py		CustomPPO.py
CustomSwimmerGym.py		CustomSwimmerGym.py
CustomTrain.py		CustomTrain.py
README.md		README.md
__init__.py		__init__.py
activity_optactive_reward-110000		activity_optactive_reward-110000
activity_optactive_reward2-110000		activity_optactive_reward2-110000
dataset.py		dataset.py
dataset3.py		dataset3.py
debug_rollout.ipynb		debug_rollout.ipynb
environment_base.yml		environment_base.yml
evaluate_gn.py		evaluate_gn.py
evaluate_gn3.py		evaluate_gn3.py
evaluate_gn_rollout.py		evaluate_gn_rollout.py
explainDynamics.ipynb		explainDynamics.ipynb
explainPolicy.ipynb		explainPolicy.ipynb
exports		exports
gen_data.py		gen_data.py
gen_data_new.py		gen_data_new.py
generate_rollout.py		generate_rollout.py
gn_models.py		gn_models.py
gnn_swimmer3_td3.zip		gnn_swimmer3_td3.zip
model_1040000.pth		model_1040000.pth
model_swimer6.pth		model_swimer6.pth
model_swimmer3.pth		model_swimmer3.pth
myswimmer.py		myswimmer.py
myswimmer2.py		myswimmer2.py
myswimmer_new.py		myswimmer_new.py
new_swimmer.npy		new_swimmer.npy
normalize.pth		normalize.pth
normalize3.pth		normalize3.pth
normalizer.py		normalizer.py
performance-110000.txt		performance-110000.txt
split.py		split.py
swimmer2.npy		swimmer2.npy
swimmer3.npy		swimmer3.npy
swimmer3_eval.npy		swimmer3_eval.npy
swimmer3_td3_correct.zip		swimmer3_td3_correct.zip
swimmer6.npy		swimmer6.npy
swimmer6_test.npy		swimmer6_test.npy
swimmer_gym.py		swimmer_gym.py
test_normalizer.ipynb		test_normalizer.ipynb
testdataset.py		testdataset.py
train_gn.py		train_gn.py
util2.py		util2.py
utils.py		utils.py
visualize.ipynb		visualize.ipynb
wrappers.py		wrappers.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Introduction

Dependencies

Generate data

Train GN

Evaluate GN

Predicted and Actual dynamics of swimmer3

Swimmer3 learned policy

Demo of swimmer3 moving

Explanations

About

Releases

Packages

Languages

apurvakokate/GraphNN-For-Learning-Dynamics-and-Generating-Policies-with-Explanations-using-Decision-Trees

Folders and files

Latest commit

History

Repository files navigation

Introduction

Dependencies

Generate data

Train GN

Evaluate GN

Predicted and Actual dynamics of swimmer3

Swimmer3 learned policy

Demo of swimmer3 moving

Explanations

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages