DDPG in bullet Gym using PyTorch

Overview

This is an implementation of Deep Deterministic Policy Gradient (DDPG) in bullet Gym using PyTorch.

Dependencies

Python 3.6
PyTorch 0.3.0
OpenAI Gym
pybullet

Run

Here is a simple example that trains CartPole:

$ python main.py --debug --discrete --env=CartPole-v0 --vis

To see the usage of each argument, run:

$ python main.py --help

Explanation of the important arguments:

--debug: print the reward and some other information

--discrete: use this if the actions are discrete rather than continuous

--vis: render each action (this slows down training)

--cuda: train on the GPU

--test: run in testing mode

--resume: load the model from the given path
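
The flags above could be declared with argparse roughly as follows. This is only a sketch for orientation, not the code in main.py: the function name build_parser, the defaults, and the help strings are assumptions, and the real script may define more options (python main.py --help gives the authoritative list).

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the flags described in this README."""
    parser = argparse.ArgumentParser(description="DDPG training (illustrative sketch)")
    # The default environment id is an assumption taken from the example command.
    parser.add_argument("--env", default="CartPole-v0", type=str,
                        help="Gym environment id")
    parser.add_argument("--debug", action="store_true",
                        help="print the reward and some other information")
    parser.add_argument("--discrete", action="store_true",
                        help="actions are discrete rather than continuous")
    parser.add_argument("--vis", action="store_true",
                        help="render each action (slows down training)")
    parser.add_argument("--cuda", action="store_true",
                        help="train on the GPU")
    parser.add_argument("--test", action="store_true",
                        help="run in testing mode")
    # Assumed to take a path; None means start from scratch.
    parser.add_argument("--resume", default=None, type=str,
                        help="path to load a saved model from")
    return parser


# Parse the same arguments as the example command in the Run section.
args = build_parser().parse_args(
    ["--debug", "--discrete", "--env=CartPole-v0", "--vis"]
)
print(args.env, args.discrete, args.cuda)
```

Boolean switches use store_true, so each one is off unless it is passed on the command line; that matches the way the example command enables --debug, --discrete, and --vis without values.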

