Skip to content

Implementation of the Deep Deterministic Policy Gradient(DDPG) in bullet Gym using pytorch

Notifications You must be signed in to change notification settings

OsgoodWu/pytorch-gym

 
 

Repository files navigation

DDPG in bullet Gym using pytorch

Overview

This is an implementation of Deep Deterministic Policy Gradient (DDPG) in bullet Gym using PyTorch.

Dependencies

  • Python 3.6
  • PyTorch 0.3.0
  • openAI gym
  • pybullet

Run

  • here is a simple example to train CartPole with high efficiency:

$ python main.py --debug --discrete --env=CartPole-v0 --vis

  • you can using this to understand usage of each argument:

$ python main.py --help

  • some explanation of important arguments:

--debug: print the reward and some other information

--discrete: if the actions are discrete rather than continuous

--vis: render each action (but it would slow down your training speed)

--cuda: train this task using GPU

—test: testing mode

—resume : load model from the path

Contributors

About

Implementation of the Deep Deterministic Policy Gradient(DDPG) in bullet Gym using pytorch

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%