Continuous-control-Agent

1. Project overview :

A reinforcement learning agent( double-jointed arm) trained to maintain its position toward a target in a continuous environment.

2.Task Description :

2.1 Environement :

For this project I am using the Reacher environment which simulates Double-jointed arm that can move to target locations.

With :

the observation has 30 variables about measurements such as velocities , angular velocities .... of the arm .
The action space is 4 dimentional vector , action = [x1 ,x2, x3, x4] where xi ∈ [-1, +1] with i ∈ {1,2,3,4}
The rewarding strategy : the agent receives +0.1 if it is in the goal( target) direction and nothing otherwise

Thus the goal is to maintain the position of the arm toward the target for as many time steps as possible.

2.2 Solving the environement:

This taks is considered solved if we reach an average reward of +30.0 over 100 episodes or more.

3. Getting started

If you wish to reproduce this work you need to setup the enviornement by following this section :

3.1 Clone this repository :

git clone https://github.com/ZSoumia/Continous-control-Agent

3.2 Set up the environment :

Please follow instructions from this repo

3.3 Download the Unity Environment :

Select the Unity environement based on your opertaing system :

Linux: click here
Mac OSX: click here
Windows (32-bit): click here
Windows (64-bit): click here

Check out this link if you need help with determining if your computer is running a 32-bit version or 64-bit version of the Windows operating system.

(For AWS) If you'd like to train the agent on AWS (and have not enabled a virtual screen), then please use this link to obtain the "headless" version of the environment. You will not be able to watch the agent without enabling a virtual screen, but you will be able to train the agent. (To watch the agent, you should follow the instructions to enable a virtual screen, and then download the environment for the Linux operating system above.)

==> Place the downloaded file into your cloned project file .

4. Project's structure :

The Agent.py file contains the general structure of the Reinforcement learning agent .
The Actor.py contains the actor's network code .
Critic.py contains the critic's network code.
Continuous_control.ipynb is the notebook used to train and evaluate the agent.
Continuous control Report.html is a report about the different aspects of this project.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
assets		assets
Actor.py		Actor.py
Agent.py		Agent.py
Continous control project Report.html		Continous control project Report.html
Continuous_Control.ipynb		Continuous_Control.ipynb
Critic.py		Critic.py
README.md		README.md
actor_model.pth		actor_model.pth
critic_model.pth		critic_model.pth

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Continuous-control-Agent

1. Project overview :

2.Task Description :

2.1 Environement :

2.2 Solving the environement:

3. Getting started

3.1 Clone this repository :

3.2 Set up the environment :

3.3 Download the Unity Environment :

4. Project's structure :

About

Releases

Packages

Languages

ZSoumia/Continous-control-Agent

Folders and files

Latest commit

History

Repository files navigation

Continuous-control-Agent

1. Project overview :

2.Task Description :

2.1 Environement :

2.2 Solving the environement:

3. Getting started

3.1 Clone this repository :

3.2 Set up the environment :

3.3 Download the Unity Environment :

4. Project's structure :

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages