Reinforcement learning algorithms have made a significant impact on control and decision-making problems where no existing methodology has succeeded. In this work we apply a hybrid algorithm to a virtual self-driving car, combining the Actor-Critic and Proximal Policy Optimization (PPO) methods to handle continuous control tasks for car locomotion. Successful locomotion of a self-driving car is achieved through angular movements of the steering in response to changes in the environment, where actions such as turning smoothly or throttling map to a continuous action space. The policy, which maps sensor inputs to the car's actions, is updated to maximize reward; in this respect, the Actor-Critic method improves on general policy-based methods. The primary purpose of this research is to study the performance of the modified policy optimization technique, which enhances the agent's interaction with the environment and yields improved rewards compared with other policy-based methods. The testbeds used for the implementation of the modified algorithm are CartPole and MountainCarContinuous. The modified actor-critic algorithm yields consistent policy updates, reducing the risk of suddenly learning an irreversibly bad policy.
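The "consistent policy update" property mentioned above comes from PPO's clipped surrogate objective, which bounds how far each update can move the policy ratio away from 1. As a minimal sketch (not the repository's actual implementation), the clipped objective can be computed from log-probabilities and advantage estimates like this:

```python
import numpy as np

def ppo_clipped_objective(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Clipped surrogate objective from PPO.

    The probability ratio r_t = pi_new(a|s) / pi_old(a|s) is clipped to
    [1 - eps, 1 + eps]; taking the minimum of the clipped and unclipped
    terms removes the incentive to push the policy far from the old one,
    which is what reduces the risk of a sudden, irreversible bad update.
    """
    ratios = np.exp(new_log_probs - old_log_probs)
    unclipped = ratios * advantages
    clipped = np.clip(ratios, 1.0 - clip_eps, 1.0 + clip_eps) * advantages
    # Objective to be maximized (gradient ascent on the policy parameters).
    return np.mean(np.minimum(unclipped, clipped))
```

For example, if the new policy doubles an action's probability (ratio 2.0) on a transition with positive advantage, the clipped term caps its contribution at `(1 + clip_eps) * advantage`, so the update gains nothing from moving the ratio beyond 1.2.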
simonsimanta/Reinforcement-learning
About
Optimization of Actor Critic Policy in Continuous Action Space