Official Implementation for NESHT (Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning)
This project implements an evolutionary algorithm that integrates the hard-thresholding (HT) operator into the well-known natural evolution strategies (NES) algorithm. For the paper, please check this arXiv link.
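At a high level, the idea is to alternate a standard NES gradient estimate with a hard-thresholding projection that keeps only the k largest-magnitude parameters. The sketch below is purely illustrative and not the repository's code: the function names (`hard_threshold`, `nes_ht_step`) and the hyperparameter values are assumptions.

```python
# Illustrative sketch only; names and hyperparameters are assumptions, not the repo's API.
import numpy as np

def hard_threshold(theta, k):
    """Keep the k largest-magnitude entries of theta and zero out the rest (HT operator)."""
    out = np.zeros_like(theta)
    idx = np.argsort(np.abs(theta))[-k:]
    out[idx] = theta[idx]
    return out

def nes_ht_step(theta, fitness_fn, k, pop_size=32, sigma=0.1, lr=0.02, rng=None):
    """One NES gradient step followed by hard thresholding (a sketch of the NESHT idea)."""
    rng = rng or np.random.default_rng()
    eps = rng.standard_normal((pop_size, theta.size))            # antithetic sampling omitted for brevity
    rewards = np.array([fitness_fn(theta + sigma * e) for e in eps])
    shaped = (rewards - rewards.mean()) / (rewards.std() + 1e-8)  # simple fitness shaping
    grad = (shaped[:, None] * eps).sum(axis=0) / (pop_size * sigma)
    return hard_threshold(theta + lr * grad, k)                   # project onto k-sparse vectors
```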
- Parallelism. We employ the simple `joblib` package, finding it much faster than using `ray` on a single-node cluster. (Our single-node machine has 512 cores.) See the parallel-evaluation sketch after this list.
- Policy Network. We apply the NESHT algorithm to an agent with a single linear layer, as it is well established that the capacity of a single linear layer suffices for MuJoCo tasks. A sketch of such a policy appears after this list.
- Gaussian Noise. Our comparisons are conducted on noisy MuJoCo environments, where the state the agent observes is a combination of Gaussian noise and the environment-provided observation. A sketch of a noise-injecting observation wrapper appears after this list.
- For any questions about the implementation, please don't hesitate to contact me.
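As a rough illustration of the `joblib`-based parallelism mentioned above, the sketch below farms rollouts of perturbed parameter vectors out to CPU cores. It is not the repository's code: the environment API (`gymnasium`), the environment name, and the function names are assumptions.

```python
# Hypothetical sketch; rollout/evaluate_population and the gymnasium usage are assumptions.
import numpy as np
import gymnasium as gym
from joblib import Parallel, delayed

def rollout(theta, env_name="HalfCheetah-v4", seed=0):
    """Run one episode with a linear policy given by the flat parameter vector theta."""
    env = gym.make(env_name)
    obs, _ = env.reset(seed=seed)
    W = theta.reshape(env.action_space.shape[0], env.observation_space.shape[0])
    total, done = 0.0, False
    while not done:
        action = np.clip(W @ obs, env.action_space.low, env.action_space.high)
        obs, reward, terminated, truncated, _ = env.step(action)
        total += reward
        done = terminated or truncated
    env.close()
    return total

def evaluate_population(thetas, n_jobs=-1):
    """Evaluate a population of parameter vectors in parallel, one process per rollout."""
    return Parallel(n_jobs=n_jobs)(delayed(rollout)(t, seed=i) for i, t in enumerate(thetas))
```

`Parallel(n_jobs=-1)` uses all available cores, which is where a 512-core node pays off; on a single machine, joblib's lightweight process pool avoids the scheduling overhead of a full cluster framework, consistent with the speed difference noted above.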
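For the single-linear-layer policy and the noisy observations described above, a minimal sketch might look like the following. The class names, the assumption that the noise is additive, and the noise scale are illustrative, not taken from the repository.

```python
# Illustrative only; assumes additive zero-mean Gaussian noise and a bias-free linear policy.
import numpy as np
import gymnasium as gym

class GaussianNoiseObservation(gym.ObservationWrapper):
    """Add zero-mean Gaussian noise to every observation returned by the environment."""
    def __init__(self, env, noise_std=0.1):
        super().__init__(env)
        self.noise_std = noise_std

    def observation(self, obs):
        return obs + self.noise_std * np.random.standard_normal(obs.shape)

class LinearPolicy:
    """A single linear layer (no bias, no nonlinearity) mapping observations to actions."""
    def __init__(self, obs_dim, act_dim):
        self.theta = np.zeros(act_dim * obs_dim)   # flat vector, convenient for NES/HT updates
        self._shape = (act_dim, obs_dim)

    def act(self, obs):
        return self.theta.reshape(self._shape) @ obs

# Example (hypothetical environment name):
# env = GaussianNoiseObservation(gym.make("Walker2d-v4"), noise_std=0.1)
```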