# Official Implementation for NESHT (Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning)

This project implements an evolutionary algorithm that integrates the hard-thresholding (HT) operator into the well-known natural evolution strategies (NES) algorithm. For the full description, please see the paper on arXiv.
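
To illustrate the idea, here is a minimal sketch, assuming the hard-thresholding operator keeps the `k` largest-magnitude parameters and is applied after a standard NES gradient step. The function names (`hard_threshold`, `nes_ht_step`, `fitness_fn`) and the hyperparameter defaults are illustrative, not the repository's actual code or the paper's settings.

```python
import numpy as np

def hard_threshold(theta, k):
    """Keep the k largest-magnitude entries of theta and zero out the rest."""
    out = np.zeros_like(theta)
    idx = np.argsort(np.abs(theta))[-k:]   # indices of the k largest |theta_i|
    out[idx] = theta[idx]
    return out

def nes_ht_step(theta, fitness_fn, k, pop_size=50, sigma=0.1, lr=0.01, rng=None):
    """One NES gradient estimate followed by hard-thresholding (illustrative only)."""
    rng = rng if rng is not None else np.random.default_rng()
    eps = rng.standard_normal((pop_size, theta.size))              # Gaussian perturbations
    rewards = np.array([fitness_fn(theta + sigma * e) for e in eps])
    rewards = (rewards - rewards.mean()) / (rewards.std() + 1e-8)  # fitness shaping
    grad = eps.T @ rewards / (pop_size * sigma)                    # NES gradient estimate
    return hard_threshold(theta + lr * grad, k)                    # enforce sparsity
```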

## Details

- **Parallelism.** We use the simple joblib package, which we found to be much faster than Ray on our single-node machine (512 cores); a minimal parallel-rollout sketch is given after this list.
- **Policy Network.** We apply the NESHT algorithm to an agent with a single linear layer, since it is well established that a single linear layer has sufficient capacity for MuJoCo tasks (see the rollout sketch below).
- **Gaussian Noise.** Our comparisons are run on noisy MuJoCo environments, where each observation is the environment-provided state corrupted by additive Gaussian noise; a wrapper sketch is also given after this list.
- For any questions about the implementation, please don't hesitate to contact me.
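
As a rough illustration of the parallelism and policy-network points above, the sketch below rolls out Gaussian-perturbed single-linear-layer policies in parallel with joblib. It assumes the Gymnasium API; the environment name, episode length, and helper names (`rollout`, `evaluate_population`) are placeholders rather than the repository's code.

```python
import gymnasium as gym
import numpy as np
from joblib import Parallel, delayed

def rollout(theta, env_name="HalfCheetah-v4", episode_len=1000):
    """Run one episode with a single-linear-layer policy: action = W @ obs."""
    env = gym.make(env_name)
    obs_dim = env.observation_space.shape[0]
    act_dim = env.action_space.shape[0]
    W = theta.reshape(act_dim, obs_dim)    # flat parameter vector -> weight matrix
    obs, _ = env.reset()
    total_reward = 0.0
    for _ in range(episode_len):
        action = np.clip(W @ obs, env.action_space.low, env.action_space.high)
        obs, reward, terminated, truncated, _ = env.step(action)
        total_reward += reward
        if terminated or truncated:
            break
    env.close()
    return total_reward

def evaluate_population(theta, eps, sigma=0.1, n_jobs=-1):
    """Evaluate all perturbed parameter vectors in parallel worker processes."""
    return Parallel(n_jobs=n_jobs)(
        delayed(rollout)(theta + sigma * e) for e in eps
    )
```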
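
The noisy-environment setup can be approximated with a simple observation wrapper that adds zero-mean Gaussian noise to every observation; the class name and the `noise_std` value below are illustrative and not the paper's actual configuration.

```python
import gymnasium as gym
import numpy as np

class GaussianNoiseObservation(gym.ObservationWrapper):
    """Add zero-mean Gaussian noise to every observation returned by the env."""
    def __init__(self, env, noise_std=0.1):
        super().__init__(env)
        self.noise_std = noise_std

    def observation(self, obs):
        return obs + self.noise_std * np.random.standard_normal(obs.shape)

# Usage: wrap a MuJoCo task before rolling out the linear policy.
# env = GaussianNoiseObservation(gym.make("HalfCheetah-v4"), noise_std=0.1)
```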