Pytorch implementation of Soft Actor-Critic algorithm in MuJoco environments.
Source: Haarnoja, T., Zhou, A., Abbeel, P., and Levine, S. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor. In International Conference on Machine Learning (ICML), 2018. Pre-print available at: https://arxiv.org/pdf/1801.01290.pdf