From b35d601e2c1364a770c3a53765512625b3e2b391 Mon Sep 17 00:00:00 2001 From: Adam Parker Date: Sun, 28 Apr 2024 10:11:18 -0500 Subject: [PATCH 1/2] Update third_party_environments.md Adding Buffalo-Gym to external environments --- docs/environments/third_party_environments.md | 7 +++++++ 1 file changed, 7 insertions(+) diff --git a/docs/environments/third_party_environments.md b/docs/environments/third_party_environments.md index c346ba27f..c51a844bd 100644 --- a/docs/environments/third_party_environments.md +++ b/docs/environments/third_party_environments.md @@ -168,6 +168,13 @@ tmrl is a distributed framework for training Deep Reinforcement Learning AIs in EV2Gym is a fully customizable and easily configurable environment for Electric Vehicle (EV) smart charging simulations on a small and large scale. Also, includes non-RL baseline implementations such as mathematical programming, model predictive control, and heuristics. +### [Buffalo-Gym: Multi-Armed Bandit Gymnasium](https://github.com/foreverska/buffalo-gym) + +![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.29.1-blue) +![GitHub stars](https://img.shields.io/github/stars/foreverska/buffalo-gym) + +Buffalo-Gym is a Multi-Armed Bandit (MAB) gymnasium built primarily to assist in debugging RL implementations. MABs are often easy to reason about what the agent is learning and if it is learning correctly. Buffalo-gym encompases Bandits, Contextual bandits and contextual bandits with aliasing. + ## Third-Party Environments using Gym There are a large number of third-party environments using various versions of [Gym](https://github.com/openai/gym). From 2bd9990047f231e1bbaf87bf3973117b6a655473 Mon Sep 17 00:00:00 2001 From: Adam Parker Date: Sun, 28 Apr 2024 19:25:40 -0500 Subject: [PATCH 2/2] ran precommit, final version of text --- docs/environments/third_party_environments.md | 3 +-- 1 file changed, 1 insertion(+), 2 deletions(-) diff --git a/docs/environments/third_party_environments.md b/docs/environments/third_party_environments.md index c51a844bd..4da542891 100644 --- a/docs/environments/third_party_environments.md +++ b/docs/environments/third_party_environments.md @@ -173,8 +173,7 @@ EV2Gym is a fully customizable and easily configurable environment for Electric ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.29.1-blue) ![GitHub stars](https://img.shields.io/github/stars/foreverska/buffalo-gym) -Buffalo-Gym is a Multi-Armed Bandit (MAB) gymnasium built primarily to assist in debugging RL implementations. MABs are often easy to reason about what the agent is learning and if it is learning correctly. Buffalo-gym encompases Bandits, Contextual bandits and contextual bandits with aliasing. - +Buffalo-Gym is a Multi-Armed Bandit (MAB) gymnasium built primarily to assist in debugging RL implementations. MABs are often easy to reason about what the agent is learning and whether it is correct. Buffalo-gym encompasses Bandits, Contextual bandits, and contextual bandits with aliasing. ## Third-Party Environments using Gym There are a large number of third-party environments using various versions of [Gym](https://github.com/openai/gym).