gym-connect-four

Open AI Gym for ConnectFour game

Setup

Install the environment:

git clone https://github.com/IASIAI/gym-connect-four.git  
cd gym-connect-four  
pip install -e .

Test two random players :

python example/sample.opponent.py

Usage

Inside the environment there are a couple of sample players provided:

RandomPlayer: as name suggests it only does random moves, although valid ones
SavedPlayer: loads a saved model and uses it to play

Sample code

import gym
from gym_connect_four import RandomPlayer, ConnectFourEnv
env: ConnectFourEnv = gym.make("ConnectFour-v0")

player1 = RandomPlayer(env, 'Dexter-Bot')
player2 = RandomPlayer(env, 'Deedee-Bot')
result = env.run(player1, player2, render=True)
reward = result.value
print(reward)

Inside the repo there are a couple of examples:

sample_nn: Neural Network implementation identical to the one from CartPole playing against a random opponent

Considerations for the environment:

the environment will throw an exception on invalid move, please use either env .is_valid_action(action) or pick the move from env.available_moves()
Player gets the board through get_next_action() and learn() publicly exposed methods. Every player sees its discs encoded with 1, regardless his turn order in the game. In this way , the same NN model can be used as Player1 or as Player2 without needing to adjust board encoding.

Runner usage

In the tools directory there is a script called runner.py which will be the tool used to execute the final competition.

cd tools
python runner.py random Model playerclass random

It accepts a list of arguments, from which only the random can repeat itself. Each imput parameter will be validated in order against:

"random": It will instantiate a player based on the RandomPlayer class, but with a randomized name
"parameter.py": If a python file with the same name is present, it will import that file and instantiate a class with Parameter name (capitalized). In that class you will be responsable to load your model in the init section and then to supply the requested move (maybe to reformat the board when requested for your specific model)
"Parameter.h5": If your model is saved from a simple Neural Network and you haven't altered the input board on processing, then the model will be loaded via the SavedPlayer class

The runner will pit each player against all the others and will also allow learning for all of them.

Assignment

For the competition the following set of deliverables must be provided:

python file containing a Player class used for training the model (NNPlayer + DQNSolver if relative to sample_nn.py)
python class containing a SavedPlayer implementation (only if any customization was needed)
a .h5 (HDF5) file containing the model structure, weights and optimizer (will be the provided or default SavedPlayer class) (https://keras.io/getting-started/faq/#how-can-i-save-a-keras-model)

Evaluation of the assignment

All supplied models will pitted against each other in a round robbin tournament. Every player will play N (a big number like 1000) rounds with every other player. The formula of the score:

Score = sum( (#Wins against Player_k) - (#Losses against Player_k) for k in {all_opponents})

The draws are not taken into account for score computation.
The winner is the one having highest score.

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
examples		examples
gym_connect_four		gym_connect_four
tests		tests
tools		tools
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

gym-connect-four

Setup

Usage

Sample code

Runner usage

Assignment

Evaluation of the assignment

Have fun and happy training!

About

Releases

Packages

Contributors 4

Languages

License

IASIAI/gym-connect-four

Folders and files

Latest commit

History

Repository files navigation

gym-connect-four

Setup

Usage

Sample code

Runner usage

Assignment

Evaluation of the assignment

Have fun and happy training!

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages