AlphaZero applied to board games

Authors: Tom LABIAUSSE - Amine CHERIF HAOUAT - Sami JALLOULI

Date: Feb/Mar 2024

0 - Setup

Clone the repository:

git clone git@github.com:t0m1ab/alphazero.git

Install alphazero as a package in editable mode (see the configuration in pyproject.toml):

cd alphazero/
pip install -e .

You should then be able to run the package tests or print the documentation from the terminal:

alphazero --test
alphazero --help

Download the AlphaZero networks available on our Hugging Face Hub using either of the following equivalent commands:

alphazero --download

python alphazero/download.py

All models and configuration files will be stored in a models/ folder by default when loading or training a player.

1 - Files

alphazero/

base: implement parent classes such as Board, Player, PolicyValueNetwork...
players.py: implement different game strategies (random, greedy, mcts, alphazero, human)
mcts.py: implement Monte Carlo Tree Search (rollout or neural evaluation mode)
schedulers.py: implement temperature schedulers for MCTS during AlphaZero training
trainers.py: implement a trainer for AlphaZero
timers.py: define timers to estimate AlphaZero training time
arena.py: organize games between players and display results (sequential or parallel mode)
game_ui.py: interface between user and algorithm to play a game
contests.py: define specific contests between players
visualization.py: define plot functions to create training/evaluation graphs
utils.py: utility functions
tests.py: contains various tests that can be run to check the implementation
download.py: run to download the AlphaZero networks stored on the Hugging Face Hub
run.sh: run to launch a training
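
To make the role of schedulers.py more concrete, here is a minimal sketch of how a temperature is commonly applied to MCTS visit counts in AlphaZero-style training. It is not taken from the package: the function names (step_temperature, visit_counts_to_policy, sample_move) and the default threshold are assumptions chosen for illustration, and the scheduler shown (explore early, then play greedily) is only one possible schedule.

```python
import math
import random


def step_temperature(move_index, threshold=10, high=1.0, low=0.0):
    """Hypothetical scheduler: explore during the first moves, then play greedily."""
    return high if move_index < threshold else low


def visit_counts_to_policy(visit_counts, temperature):
    """Map visit counts N(a) to probabilities proportional to N(a)^(1/T)."""
    if not visit_counts or max(visit_counts) <= 0:
        raise ValueError("at least one move must have been visited")
    if temperature <= 1e-8:
        # Greedy limit T -> 0: all the mass goes to the most visited move(s).
        best = max(visit_counts)
        weights = [1.0 if n == best else 0.0 for n in visit_counts]
    else:
        # Work in log space and subtract the max to avoid overflow for small T.
        log_max = math.log(max(visit_counts))
        weights = [
            math.exp((math.log(n) - log_max) / temperature) if n > 0 else 0.0
            for n in visit_counts
        ]
    total = sum(weights)
    return [w / total for w in weights]


def sample_move(visit_counts, temperature, rng=random):
    """Sample a move index from the temperature-adjusted policy."""
    policy = visit_counts_to_policy(visit_counts, temperature)
    return rng.choices(range(len(policy)), weights=policy, k=1)[0]
```

With a temperature of 1 the move is sampled in proportion to its visit count, which keeps self-play games diverse; as the temperature approaches 0 the most visited move is always chosen.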

alphazero/games/

registers.py: define configurations, boards and networks mapping for each game using their name
othello.py: implementation of the Othello environment, game config and neural network for AlphaZero
tictactoe.py: implementation of the Tictactoe environment, game config and neural network for AlphaZero
connect4.py: implementation of the Connect4 environment, game config and neural network for AlphaZero

docs/

help.txt: general information

figures/

othello_board_example.png: example of Othello 8x8 board with human display
connect4_board_example.png: example of Connect4 board with human display
tictactoe_board_example.png: example of Tictactoe board with human display

2 - Demo

Move into the code folder alphazero/ before running any of the following commands.

2.1 - Play against the machine with the CLI

Use game_cli.py to launch a game against the machine. The state of the board is automatically saved as a PNG file in outputs/ and is overwritten after each move. Example commands can be found below or at the end of game_cli.py.

python game_cli.py --othello --mcts
python game_cli.py --othello --net alphazero-othello --infos --bot-starts
python game_cli.py --tictactoe --net alphazero-tictactoe --display pixel

2.2 - Compare machine players

python contests.py

Change the contest function called in contests.py to modify the machine players and/or the game settings.
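
As a rough illustration of what a contest function can look like, the sketch below plays a series of games between two players, alternates who starts and tallies the results. It does not reproduce the code in contests.py or arena.py: run_contest and the play_game callback (which returns +1 if the first player wins, -1 if the second wins and 0 for a draw) are hypothetical names introduced for this example.

```python
from collections import Counter


def run_contest(play_game, player_a, player_b, n_games=100):
    """Play n_games between two players and count wins for each side.

    play_game(first, second) is an assumed callback returning +1 if `first`
    wins, -1 if `second` wins and 0 for a draw.
    """
    results = Counter({"a": 0, "b": 0, "draw": 0})
    for game_index in range(n_games):
        # Alternate the starting player to cancel out any first-move advantage.
        a_starts = game_index % 2 == 0
        first, second = (player_a, player_b) if a_starts else (player_b, player_a)
        outcome = play_game(first, second)
        if outcome == 0:
            results["draw"] += 1
        elif (outcome == 1) == a_starts:
            # The first player won and it was A, or the second won and it was A.
            results["a"] += 1
        else:
            results["b"] += 1
    return results
```

Alternating the starting player matters for games such as Tictactoe or Connect4, where moving first is an advantage and a one-sided schedule would bias the comparison.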

2.3 - Train an AlphaZero player

bash run.sh

Use the options of trainers.py in run.sh to change the experiment name, the game or the configuration.

Freeze the default configurations as JSON files with the following command:

python trainers.py -f
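
For readers unfamiliar with the idea, freezing a configuration generally means writing its default values to a file so that an experiment can be reproduced or edited later. The sketch below shows one way to do this with a dataclass and the json module. ToyConfig and its fields are placeholders invented for this example and do not reflect the actual options of trainers.py.

```python
import json
from dataclasses import asdict, dataclass
from pathlib import Path


@dataclass
class ToyConfig:
    # Illustrative placeholder fields, not the package's real options.
    game: str = "tictactoe"
    simulations: int = 100
    learning_rate: float = 0.001

    def to_json(self, path):
        """Write the configuration to `path` as JSON and return the path."""
        path = Path(path)
        path.parent.mkdir(parents=True, exist_ok=True)
        path.write_text(json.dumps(asdict(self), indent=4))
        return path

    @classmethod
    def from_json(cls, path):
        """Rebuild a configuration from a JSON file written by to_json."""
        return cls(**json.loads(Path(path).read_text()))
```

A frozen file can then be edited by hand and reloaded with ToyConfig.from_json, which keeps the defaults in code and the experiment-specific values on disk.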
