https://notes.jasonljin.com/projects/2018/05/20/Training-AlphaZero-To-Play-Hex.html:
hex_zero_model.py
contains the code that builds the deep neural network used for policy and value prediction.
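The exact architecture isn't shown here, but AlphaZero-style networks typically share a convolutional trunk that feeds separate policy and value heads. Below is a minimal sketch of such a network in Keras; the Keras choice, the board size, layer sizes, and the function name are illustrative assumptions, not necessarily what hex_zero_model.py implements.

# Minimal sketch of an AlphaZero-style policy/value network for Hex.
# All sizes and names are illustrative assumptions.
import tensorflow as tf
from tensorflow.keras import layers, Model

def build_hex_zero_model(board_size=11, filters=64, n_res_blocks=4):
    board_input = layers.Input(shape=(board_size, board_size, 3), name="board")
    x = layers.Conv2D(filters, 3, padding="same", activation="relu")(board_input)
    # Small residual trunk shared by both heads.
    for _ in range(n_res_blocks):
        skip = x
        x = layers.Conv2D(filters, 3, padding="same", activation="relu")(x)
        x = layers.Conv2D(filters, 3, padding="same")(x)
        x = layers.add([x, skip])
        x = layers.Activation("relu")(x)

    # Policy head: one probability per board cell.
    p = layers.Conv2D(2, 1, activation="relu")(x)
    p = layers.Flatten()(p)
    policy = layers.Dense(board_size * board_size, activation="softmax", name="policy")(p)

    # Value head: scalar in [-1, 1] estimating the expected game outcome.
    v = layers.Conv2D(1, 1, activation="relu")(x)
    v = layers.Flatten()(v)
    v = layers.Dense(64, activation="relu")(v)
    value = layers.Dense(1, activation="tanh", name="value")(v)

    model = Model(board_input, [policy, value])
    model.compile(optimizer="adam",
                  loss={"policy": "categorical_crossentropy", "value": "mse"})
    return model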
sl_bootstrap.py
contains a script that bootstraps the neural network on existing Hex data; it calls hex_zero_model to build the network and then trains it for the specified number of epochs (a rough sketch of this step follows the command below).
python3 sl_bootstrap.py
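As an illustration of the bootstrap step, the sketch below assumes the existing Hex data is stored as NumPy arrays of board states, move distributions, and game outcomes; the file name, array keys, epoch count, and the build_hex_zero_model helper are hypothetical, not the repository's actual interface.

# Minimal sketch of supervised bootstrapping on existing Hex data.
# File name, array keys, and helper names are hypothetical.
import numpy as np
from hex_zero_model import build_hex_zero_model  # hypothetical import

data = np.load("hex_sl_data.npz")                # hypothetical data file
states, policies, values = data["states"], data["policies"], data["values"]

model = build_hex_zero_model()
model.fit(states,
          {"policy": policies, "value": values},
          batch_size=64,
          epochs=10,                             # "the specified epochs"
          validation_split=0.1)
model.save("bootstrap_model.h5")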
Hex.py
contains several functions for playing games between different players (Self, Random, HexPlayerBryce); you can specify the number of games, which agent is player 1, and whether to show the game turn by turn (a sketch of such a driver follows the command below).
python3 Hex.py
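The sketch below shows what such a game-playing driver can look like. It assumes a Hex game object exposing legal_moves, play, and winner methods and simple player classes with a choose_move method; all of these names are illustrative rather than the actual Hex.py interface.

# Minimal sketch of a game-playing driver; the game/player interface is assumed.
import random

class RandomPlayer:
    def choose_move(self, game):
        return random.choice(game.legal_moves())

def play_games(game_factory, player1, player2, n_games=10, show=False):
    wins = [0, 0]
    for _ in range(n_games):
        game = game_factory()
        players = (player1, player2)
        turn = 0
        while game.winner() is None:
            move = players[turn].choose_move(game)
            game.play(move)
            if show:
                print(game)        # show the game turn by turn
            turn = 1 - turn
        wins[game.winner()] += 1   # winner() assumed to return 0 or 1
    return wins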
AlphaHex.py
contains the agent itself, which applies the general AlphaZero algorithm: a Monte Carlo Tree Search guided by the policy and value network.
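At the core of an AlphaZero-style agent is the PUCT rule that the tree search uses to pick which move to explore next, balancing the network's prior against the action values observed so far. The sketch below illustrates that rule, assuming per-child visit counts N, total action values W, and network priors P; it is not AlphaHex.py's exact code.

# Minimal sketch of AlphaZero-style PUCT selection; node structure is assumed.
import math

def puct_select(node, c_puct=1.0):
    """Pick the child action maximizing Q(s, a) + U(s, a)."""
    total_visits = sum(child.N for child in node.children.values())
    best_action, best_score = None, -float("inf")
    for action, child in node.children.items():
        q = child.W / child.N if child.N > 0 else 0.0          # mean action value
        u = c_puct * child.P * math.sqrt(total_visits) / (1 + child.N)  # exploration bonus
        if q + u > best_score:
            best_action, best_score = action, q + u
    return best_action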
TrainAlphaHexZero.py
contains a script that runs self-play for a specified number of iterations. In each iteration, the AlphaHex agent plays a specified number of games against itself, randomly collects 50% of the game data generated in that iteration, and saves it to a .npz file. It then trains the current best model on this game data for a specified number of epochs and evaluates the new model against the previous best for a specified number of evaluation iterations, writing the results to a .txt file. If the win rate is over a set threshold, the new model becomes the current best model used in the next iteration of self-play (a sketch of this loop follows the command below).
python3 TrainAlphaHexZero.py
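Putting the description above together, the outer loop can be sketched as follows. The helpers self_play, train, and evaluate are hypothetical stand-ins for the project's actual functions; the file names, the 50% subsampling, and the win-rate threshold simply mirror the description above.

# Minimal sketch of the self-play training loop; self_play, train, and
# evaluate are hypothetical helpers, and all default values are illustrative.
import random
import numpy as np

def train_loop(best_model, n_iterations=20, n_games=50,
               epochs=10, eval_games=20, win_threshold=0.55):
    for it in range(n_iterations):
        # 1. Self-play: collect (state, policy, value) examples for this iteration.
        examples = self_play(best_model, n_games)

        # 2. Keep a random 50% of the iteration's data and save it to a .npz file.
        sample = random.sample(examples, len(examples) // 2)
        states, policies, values = map(np.array, zip(*sample))
        np.savez(f"self_play_iter_{it}.npz",
                 states=states, policies=policies, values=values)

        # 3. Train a candidate model on the sampled data.
        candidate = train(best_model, states, policies, values, epochs)

        # 4. Evaluate the candidate against the current best and log the result.
        win_rate = evaluate(candidate, best_model, eval_games)
        with open("eval_results.txt", "a") as f:
            f.write(f"iteration {it}: win rate {win_rate:.2f}\n")

        # 5. Promote the candidate only if it beats the threshold.
        if win_rate > win_threshold:
            best_model = candidate
    return best_model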