An implementation of DeepMind's AlphaGo. Original paper from Google DeepMind: https://www.nature.com/articles/nature16961
- Start by cloning the git repository.
- Start training. Run train.py like so: python3 train.py --policy --value
  Specify both "--policy" and "--value" so that both networks are trained; the Monte Carlo tree search needs both to work. After training (which might take a while), the parameters are saved to 'policy_network_model.pth' and 'value_network_model.pth'. Training must run to completion for the parameters to be saved, so do not stop it in the middle. Optionally, pass "--samples <n>" to train on a smaller portion of the data set. The repo comes with around 15,000 total data examples (100 games with about 150 moves per game), so python3 train.py --policy --value --samples 1000 would train the networks on only 1000 data examples.
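The flags above could be handled with argparse along these lines. This is only a sketch of how such a command line might be parsed, not the repo's actual train.py:

```python
# Hypothetical argparse setup matching the flags described above
# ("--policy", "--value", "--samples"); everything else is an assumption.
import argparse

def build_parser():
    parser = argparse.ArgumentParser(description="Train AlphaGo networks")
    parser.add_argument("--policy", action="store_true",
                        help="train the policy network")
    parser.add_argument("--value", action="store_true",
                        help="train the value network")
    parser.add_argument("--samples", type=int, default=None,
                        help="train on only the first N data examples")
    return parser

args = build_parser().parse_args(["--policy", "--value", "--samples", "1000"])
print(args.policy, args.value, args.samples)  # → True True 1000
```

With this shape, omitting "--samples" leaves args.samples as None, which a training script could treat as "use the full data set".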
- Play the game by running: python3 mcts.py
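During play, an AlphaGo-style search combines the two trained networks: the value network contributes a mean action value Q, and the policy network contributes a prior P that biases exploration. A minimal sketch of that PUCT selection rule, with hypothetical names (not taken from mcts.py):

```python
import math

def puct_score(total_value, visits, prior, parent_visits, c_puct=1.5):
    # AlphaGo-style PUCT score for one child node (sketch; names and
    # c_puct value are assumptions, not from this repo's mcts.py).
    # Q: mean value of this child, accumulated from value-network evaluations.
    q = total_value / visits if visits > 0 else 0.0
    # U: exploration bonus weighted by the policy network's prior probability,
    # which shrinks as the child accumulates visits.
    u = c_puct * prior * math.sqrt(parent_visits) / (1 + visits)
    return q + u

def select_child(children, parent_visits):
    # During tree descent, pick the child maximizing Q + U.
    return max(
        children,
        key=lambda c: puct_score(c["total_value"], c["visits"],
                                 c["prior"], parent_visits),
    )
```

An unvisited move with a high policy prior scores highly at first, so the search tries it early; as visit counts grow, the value estimate Q dominates.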