Giraffe is an experimental chess engine based on temporal-difference reinforcement learning with deep neural networks. It discovers almost all its chess knowledge through self-play.
For more information, see: http://arxiv.org/abs/1509.01549
Giraffe is written in C++11.
If you decide to compile Giraffe yourself, please grab the neural network definition file (eval.t7) from the binary distribution. It must be in Giraffe's working directory when Giraffe is started.
To use Gaviota tablebases, set the path through the GaviotaTbPath option.
To use Gaviota tablebases with the Wb2Uci adapter, set "GaviotaTbPath=..." in Wb2Uci.eng.
Alternatively, set the GTBPath environment variable to point to the tablebases.
- The Makefile contains -ltcmalloc. libtcmalloc replaces malloc/free with another implementation with thread-local caching. It is optional and doesn't really matter for playing. It can be safely removed. Or you can install the library. It's in the libgoogle-perftools-dev package on Ubuntu (and probably other Debian-based distros). It makes training on many cores faster.
- The Makefile contains -march=native. If you want to do a compile that also runs on older CPUs, change it to something else.
- Only GCC 4.8 or later is supported for now. Intel C/C++ Compiler can be easily supported by just changing compiler options. MSVC is not supported due to use of GCC intrinsics. Patches welcomed to provide alternate code path for MSVC. Clang is not supported due to lack of OpenMP.
- Tested on Linux (GCC 4.9/5.3/6.0), OS X (GCC 4.9), Windows (MinGW-W64 GCC 5.3). GCC versions earlier than 4.8 are definitely NOT supported, due to broken regex implementation in libstdc++.
- Giraffe uses Torch for training. It can be built with or without Torch. Torch is not required if you just want to play against it. See the HAS_TORCH flag in Makefile.
Training Giraffe is a multi-step process that will take more than a week on a quad core machine if you want the highest quality results. Using a higher core count machine is recommended (about 3 days on a 20 cores Haswell Xeon).
Prepare a large file with many positions from real games. One line per position in FEN, that's all. No labels are needed. They don't have to be high quality games. I have extracted 5 million random positions from a CCRL dump. It's available for download on the Downloads page. Let's call this file ccrl.fen
Train the evaluation network:
mkdir trainingResults OMP_NUM_THREADS=n ./giraffe tdl ccrl.fen sts.epd
where n is the number of cores you have, and sts.epd is the EPD you want to test progress on.
It will periodically take a snapshot of the network, and store it in trainingResults/. You can stop the training (Ctrl-C) at any time.
Copy the latest file from trainingResults/ into the parent directory (where giraffe is), and rename it to eval.t7.