First try at reinforcement leaning with Q-learning for the dice game Ten-Thousand (Zehntausend).
python ./tenk/game.py
Train and watch the ai play the game for an AI that processes all states and actions in a single dictionary.
python ./tenk/ai/single.py
Train and watch the ai play the game for an AI that processes all states and actions separately for choosing dices and choosing to re-roll.
python ./tenk/ai/split.py