Based on code from PettingZoo and Tianshou
You can watch a number of pretrained agents play against each other.
Note: mix_train.pth
and mix_train_adv.pth
can only play against each other due to architecture differences
`
Agent trained for 60 epochs against player that plays random moves
Agent trained against itself for 60 epochs
Agent trained with both self play and against random players for 75 epochs
Same as mixed_train.pth
but model has two layers with 256 neurons. Trained for 60 epochs
Same as mix_train.pth
but trained for 100 epochs