Vanilla Policy Gradient VPG reinforcement learning algorithm in PyTorch Tested on CartPole-v0 environment from OpenAI Gym Writeup at vitez.me/vanilla-policy-gradient