Reinforce Adventure This Repository contains my implementation of popular algorithms on popular environments. Repository contains code for - Inverted Pendulum Problem DDPG ( Deep Deterministic Policy Gradient ) Also on official keras-examples Cartpole Problem Actor-Critic Monte Carlo Method PPO ( Proximal Policy Optimization ) Q-Learning with Neural Net Vanilla Policy Gradient Lunar Lander Actor-Critic Mountain Car Q-Learning with Neural Net Slot Machine Bandit problem E-greedy & Thompson Sampling