Left: a Deep Q-Network (DQN) agent, trained for several hours on a MacBook, plays Snake. Right: cell-wise attributions for the policy network, obtained by taking the gradient of the network's output with respect to its inputs.
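
The gradient-based attribution in the caption can be illustrated with a minimal sketch. The repository's network and framework are not shown here, so everything below is an assumption: a toy one-hidden-layer ReLU network in plain Python, hypothetical sizes (a 4x4 grid, 8 hidden units, 4 actions), random weights, and the choice to attribute the greedy action's Q-value. In practice an autodiff framework would compute this gradient; the manual backward pass only makes the idea explicit.

```python
import random

def q_values(x, W1, b1, W2, b2):
    """Forward pass of a toy ReLU network; returns (hidden pre-activations, Q-values)."""
    z = [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(W1, b1)]
    h = [max(0.0, zi) for zi in z]
    q = [sum(w * hi for w, hi in zip(row, h)) + b for row, b in zip(W2, b2)]
    return z, q

def saliency(x, W1, b1, W2, b2):
    """Gradient of the greedy action's Q-value with respect to each input cell."""
    z, q = q_values(x, W1, b1, W2, b2)
    a = max(range(len(q)), key=q.__getitem__)  # greedy action (an assumption)
    # dQ_a/dh_j = W2[a][j]; ReLU passes the gradient only where z_j > 0.
    dz = [W2[a][j] if z[j] > 0 else 0.0 for j in range(len(z))]
    # Chain rule through the first layer: dQ_a/dx_i = sum_j dz_j * W1[j][i].
    grad = [sum(dz[j] * W1[j][i] for j in range(len(z))) for i in range(len(x))]
    return a, grad

# Hypothetical sizes and random weights, for illustration only.
rng = random.Random(0)
n_cells, n_hidden, n_actions = 16, 8, 4
W1 = [[rng.uniform(-1, 1) for _ in range(n_cells)] for _ in range(n_hidden)]
b1 = [0.0] * n_hidden
W2 = [[rng.uniform(-1, 1) for _ in range(n_hidden)] for _ in range(n_actions)]
b2 = [0.0] * n_actions
x = [rng.random() for _ in range(n_cells)]  # flattened 4x4 grid

action, grad = saliency(x, W1, b1, W2, b2)
# Reshape the flat gradient into grid rows to get one attribution per cell.
heatmap = [grad[r * 4:(r + 1) * 4] for r in range(4)]
```

Each entry of `heatmap` estimates how much a small change in that cell would change the chosen action's Q-value, which is one common reading of a cell-wise attribution map.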
