Skip to content
This repository has been archived by the owner on Mar 1, 2024. It is now read-only.

Latest commit

 

History

History
15 lines (11 loc) · 763 Bytes

README.md

File metadata and controls

15 lines (11 loc) · 763 Bytes

useless-thing

Useless try to create neural network.

The idea

Instead of using gradient descent for learning my neural network I've tried simplier method. I tried to do the following:
layer[j]=layer[j]+activity[i]*sign(layer[j])*reward*lr
where layer[j] is weight of ith input of each neuron. activity[i] is latest value of that input. i is number of input. So it increase the absolute value of weight if action was rewarded and decrease it if not. But unfortanetely it doesn't work.

Testing it

This network was tested on LunarLanderContinous-v2 from gym library. On average it do 1 landing over 1000 iterations.

Running

python rl1.py
NOTICE: gym and Box2D libraries are required to run. Python version>=3.6