useless-thing

A useless attempt to create a neural network.

The idea

Instead of training my neural network with gradient descent, I tried a simpler method. The update rule is:
layer[j]=layer[j]+activity[i]*sign(layer[j])*reward*lr
where layer[j] is the weight of the i-th input of each neuron, activity[i] is the latest value of that input, and i is the index of the input. The rule increases the absolute value of a weight if the action was rewarded and decreases it if not. Unfortunately, it doesn't work.
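
A minimal sketch of the rule above in plain Python, for a single neuron. This is not the code from rl1.py: the names sign and update_weights are hypothetical, and the sketch assumes one weight per input (the same index for the weight and its activity) and sign(0) = 0.

```python
def sign(x):
    """Return -1, 0 or 1 depending on the sign of x (assumed: sign(0) == 0)."""
    return (x > 0) - (x < 0)


def update_weights(weights, activity, reward, lr):
    """Apply w += activity * sign(w) * reward * lr to each weight of one neuron.

    Assumption: weights[i] is the weight attached to input i, and
    activity[i] is the latest value seen on that input.
    """
    return [w + a * sign(w) * reward * lr for w, a in zip(weights, activity)]


# Illustrative values only, not taken from the project.
weights = [0.5, -0.25]
activity = [1.0, 2.0]

# Positive reward pushes each weight away from zero.
rewarded = update_weights(weights, activity, reward=1.0, lr=0.1)

# Negative reward pulls each weight toward zero.
punished = update_weights(weights, activity, reward=-1.0, lr=0.1)

print(rewarded, punished)
```

Under these assumptions the absolute value grows on reward only while the input activity is positive, and a weight that is exactly zero never changes because sign(0) is 0.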

Testing it

This network was tested on LunarLanderContinuous-v2 from the gym library. On average it manages 1 landing per 1000 iterations.

Running

python rl1.py
NOTICE: the gym and Box2D libraries are required to run it. Python version >= 3.6.
