This repository has been archived by the owner on Mar 1, 2024. It is now read-only.

0xf104a/useless-thing

useless-thing

A useless attempt to create a neural network.

The idea

Instead of using gradient descent to train my neural network, I tried a simpler method. The update rule is:
layer[j]=layer[j]+activity[i]*sign(layer[j])*reward*lr
where layer[j] is the weight of the i-th input of each neuron, activity[i] is the latest value of that input, and i is the index of the input. The rule therefore increases the absolute value of a weight if the action was rewarded and decreases it if not. Unfortunately, it doesn't work.
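
The rule above can be sketched in NumPy as follows. This is only an illustration, not the code in rl1.py: the function name `sign_update`, the weight layout (one row per neuron, one column per input), and the sample values are assumptions made for the example.

```python
import numpy as np

def sign_update(weights, activity, reward, lr=0.01):
    """Apply the sign-based rule: w += activity * sign(w) * reward * lr.

    weights  : shape (n_neurons, n_inputs), assumed layout for this sketch
    activity : shape (n_inputs,), latest value of each input
    reward   : scalar reward received for the last action
    lr       : learning rate
    """
    weights = np.asarray(weights, dtype=float)
    activity = np.asarray(activity, dtype=float)
    # activity broadcasts over the rows, so every neuron sees the same inputs.
    return weights + activity * np.sign(weights) * reward * lr

# Hypothetical values, chosen only to show the direction of the update.
w = np.array([[0.5, -0.2], [-1.0, 0.3]])
x = np.array([1.0, 2.0])
rewarded = sign_update(w, x, reward=1.0, lr=0.1)   # weights move away from zero
punished = sign_update(w, x, reward=-1.0, lr=0.1)  # weights move toward zero
```

Two properties follow directly from the formula: a weight that is exactly zero never changes, because sign(0) is 0, and the "grow on reward" behaviour only holds for positive inputs, since a negative activity[i] flips the direction of the update.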

Testing it

This network was tested on LunarLanderContinuous-v2 from the gym library. On average it achieves 1 landing per 1000 iterations.

Running

python rl1.py
NOTICE: the gym and Box2D libraries are required to run it. Python version >= 3.6.