GPTs

This is intended to be a step by step guide on how to implement any neural network architecture between linear regression and the transformer. This repo will guide you step by step with incremental changes to the code upto complex architectures. You are free to experiment with them as much as you want and i encourage you to do that.

Some compromises are being made in the interest of speed, but i tried to make this maximally useful and as intuitive as i could.
I think this goes without saying, that you shouldn't use this code in prod. It's optimized for understanding, not stability or performance.

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
__pycache__		__pycache__
datasets		datasets
.gitignore		.gitignore
1 - Regression.ipynb		1 - Regression.ipynb
10 - Mamba (WIP).ipynb		10 - Mamba (WIP).ipynb
2 - Perceptron for Images.ipynb		2 - Perceptron for Images.ipynb
3 - Perceptron for Text.ipynb		3 - Perceptron for Text.ipynb
4 - PyTorch.ipynb		4 - PyTorch.ipynb
5 - DeepNets.ipynb		5 - DeepNets.ipynb
6 - RNN.ipynb		6 - RNN.ipynb
7 - Attention.ipynb		7 - Attention.ipynb
8 - MoE.ipynb		8 - MoE.ipynb
9 - Interpretability (WIP).ipynb		9 - Interpretability (WIP).ipynb
LucaM185.py		LucaM185.py
mamba_model.pth		mamba_model.pth
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GPTs

About

Releases

Packages

Languages

LucaM185/GPTs-from-scratch

Folders and files

Latest commit

History

Repository files navigation

GPTs

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages