Bayesian Flow Networks

A PyTorch implementation of Bayesian Flow Networks

The model currently uses a non-causal version of Llama 2. TinyStories models matching https://github.com/karpathy/llama2.c/tree/master are currently being trained.

I am going to use this repository to explore the training dynamics of this new class of models. I will maintain a minimal BFN implementation in Minimal.ipynb. Everything else is an early work in progress.

Features

Discrete model with continuous-time loss, training and sampling (completed)
SOTA performance on XOR dataset
TinyStories 15M Llama 2 initial code
TinyStories weights (training)
Wiki Text8 Dataset
Bayesian Flow GPT-2 Scale
Fancy Visuals
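
As a rough illustration of the first feature (discrete model with continuous-time loss), the sketch below computes the loss for a single discrete variable. It is not the code from Minimal.ipynb or model.py. It assumes the formulation from the BFN paper: accuracy schedule beta(t) = beta(1) * t^2, flow sample y ~ N(beta(t) * (K * e_x - 1), beta(t) * K * I), theta = softmax(y), and loss K * beta(1) * t * ||e_x - e_hat||^2. The function names, the value of `BETA_1`, and the `identity_network` stand-in are hypothetical, and plain Python is used instead of PyTorch so the example runs without dependencies.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_flow_params(x, t, num_classes, beta_1, rng):
    """Sample the input-distribution parameters theta for class index x at time t.

    Assumes beta(t) = beta_1 * t**2 and
    y ~ N(beta(t) * (K * e_x - 1), beta(t) * K * I), theta = softmax(y).
    """
    beta_t = beta_1 * t ** 2
    std = math.sqrt(beta_t * num_classes)
    y = []
    for k in range(num_classes):
        one_hot = 1.0 if k == x else 0.0
        mean = beta_t * (num_classes * one_hot - 1.0)
        y.append(rng.gauss(mean, std))
    return softmax(y)


def continuous_time_loss(x, t, predicted_probs, num_classes, beta_1):
    """K * beta_1 * t * ||e_x - predicted_probs||^2 for one variable."""
    sq_err = sum(
        ((1.0 if k == x else 0.0) - p) ** 2
        for k, p in enumerate(predicted_probs)
    )
    return num_classes * beta_1 * t * sq_err


def identity_network(theta, t):
    """Hypothetical stand-in for a trained model: echoes its input distribution."""
    return theta


rng = random.Random(0)
K, BETA_1 = 2, 3.0            # BETA_1 is an assumed value, not taken from this repo
x = 1                         # true class index
t = rng.random()              # t ~ U(0, 1)
theta = sample_flow_params(x, t, K, BETA_1, rng)
loss = continuous_time_loss(x, t, identity_network(theta, t), K, BETA_1)
print(f"t={t:.3f} theta={theta} loss={loss:.4f}")
```

In a real training loop the stand-in would be replaced by the network's predicted class probabilities, and the loss would be summed over all variables in a sequence and averaged over a batch.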


ghidav/Bayesian-Flow-Networks

Folders and files

Latest commit

History

Repository files navigation

Bayesian Flow Networks

Features

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages