Skip to content

Sparse Transformer with limited attention span in PyTorch

Notifications You must be signed in to change notification settings

santient/sparse-transformer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 

Repository files navigation

sparse-transformer

Sparse Transformers in PyTorch: limited attention span and projection onto a smaller space

Linformer paper: https://arxiv.org/abs/2006.04768

Limited attention span transformers: simply limits maximum attention distance, using sparse tensors. Note: sparse tensors are WIP in PyTorch so this may not work with all versions.

About

Sparse Transformer with limited attention span in PyTorch

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages