# asentmax

Code for "Long-Context Generalization with Sparse Attention".

## Requirements

Install AdaSplash for the entmax attention kernel.

WIP
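For readers new to entmax attention, here is a minimal pure-Python sketch of sparsemax, the alpha = 2 special case of entmax, shown next to softmax. It is not AdaSplash's kernel and not this repository's code; the function names `sparsemax` and `softmax` are chosen here for illustration only. It shows the property that matters for sparse attention: low-scoring positions receive exactly zero weight.

```python
import math


def softmax(scores):
    """Dense baseline: every position gets a strictly positive weight."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sparsemax(scores):
    """Euclidean projection of scores onto the probability simplex.

    This is entmax with alpha = 2. Illustrative reference only,
    not the AdaSplash implementation.
    """
    z = sorted(scores, reverse=True)
    cumsum = 0.0
    tau = 0.0
    for k, z_k in enumerate(z, start=1):
        cumsum += z_k
        # The k-th largest score is in the support while
        # 1 + k * z_k exceeds the sum of the top-k scores.
        if 1.0 + k * z_k > cumsum:
            tau = (cumsum - 1.0) / k  # threshold for the current support
    # Scores at or below the threshold are clipped to exactly zero.
    return [max(s - tau, 0.0) for s in scores]


attention_scores = [1.0, 0.5, -1.0]
dense_weights = softmax(attention_scores)
sparse_weights = sparsemax(attention_scores)
```

With these scores, `sparse_weights` assigns zero to the last position, while every entry of `dense_weights` is nonzero.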