Skip to content

Latest commit

 

History

History
9 lines (8 loc) · 280 Bytes

TODO.md

File metadata and controls

9 lines (8 loc) · 280 Bytes

TODO

  • READ: Mixture of Experts(MoE) model
  • REVISE: LINALG
  • READ: on external memory in transformers for longer context
  • READ: Power scaling laws & Chinchilla paper
  • CODE: Tiling for improving NN throughput
  • READ: Optmization theory
  • EXPLORE: HF Accelerate, FlexGEN, PyG