Skip to content
@FasterDecoding

FasterDecoding

Think deeper, decode faster

Pinned Loading

  1. Medusa Medusa Public

    Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

    Jupyter Notebook 2k 133

Repositories

Showing 4 of 4 repositories
  • Medusa Public

    Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

    FasterDecoding/Medusa’s past year of commit activity
    Jupyter Notebook 2,022 Apache-2.0 133 33 3 Updated Jun 25, 2024
  • BitDelta Public
    FasterDecoding/BitDelta’s past year of commit activity
    Jupyter Notebook 170 Apache-2.0 13 4 1 Updated May 11, 2024
  • SnapKV Public
    FasterDecoding/SnapKV’s past year of commit activity
    Python 143 4 10 0 Updated May 1, 2024
  • REST Public

    REST: Retrieval-Based Speculative Decoding, NAACL 2024

    FasterDecoding/REST’s past year of commit activity
    C 153 Apache-2.0 9 6 0 Updated Apr 22, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…