GitHub / fasterdecoding 1 project

FasterDecoding

Think deeper, decode faster

Projects

Updated 6 months ago

medusa-llm • Rank 13.6 • Science 64%

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads