FasterDecoding
Think deeper, decode faster
Projects
Updated 6 months ago
medusa-llm
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads