Two different tricks for fast LLM inference

(seangoedecke.com)

194 points | by swah 4 days ago ago

81 comments