NanoQuant: Efficient Sub-1-Bit Quantization of Large Language Models

(arxiv.org)

10 points | by chrsw 2 days ago ago

No comments yet.