8 points | by mxfeinberg 12 hours ago ago
3 comments
32 bits vs 4 bits it looks like
Yup, and unlike the original turboquant paper, my implementation is pinned to using a 4 bit code book so I could use SIMD kernels for performance.
[dead]
32 bits vs 4 bits it looks like
Yup, and unlike the original turboquant paper, my implementation is pinned to using a 4 bit code book so I could use SIMD kernels for performance.
[dead]