Writing Speed-of-Light Flash Attention for 5090 in CUDA C++

(gau-nernst.github.io)

128 points | by dsr12 12 hours ago ago

23 comments