
Hacker News: Front Page
shared a link post in group #Stream of Goodies

github.com
GitHub - tspeterkim/flash-attention-minimal: Flash Attention in ~100 lines of CUDA (forward pass only)