Hackernews posts about Flash Attention 4
- Flash Attention 4, running near matmul speeds [pdf] (github.com)
- FlashAttention-4 (research.colfax-intl.com)
- Flash Attention 4 (www.together.ai)
- We reverse-engineered Flash Attention 4 (modal.com)
- We reverse-engineered Flash Attention 4 (modal.com)
- GPU Mode Lecture 80: How FlashAttention 4 Works [video] (www.youtube.com)
- Show HN: ThinkTotem – turn boring books into engaging conversations (thinktotem.com)