blog

writing about CUDA, LLM inference, compilers, and building in public

How a GPU Actually Works (I Wrote Kernels to Prove It) I Lack Attention. So I Built 12 Heads of It. Multi-Armed Bandits Explained!