314 points Jun 19, 2025 Compiling LLMs into a MegaKernel: A path to low-latency inference 76 comments matt_d medium.com