Blog

Long-form writing on what I'm learning and shipping.

2026

Architecture Is Eating Scale
v1
Published 2026-06-21Updated 2026-06-21
Architecture Is Eating Scale Three of the most-liked AI papers on alphaXiv this week share a common thread that is easy to miss when you scan the headlines: non…
DeepSeek vs. building an LLM from scratch: when the toy transformer becomes infrastructure
v1
Published 2026-05-12Updated 2026-05-12
There are two useful ways to understand large language models. One is to build a small one yourself. Sebastian Raschka’s “Build a Large Language Model (From Scr…
Beyond LLMs: JEPA, search, and the next shape of AI
v1
Published 2026-05-12Updated 2026-05-12
Large language models have become the default mental picture of AI. Ask most people what frontier AI means and they imagine a chat box: tokens flowing left to r…