Blog
Long-form writing on what I'm learning and shipping.
2026
DeepSeek vs. building an LLM from scratch: when the toy transformer becomes infrastructure
v1Published 2026-05-12Updated 2026-05-12There are two useful ways to understand large language models. One is to build a small one yourself. Sebastian Raschka’s “Build a Large Language Model (From Scr…
Beyond LLMs: JEPA, search, and the next shape of AI
v1Published 2026-05-12Updated 2026-05-12Large language models have become the default mental picture of AI. Ask most people what frontier AI means and they imagine a chat box: tokens flowing left to r…