Ask HN: Build Your Own LLM?

runjake · 2025-10-10T18:54:57 1760122497

Since you're posting here, you're looking for the shortcut.

The shortcut is Karpathy's "Let's Build GPT: from scratch, in code, spelled out" video:

https://www.youtube.com/watch?v=kCc8FmEb1nY

Then there is a good video that dives into LLMs and how they work that is quite approachable:

https://www.youtube.com/watch?v=7xTGNNLPyMI

From there, flesh out knowledge with his other videos, where he goes both extremely light and extremely deep:

https://www.youtube.com/@AndrejKarpathy/videos

Anyway, I really like Karpathy's videos because he's very good at explaining LLMs at every level.

khamidou · 2025-10-11T17:51:30 1760205090

Sorry to self-promote but I did exactly that a few months back: https://khamidou.com/gpt2/

Generally, I think the Karpathy tutorials are a good starting point, but they're very mathy. People will tell you that you only need high school math to understand LLMs, but a lot of the abstractions and concepts he uses are a bit foreign to programmers.

I found that rebuilding inference for a known model taught me a lot more than passively sitting through the videos and maybe retyping his code. You should try it with something simple, like a model from a few years back!
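
For a rough sense of what "rebuilding inference" boils down to, here is a minimal sketch of a greedy decoding loop. It is not taken from the linked write-up. `toy_logits` and `VOCAB` are made-up stand-ins for a real model's forward pass and tokenizer vocabulary. With a real model you would load the published weights and compute the logits with the actual transformer layers, but the outer loop would look broadly similar.

```python
import math

# Hypothetical 4-token vocabulary; a real model has tens of thousands of entries.
VOCAB = ["<eos>", "a", "b", "c"]


def toy_logits(tokens):
    """Stand-in for a model forward pass (an assumption, not a real model).

    Returns one score per vocabulary entry given the tokens so far.
    It cycles a -> b -> c -> a and prefers <eos> once 5 tokens exist.
    Assumes `tokens` is non-empty.
    """
    last = tokens[-1]
    if len(tokens) >= 5:
        preferred = 0  # <eos>
    else:
        preferred = (last % (len(VOCAB) - 1)) + 1
    return [5.0 if i == preferred else 0.0 for i in range(len(VOCAB))]


def softmax(logits):
    """Turn raw scores into probabilities (shifted by the max for stability)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def greedy_generate(prompt, max_new_tokens=10, eos_id=0):
    """Append the most likely next token until <eos> or the token budget."""
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        probs = softmax(toy_logits(tokens))
        next_id = max(range(len(probs)), key=probs.__getitem__)
        tokens.append(next_id)
        if next_id == eos_id:
            break
    return tokens


if __name__ == "__main__":
    ids = greedy_generate([1])
    print(" ".join(VOCAB[i] for i in ids))
```

Swapping the `max(...)` line for a random draw from `probs` would turn this into sampling instead of greedy decoding.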

sfmz · 2025-10-10T11:56:57 1760097417

Andrej Karpathy: Let's build GPT: from scratch, in code, spelled out. https://www.youtube.com/watch?v=kCc8FmEb1nY

beardyw · 2025-10-10T12:45:36 1760100336

Andrej Karpathy's nanoGPT is reasonably accessible and easy to run.

https://github.com/karpathy/nanoGPT

liqilin1567 · 2025-10-14T10:04:01 1760436241

Karpathy has a new repo: https://github.com/karpathy/nanochat. It's a full-stack implementation of an LLM like ChatGPT in a single, clean, minimal, hackable, dependency-lite codebase.

2ro · 2025-10-10T10:41:07 1760092867

How about this?

https://mathstodon.xyz/@empty/115088095028020763

retube · 2025-10-10T11:34:26 1760096066

thanks

pm2222 · 2025-10-10T11:31:48 1760095908

https://www.amazon.com/Build-Large-Language-Model-Scratch/dp...

ryanchants · 2025-10-10T16:54:17 1760115257

I'd get it straight from Manning to save a few bucks and cut out the middleman: https://www.manning.com/books/build-a-large-language-model-f...

retube · 2025-10-10T11:34:19 1760096059

Thanks. Looks promising.