
This weekend I cracked into nanoGPT (https://github.com/karpathy/nanoGPT), an older but fabulous learning exercise where you build and train a crappy Shakespeare GPT with ~0.8M parameters on a CPU. The results are about what you'd expect from that (they suck), but you can start to feel the magic, especially if you're not a deep learning professional and just want to poke around and hack on it.
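For anyone who wants to try it, the whole thing is a handful of commands; these are roughly the scaled-down CPU settings from the nanoGPT README:

    # tokenize the tiny Shakespeare corpus at the character level
    python data/shakespeare_char/prepare.py

    # train a ~0.8M-parameter model on CPU (torch.compile disabled)
    python train.py config/train_shakespeare_char.py \
        --device=cpu --compile=False --eval_iters=20 --log_interval=1 \
        --block_size=64 --batch_size=12 --n_layer=4 --n_head=4 \
        --n_embd=128 --max_iters=2000 --lr_decay_iters=2000 --dropout=0.0

    # generate some (bad) Shakespeare from the checkpoint
    python sample.py --out_dir=out-shakespeare-char --device=cpu

Training takes a few minutes on a recent laptop, and the samples come out looking vaguely Elizabethan.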

I started writing up a blog post about my weekend with nanoGPT, but it's not done yet... it would have been great to link it here, lol. Oh well.



It's a useful exercise. A lot of good ML work is first validated at small scale.

And this new example goes even further: it adds SFT for instruction following and tool use, as well as RLVR. That makes for a more useful baseline.


Absolutely, it's wildly fun to read the outputs of even a tiny 0.8M model trained on a CPU. After playing around with it for a day, I have a much better understanding of the transformer architecture. No doubt this repo will prompt some folks to try out their own ideas, and some of them will turn into new researchers in the field.


The Shakespeare code, tuned a little with different training data, does a good job of generating Magic: The Gathering Commander decks.


Somewhat related: I wrote an MTG card generator based on nanoGPT a while ago that I think produces pretty good results for a 1M-parameter model.

The really neat thing about this is that WotC makes a few thousand new cards each year, so my training data set just grows over time and the model gets better with no effort on my part.

https://github.com/jlwitthuhn/TCGGPT
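If anyone wants to reproduce this with their own corpus, the data prep in nanoGPT is small enough to rewrite in a few minutes. Here's a minimal sketch of a character-level prepare script in the style of nanoGPT's data/shakespeare_char/prepare.py; it's not the actual TCGGPT code, and cards.txt (one card's text per line) plus the 90/10 split are my own placeholders:

    import pickle
    import numpy as np

    # read the raw corpus (placeholder file: one card's text per line)
    with open('cards.txt', 'r', encoding='utf-8') as f:
        data = f.read()

    # build a character-level vocabulary over everything in the corpus
    chars = sorted(set(data))
    stoi = {ch: i for i, ch in enumerate(chars)}
    itos = {i: ch for i, ch in enumerate(chars)}

    def encode(s):
        return [stoi[c] for c in s]

    # 90/10 train/val split, stored as uint16 token ids
    n = len(data)
    train_ids = np.array(encode(data[:int(n * 0.9)]), dtype=np.uint16)
    val_ids = np.array(encode(data[int(n * 0.9):]), dtype=np.uint16)
    train_ids.tofile('train.bin')
    val_ids.tofile('val.bin')

    # meta.pkl is what lets sample.py decode token ids back into text
    with open('meta.pkl', 'wb') as f:
        pickle.dump({'vocab_size': len(chars), 'itos': itos, 'stoi': stoi}, f)

Drop the three output files into a data/<your_dataset>/ directory, point the training config's dataset name at it, and the rest of the pipeline is unchanged.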


It would be interesting to come up with a use case that actually requires a freshly trained model and isn't just something generic models can already do, especially with a 1M-token context window.


I'd love more details on this. This is exactly the type of project I'd like to dabble in to get more up to speed.


People have been doing this for a while.

https://x.com/roborosewater

https://bsky.app/profile/roborosewaterm.bsky.social

You can see the arrival of RLHF/ChatGPT in the timeline, because the text generation suddenly becomes much more coherent and also much less interesting. You have to go back to older tech for surrealism, because nobody will let you see the good stuff (the base models).


I guess I was much more interested in being able to work with an LLM to create good, synergistic Commander decks and less interested in generating custom Magic cards.

I'm sure I can dig up info on how to do this and piece it together, but I thought OP might have a guide specifically for it.


FWIW, there was a pretty popular post on HN about generating MTG cards using AI a couple of years back, but I believe their approach was to fine-tune an existing LLM.

https://news.ycombinator.com/item?id=37427854


I like the idea of special-purpose toy models. How did you tune the code, and what dataset did you use?



