In the previous post we explored how to get NanoGPT training on Modal. There was quite a bit to that, so I capped that post off there and left the text generation part until now. Let’s do that, and then try some more stuff out with NanoGPT. Let’s make some Shakespam With all the…
Tag: NanoGPT
No local GPU? No Problem! Running Andrej Karpathy’s NanoGPT on Modal.com
Andrej Karpathy released a series of timeless lectures teaching us mortal 9-to-5 programmers how to train an “AI” language model from scratch, a bit like the GPT-4 or ChatGPT you may have heard of. He goes into a deep dive that includes building your own tiny PyTorch from scratch, setting up bigram models, and simple…