I explored in a previous post how to run nanoGPT on modal – both the training and sampling. It was successful, but tiresome. There were a lot of changes to the downloaded code which made me unhappy. If I want to try different projects out that are on Github etc. I don’t want to be…
Tag: MachineLearning
Modal.com and NanoGPT continued: producing output; using tiktoken for bigger tokens
In the previous post we explored how to get NanoGPT training on Modal. There was quite a bit to that, so I left the text generation part to now just to cap that post off. Let’s do that now and then try some more stuff out with NanoGPT. Let’s make some Shakespam With all the…
No local GPU? No Problem! Running Andrej Karpathy’s NanoGPT on Modal.com
Andrej Karpathy released a series of timeless lectures teaching us mortal 9-5 programmers from scratch how to train an “AI” language model, a bit like that GPT4 or ChatGPT you may have heard of. He goes into a deep dive that includes building your own tiny Pytorch from scratch, setting up bigram models, and simple…