Let's reproduce GPT-2 (124M)
Andrej Kaparthy
A comprehensive guide on reproducing and training the GPT-2 124M model using PyTorch, comparing its performance to OpenAI's pre-trained models. Taught by Andrej Kaparthy, a founding member of OpenAI and a ex-senior director of AI at Tesla.