Let's reproduce GPT-2 (124M)

By Andrej Karpathy

Free

Added 7 months ago

Description

A comprehensive guide to reproducing and training the GPT-2 (124M) model in PyTorch and comparing its performance with OpenAI's pre-trained models. Taught by Andrej Karpathy, a founding member of OpenAI and former senior director of AI at Tesla.