Episode #452 from 1:47:14

Post-training

Let's talk, if we could, a bit about post-training. It seems that the modern post-training recipe has a little bit of everything: supervised fine-tuning, RLHF, constitutional AI with RLAIF-
Best acronym.

Dario Amodei: Anthropic CEO on Claude, AGI & the Future of AI & Humanity