Course Overview
The Three-Stage Pipeline
Modern LLMs are built through three distinct stages:
- Pre-Training: Learn language patterns from internet-scale data
- Post-Training (SFT): Learn helpful assistant behavior from conversations
- Reinforcement Learning: Discover novel reasoning through trial and error