AI Alignment
1. AI and the years ahead
2. What is AI alignment?
3. Reinforcement learning from human (or AI) feedback
4. Scalable oversight
5. Robustness unlearning and control
6. Mechanistic interpretability
7. Technical governance approaches
8. Contributing to AI safety
9. Rapidly testing your project
10. Developing your project
11. Further developing your project
12. Building in public
Unit 4: Scalable oversight