Unit 1: The technical challenge with AI
Building AI safely is hard
Resources (1 hr 50 mins)
- What is AI alignment?
Create a free account to track your progress and unlock access to the full course content.
- Specification Gaming: How AI Can Turn Your Wishes Against You
Create a free account to track your progress and unlock access to the full course content.
- Why alignment could be hard with modern deep learning
Create a free account to track your progress and unlock access to the full course content.
- Recent Frontier Models Are Reward Hacking
Create a free account to track your progress and unlock access to the full course content.
- If you remember one AI disaster, make it this one
Create a free account to track your progress and unlock access to the full course content.
- Multi-Agent Risks from Advanced AI
Create a free account to track your progress and unlock access to the full course content.
Optional Resources
- Reframing AGI Threat Models
Create a free account to track your progress and unlock access to the full course content.
- What failure looks like
Create a free account to track your progress and unlock access to the full course content.
- AI Could Defeat All Of Us Combined
Create a free account to track your progress and unlock access to the full course content.
- Why Would AI "Aim" To Defeat Humanity?
Create a free account to track your progress and unlock access to the full course content.