AI Alignment (2023)

0. Introduction to Machine Learning (23)

Resources: Introduction to Machine Learning (23)

45min

Exercises: Introduction to Machine Learning (23)

1. Artificial General Intelligence

Resources: Artificial General Intelligence

1h 50min

Exercises: Artificial General Intelligence

2. Reward misspecification and instrumental convergence

Resources: Reward misspecification and instrumental convergence

1h 20min

Exercises: Reward misspecification and instrumental convergence

3. Goal misgeneralisation

Resources: Goal misgeneralisation

1h 40min

Exercises: Goal misgeneralisation

4. Task decomposition for scalable oversight

Resources: Task decomposition for scalable oversight

1h 50min

Exercises: Task decomposition for scalable oversight

5. Adversarial techniques for scalable oversight

Resources: Adversarial techniques for scalable oversight

1h 25min

Exercises: Adversarial techniques for scalable oversight

6. Interpretability

Resources: Interpretability

2h

Exercises: Interpretability

7. Governance (Alignment 23)

Resources: Governance (Alignment 23)

1h 34min

Exercises: Governance (Alignment 23)

8. Agent foundations

9. Careers and Projects

Resources: Careers and Projects

55min

Exercises: Careers and Projects

Unit 8: Agent foundations

Exercises: Agent foundations

