Unit 7: Technical governance approaches
Resources: Technical governance approaches
Resources (1 hr 20 mins)
- Emerging processes for frontier AI safety
Create a free account to track your progress and unlock access to the full course content.
- Computing Power and the Governance of AI
Create a free account to track your progress and unlock access to the full course content.
- AI Governance Needs Technical Work
Create a free account to track your progress and unlock access to the full course content.
- AI Watermarking Won't Curb Disinformation
Create a free account to track your progress and unlock access to the full course content.
- We need a Science of Evals
Create a free account to track your progress and unlock access to the full course content.
Optional Resources
- Model Evaluation for Extreme Risks
Create a free account to track your progress and unlock access to the full course content.
- Evaluating Language-Model Agents on Realistic Autonomous Tasks
Create a free account to track your progress and unlock access to the full course content.
- Red-teaming language models with language models
Create a free account to track your progress and unlock access to the full course content.
- Black-Box Access is Insufficient for Rigorous AI Audits
Create a free account to track your progress and unlock access to the full course content.
- Measuring Massive Multitask Language Understanding
Create a free account to track your progress and unlock access to the full course content.
- Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
Create a free account to track your progress and unlock access to the full course content.
- Anthropic's Responsible Scaling Policy
Create a free account to track your progress and unlock access to the full course content.
- Responsible Scaling Policies Are Risk Management Done Wrong
Create a free account to track your progress and unlock access to the full course content.
- Increased Compute Efficiency and the Diffusion of AI Capabilities
Create a free account to track your progress and unlock access to the full course content.
- What Does It Take To Catch a Chinchilla? Verifying Rules on Large-Scale Neural Network Training via Compute Monitoring
Create a free account to track your progress and unlock access to the full course content.
- Azure OpenAI Service abuse monitoring
Create a free account to track your progress and unlock access to the full course content.
- Challenges in evaluating AI systems
Create a free account to track your progress and unlock access to the full course content.