Unit 2: Training safer models
Feeding AI ‘good’ data
Resources (35 mins)
- What is input data filtration in AI safety?
- Deep Ignorance
- Enhancing Model Safety through Pretraining Data Filtering
- A small number of samples can poison LLMs of any size