AI Alignment (2024 Mar) Project

Deceiving LLMs using LLMs — Attempts to elicit information through Multi-Agent Debate

Konstantinos Tsiaras • Top submission • July 2024