AI Alignment (2024 Jun) Project

New Capabilities, New Risks? - Evaluating Agentic General Assistants using Elements of GAIA & METR Frameworks

Tej Lander • 2nd PlaceOctober 2024