Competitions: A Better Framework For Evaluating AI Agents