TechCrunch Startups·2 min read

The PhD students who became the judges of the AI industry

Arena sets the standard for evaluating AI models.

Arena, a startup founded by UC Berkeley PhD students, has quickly become the leading public benchmark for evaluating large language models (LLMs), achieving a valuation of $1.7 billion in just seven months. Co-founders Anastasios Angelopoulos and Wei-Lin Chiang discuss the challenges of maintaining neutrality while receiving funding from major AI players like OpenAI and Google, as well as the platform's plans to extend its evaluation capabilities beyond chat applications to include real-world tasks.

Key Takeaways

  • 1.

    Arena, valued at $1.7 billion, serves as a public leaderboard for AI models.

  • 2.

    The platform aims for structural neutrality despite backing from major AI firms.

  • 3.

    Claude currently leads the expert leaderboard for legal and medical use cases.

Get your personalized feed

Trace groups the biggest stories, videos, and discussions into one feed so you can stay current without scanning ten tabs.

Try Trace free
The PhD students who became the judges of the AI industry | Trace