
Now loading...
A promising AI evaluation startup born from UC Berkeley research has skyrocketed in value with a massive new funding round. LMArena, which debuted as a university project back in 2023, revealed on Tuesday that it secured $150 million in Series A financing, pushing its post-money valuation to $1.7 billion. Felicis Ventures took the lead on the investment, joined by UC Investments, the university’s own venture arm.
The company wasted no time turning heads after its May seed round of $100 million at a $600 million valuation. This latest influx brings LMArena’s total funding to $250 million in just seven months, signaling intense investor enthusiasm for its unique approach to benchmarking AI systems.
At its core, LMArena operates popular leaderboards that crowdsource judgments on AI model performance. On its public platform, visitors enter a prompt, which the system routes to two competing models; users then vote on the superior response. These crowd-driven insights, drawn from over 5 million monthly active users in 150 countries and generating 60 million conversations each month, power comprehensive rankings. The evaluations cover diverse categories like text generation, web coding tasks, visual processing, image creation from text, and more.
The lineup of tested models spans mainstream heavyweights such as variants of OpenAI’s GPT series, Google’s Gemini, Anthropic’s Claude, and xAI’s Grok, alongside specialized tools for areas like image synthesis and advanced reasoning.
LMArena traces its roots to Chatbot Arena, an open-source initiative developed by Berkeley researchers Anastasios Angelopoulos and Wei-Lin Chiang, initially supported by grants and community contributions. These leaderboards quickly captivated AI developers, becoming a go-to resource for gauging model strengths and weaknesses.
As it shifted toward commercialization, LMArena forged partnerships with leading firms including OpenAI, Google, and Anthropic, allowing their top models to be rigorously assessed by its global user base. Controversy arose in April when rival researchers released a study claiming these collaborations enabled benchmark manipulation by major labs; LMArena strongly refuted the accusations, standing by the integrity of its methods.
Building on its momentum, the company rolled out its AI Evaluations service in September, offering customized assessments for businesses, AI developers, and research teams via its established community. This enterprise offering has already hit an annualized consumption rate of $30 million by December, a remarkable feat in under four months.
Such rapid growth and widespread adoption proved irresistible to venture capitalists. The Series A also drew checks from heavyweights like Andreessen Horowitz, The House Fund, LDVP, Kleiner Perkins, Lightspeed Venture Partners, and Laude Ventures, underscoring LMArena’s pivotal role in the evolving landscape of AI accountability.
