Best Evaluation AI Tools
16 ai evaluation startups tracked, with the largest concentration in US. Total tracked funding: $14.7B.
Funding by year — AI Evaluation
2023 → 2026Top countries
By startup countStage breakdown
Latest round typeTop investors backing AI Evaluation
See all →Recent rounds in AI Evaluation
All rounds →| Date | Startup | Round | Amount |
|---|---|---|---|
| Feb 2026 | Braintrust | Series B | $80M |
| Feb 2026 | micro1 | Series A | $35M |
| Aug 2025 | Confident AI | Seed | $2.2M |
| Jun 2025 | Scale AI | Strategic | $14.3B |
| Feb 2025 | Arize AI | Series C | $70M |
| Dec 2024 | Gentrace | Series A | $8M |
| Jun 2024 | Maxim AI | Seed | $3M |
| Jan 2024 | LangWatch | Pre-Seed | $1.1M |
All AI Evaluation startups
Page 1Scale AI
VerifiedData labeling and AI infrastructure platform powering frontier models for enterprises and governments.
Galileo
Confident AI
DeepEval-powered LLM evaluation and observability
Arize AI
The AI & Agent Engineering Platform for development, observability, and evaluation of LLM applications.
Braintrust
The AI observability platform for building quality AI products at scale.
Guardrails AI
The AI reliability platform for production GenAI
AfterQuery
Expert reasoning datasets and benchmarks for frontier AI
HoneyHive
Observability and evaluation for production AI agents
Freeplay
micro1
Human intelligence infrastructure for high-quality AI training data
Gentrace
Collaborative testing and evaluation platform for generative AI apps
Maxim AI
GenAI evaluation, simulation and observability platform for AI agents
LangWatch
Platform for LLM evaluations, agent testing and observability
RagaAI
Humane Intelligence
A 501(c)(3) nonprofit dedicated to breaking down barriers to AI deployment for social good through rigorous evaluations.
Autoblocks AI
Collaborative evaluation and testing platform to build safe AI apps