Skip to main content
NeuronFeed
CATEGORY

Best Evaluation AI Tools

16 tools compared · 2026

16 ai evaluation startups tracked, with the largest concentration in US. Total tracked funding: $14.7B.

Tracked
16
Total Raised
$14.7B
Countries
3
Active Deals
1

Top by score

View all 16 →

Funding by year — AI Evaluation

2023 → 2026
$2M
’23
$12.1M
’24
$14.4B
’25
$115M
’26

Top countries

By startup count

Stage breakdown

Latest round type
  • Seed 7
  • Series A 3
  • Series B 2
  • Strategic 1
  • Series C 1
  • Pre-Seed 1

Top investors backing AI Evaluation

See all →

Recent rounds in AI Evaluation

All rounds →
Date Startup Round Amount
Feb 2026 Braintrust Series B $80M
Feb 2026 micro1 Series A $35M
Aug 2025 Confident AI Seed $2.2M
Jun 2025 Scale AI Strategic $14.3B
Feb 2025 Arize AI Series C $70M
Dec 2024 Gentrace Series A $8M
Jun 2024 Maxim AI Seed $3M
Jan 2024 LangWatch Pre-Seed $1.1M

All AI Evaluation startups

Page 1

Scale AI

Verified
US est. 2016

Data labeling and AI infrastructure platform powering frontier models for enterprises and governments.

Raised
$14.3B
Stage
STRATEGIC
85

Galileo

est. 2021
Raised
$68M
Stage
S-B
78

Confident AI

US est. 2024

DeepEval-powered LLM evaluation and observability

Raised
$2.2M
Stage
Seed
76

Arize AI

The AI & Agent Engineering Platform for development, observability, and evaluation of LLM applications.

Raised
$132M
Stage
S-C
70

Braintrust

US est. 2023

The AI observability platform for building quality AI products at scale.

Raised
$80M
Stage
S-B
68

Guardrails AI

US est. 2023

The AI reliability platform for production GenAI

Raised
$7.5M
Stage
Seed
68

AfterQuery

US est. 2025

Expert reasoning datasets and benchmarks for frontier AI

Raised
$30M
Stage
S-A
66

HoneyHive

US est. 2022

Observability and evaluation for production AI agents

Raised
$7.4M
Stage
Seed
64

Freeplay

est. 2023
Raised
$8.9M
Stage
Seed
64

micro1

US est. 2023

Human intelligence infrastructure for high-quality AI training data

Raised
$35M
Stage
S-A
63

Gentrace

US est. 2023

Collaborative testing and evaluation platform for generative AI apps

Raised
$14M
Stage
S-A
63

Maxim AI

IN est. 2023

GenAI evaluation, simulation and observability platform for AI agents

Raised
$3M
Stage
Seed
61

LangWatch

NL est. 2023

Platform for LLM evaluations, agent testing and observability

Raised
$1.1M
Stage
Pre-S
60

RagaAI

est. 2023
Raised
$4.7M
Stage
Seed
60

Humane Intelligence

US

A 501(c)(3) nonprofit dedicated to breaking down barriers to AI deployment for social good through rigorous evaluations.

59

Autoblocks AI

US est. 2022

Collaborative evaluation and testing platform to build safe AI apps

Raised
$2M
Stage
Seed
55