What is Braintrust?

Braintrust is an AI observability and evaluation platform that helps teams measure model quality, run evals, monitor production traces, and iterate on prompts and models with confidence.

Who founded Braintrust?

Braintrust was founded by Ankur Goyal, who previously built and sold Impira to Figma. The company emerged from his own experience building internal evals tooling repeatedly at past roles.

How much funding has Braintrust raised?

Braintrust has raised $80M in disclosed funding, led by ICONIQ in a February 2026 Series B that valued the company at roughly $800M. Other investors include Andreessen Horowitz, Greylock, and Elad Gil.

How does Braintrust compare to LangSmith?

Both platforms target LLM observability and evals. Braintrust emphasizes evals as a first-class developer workflow with strong CI integration, while LangSmith is more tightly coupled with the LangChain ecosystem.

How is Braintrust priced?

Braintrust offers usage-based pricing with a free tier for individuals and small projects, plus team and enterprise plans. Specific rates are published on the Braintrust website.

Who is Braintrust for?

Braintrust is built for engineering and product teams shipping production AI features who need rigorous, repeatable evals and trace-level observability.

Is Braintrust suitable for regulated or enterprise use?

Yes. Braintrust offers enterprise plans with appropriate security controls and is used by large companies including Stripe and BILL, which operate in regulated environments.

Does Braintrust support custom evaluators?

Yes. Teams can define scoring functions in code, use LLM-as-judge evaluators, or combine them with human review, all versioned alongside datasets and experiments.


Braintrust

Active

The AI observability platform for building quality AI products at scale.

📍 United States 📅 Founded 2023 🏷 AI Developer Tools


Total raised: $80M (1 round)

Stage: Series B (Feb 2026)

Team: not listed

Pricing: Enterprise

Founded: 2023, United States

Agent-ready: not listed

About Braintrust

Braintrust is an AI observability and evaluation platform designed for teams building production AI features. The product helps engineers and product teams systematically measure model and prompt quality, monitor production traces, and iterate quickly on what is shipped. It combines an evals framework, prompt and dataset management, online and offline experiment tracking, and trace-level observability into a single workflow.

The company was founded by Ankur Goyal, who previously built and sold Impira to Figma and led machine learning engineering work there. Goyal has said publicly that Braintrust grew directly out of his own experience: at both Impira and Figma, he and his teams had to build internal evals tooling from scratch every time they shipped a new AI feature.

Braintrust has raised $80M to date. The most recent round, an $80M Series B announced in February 2026, was led by ICONIQ with participation from Andreessen Horowitz, Greylock, Elad Gil, and basecase capital, and valued the company at roughly $800M. Customers include Notion, Stripe, Vercel, Airtable, Instacart, Zapier, Ramp, Dropbox, Cloudflare, and BILL, which makes Braintrust a common choice of observability layer among AI-native and AI-adopting companies.

The core workflow centers on evals. Teams define datasets and scoring functions (code-based, LLM-as-judge, or human review) and run them against prompt and model variants, either in CI or ad hoc. Results are versioned, so teams can see whether a change improved or regressed quality before shipping. Production traces feed back into datasets, so issues caught in production can drive new tests.
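The eval loop described above can be illustrated with a minimal, dependency-free sketch. It does not use the Braintrust SDK: the `Example` type, the `exact_match` scorer, the two stand-in variants, and the toy dataset are all hypothetical and exist only to show the general shape of scoring two variants against the same dataset and comparing the results.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Callable, Sequence


@dataclass(frozen=True)
class Example:
    """One dataset row: an input and the expected output (hypothetical schema)."""
    input: str
    expected: str


# Hypothetical toy dataset, for illustration only.
DATASET = [
    Example("2 + 2", "4"),
    Example("capital of France", "Paris"),
    Example("opposite of hot", "cold"),
]


def exact_match(output: str, expected: str) -> float:
    """Code-based scorer: 1.0 if the normalized strings match, else 0.0."""
    return 1.0 if output.strip().lower() == expected.strip().lower() else 0.0


# Stand-ins for two prompt or model variants; a real task would call a model.
def variant_a(question: str) -> str:
    answers = {"2 + 2": "4", "capital of France": "Paris"}
    return answers.get(question, "unknown")


def variant_b(question: str) -> str:
    answers = {"2 + 2": "4", "capital of France": "paris ", "opposite of hot": "cold"}
    return answers.get(question, "unknown")


def run_eval(
    task: Callable[[str], str],
    dataset: Sequence[Example],
    scorer: Callable[[str, str], float],
) -> float:
    """Run the task on every example and return the mean score."""
    return mean(scorer(task(ex.input), ex.expected) for ex in dataset)


baseline = run_eval(variant_a, DATASET, exact_match)
candidate = run_eval(variant_b, DATASET, exact_match)
print(f"baseline={baseline:.2f} candidate={candidate:.2f} delta={candidate - baseline:+.2f}")
```

Because both variants are scored on the same dataset with the same scorer, the difference between the two means is a direct signal of whether the change helped or hurt. An LLM-as-judge or human-review scorer would slot in wherever `exact_match` is passed.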

Braintrust competes with LangSmith, Arize, Weights & Biases, and a growing field of LLM observability tools. Its differentiation is a focus on evals as a first-class workflow rather than dashboards alone, and on developer experience for engineering teams that already think in terms of unit tests, CI, and code review.

Key capabilities

Eval framework with code, LLM-judge, and human scoring

Prompt and dataset versioning

Production trace logging and search

Online and offline experiment tracking

CI integration for regression testing on AI changes

Side-by-side prompt and model comparison

Feedback loop from production to datasets

Team collaboration on prompts and evals
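Two of the capabilities listed above, CI regression testing and the feedback loop from production to datasets, can be sketched in a few lines. This is an illustration under stated assumptions, not Braintrust's implementation: the `Trace` type, the `user_flagged` field, the `tolerance` parameter, and both function names are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Trace:
    """A logged production call (hypothetical schema)."""
    input: str
    output: str
    user_flagged: bool  # e.g. a thumbs-down recorded in production


def regression_gate(baseline: float, candidate: float, tolerance: float = 0.02) -> bool:
    """Return True if the candidate score has not dropped more than `tolerance`."""
    return candidate >= baseline - tolerance


def traces_to_rows(traces: list[Trace]) -> list[dict[str, Optional[str]]]:
    """Turn flagged traces into dataset rows that still need a reviewed expected value."""
    return [
        {"input": t.input, "observed": t.output, "expected": None}
        for t in traces
        if t.user_flagged
    ]


# Hypothetical production traces, for illustration only.
traces = [
    Trace("refund policy?", "We do not offer refunds.", user_flagged=True),
    Trace("store hours?", "9am to 5pm on weekdays.", user_flagged=False),
]

new_rows = traces_to_rows(traces)
passed = regression_gate(baseline=0.90, candidate=0.85)
print(f"{len(new_rows)} new dataset row(s); gate passed: {passed}")
```

In a CI job, a failed gate would typically translate into a non-zero exit code so the change is blocked. The rows produced from flagged traces leave `expected` empty because the correct answer still has to be supplied by a reviewer before the row is useful as a test.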

Technology stack

3 technologies, detected May 30, 2026

Est. monthly stack spend: ~$150/mo

Analytics: Google Tag Manager

Framework: webpack

Infra: Vercel

Agent readiness

Score: 40/100 (Developing)

Surfaces checked: MCP server, Public API, Webhooks, OAuth 2.0, SDKs

No public agent surfaces detected yet.

Funding history

1 round · $80M

Feb 2026: Series B, $80M (Andreessen Horowitz)

Capital network

$80M raised · 1 backer · 10 network links

Backers (1): Andreessen Horowitz (lead investor)
Shared portfolio (companies these backers also fund): Databricks (1), Safe Superintelligence (1), xAI (1), Mistral AI (1), Thinking Machines Lab (1)
Extended network (funds that co-invest alongside them): Sequoia Capital (5), Valor Equity Partners (4), General Catalyst (3), Lightspeed Venture Partners (3), NVIDIA (2)

Alternatives


OpenAI: Creator of ChatGPT, GPT-4, and the leading frontier AI lab. (AI Chatbots, AI Developer Tools)

xAI: AI designed to understand the universe. (AI Chatbots, AI Agents)

Thinking Machines Lab: Frontier AI research lab building customizable, multimodal models. (AI Developer Tools, Foundation Models)

Upscale AI: Pure-play AI networking infrastructure. (AI Developer Tools, AI Infrastructure)

Resolve AI: AI SRE for complex production environments. (AI Agents, AI Developer Tools)

Cursor: The AI code editor built for productive engineers. (AI Coding, AI Agents)

