Polymath

Active

Simulation environments to train & evaluate long-horizon AI agents

📍 San Francisco, United States 📅 Founded 2026 👥 1-10 🏷 AI Agents

Total raised: $500K (1 round)

Stage: Seed (Jan 2026)

Team: 1-10 (since 2026)

Pricing: not listed

Founded: 2026, San Francisco, United States

Agent-ready: not listed

About Polymath

What Polymath does

Polymath is an applied research lab focused on increasing the reliability and autonomy of AI agents. It builds simulation environments and simulated worlds where agents can practice, learn, and be evaluated on long-horizon tasks that mirror real-world conditions. The goal is to help agents perform useful work over extended periods with little or no human supervision.

Key capabilities

The company develops training and evaluation environments that let AI agents gain capabilities through practical experience rather than one-shot prompting. These environments are designed to test how agents plan, recover from errors, and complete multi-step work across long time horizons. Polymath partners with AI model developers to advance agent performance using this simulation-based approach.

Who it's for

Polymath serves AI model labs and research organizations building more autonomous agent systems, as well as teams that need rigorous ways to train and measure long-horizon agent reliability. It is backed by Base10 and Y Combinator.

Key capabilities

Simulation environments to train and evaluate long-horizon AI agents

Simulated worlds where agents learn to operate autonomously

Combines running applications, real tools, and multi-step tasks

Environments reflect real-world complexity

Horizon-SWE benchmark for end-to-end software engineering tasks

Tests frontier models on multi-step, realistic workloads

Y Combinator Winter 2026 company based in San Francisco

Agent readiness

12/100 (Early)

Surfaces checked: MCP server, Public API, Webhooks, OAuth 2.0, SDKs

No public agent surfaces detected yet.

Funding history

1 round · $500K

Jan 2026 · Seed · $500K · Y Combinator

Capital network

$500K raised · 1 backer · 10 network links

Backers (1): Y Combinator (lead investor)

Shared portfolio (companies these backers also fund): Moonvalley (1), Onyx (1), Raycast (1), Prosper AI (1), Latent (1)

Extended network (funds that co-invest alongside them): General Catalyst (3), Khosla Ventures (3), Andreessen Horowitz (2), Accel (2), Bessemer Venture Partners (1)

Key operators

Dylan Ma

Co-founder & CEO

Naren Yenuganti

Co-founder & CTO

Alternatives

Cursor

The AI code editor built for productive engineers.

AI Coding, AI Agents

Uniphore

Enterprise business AI for conversations, agents, and data

AI Agents, AI Customer Support

Invisible Technologies

AI training data and enterprise automation platform

AI Agents, Data Labeling

Aily Labs

Enterprise AI decision intelligence for the Fortune 500

AI Agents, AI Analytics

Moonshot AI

Maker of Kimi — the long-context AI chatbot.

AI Chatbots, AI Agents

Crescendo

AI-native contact center blending agents and humans

AI Agents, AI Customer Support

Frequently asked

What does Polymath build?

Polymath builds simulation environments to train and evaluate long-horizon AI agents, combining running applications, real tools, and multi-step tasks that reflect real-world complexity.

What is Horizon-SWE?

Horizon-SWE is Polymath's flagship benchmark that tests frontier models on end-to-end software engineering tasks.

Why focus on long-horizon agents?

Long-horizon, multi-step tasks are difficult for AI agents, so Polymath builds realistic simulated worlds where agents can learn to operate autonomously and be evaluated rigorously.

Who is Polymath for?

Polymath is aimed at AI researchers and teams developing autonomous agents who need realistic environments and benchmarks. It is a Y Combinator Winter 2026 company based in San Francisco.
