dlt is dltHub's open-source Python library for building data pipelines, with automatic schema inference, normalization, and incremental loading into warehouses and lakes.

Yes. The dlt library is free and open-source under the Apache 2.0 license. dltHub also offers a commercial managed platform, dltHub Pro, for teams.

How does dltHub use AI?

dltHub Context provides a knowledge base of thousands of data sources so AI agents can automatically generate working data pipelines and connectors.

How is dltHub different from Fivetran or Airbyte?

dltHub is code-first and open-source, embedding directly into Python workflows, and is increasingly agent-native, versus the managed connector model of Fivetran and Airbyte.

Which destinations does dlt support?

dlt loads into destinations including BigQuery, Snowflake, Redshift, DuckDB, and Postgres, among others.

Startups AI Data Engineering dltHub

dltHub

Active

Company behind dlt, a popular open-source Python library for building data pipelines and

📅 Founded 2022 👥 11-50 🏷 AI Data Engineering

Visit website

Total raised

$8M

1 round

Stage

Seed

Team

11-50

since 2022

Pricing

Freemium

free plan

Founded

2022

Agent-ready

—

About dltHub

dltHub was founded in 2022 by Matthaus Krzykowski and team to make building data pipelines as simple as writing a few lines of Python. Its flagship open-source library, dlt (data load tool), lets developers extract data from APIs, databases, and files and load it into warehouses and lakes with automatic schema inference, normalization, and incremental loading, all without standing up heavy ETL infrastructure. Released under a permissive Apache 2.0 license, dlt has become one of the most widely adopted open-source ingestion tools.

The library handles the unglamorous but critical parts of data ingestion: it infers and evolves schemas automatically, manages state for incremental loads, handles retries and pagination, and writes to destinations like BigQuery, Snowflake, Redshift, DuckDB, and Postgres. Because it is just a Python library, it slots naturally into developers' existing code, notebooks, and orchestration tools rather than requiring a separate platform.

On top of the open-source core, dltHub offers dltHub Pro, a commercial managed platform that adds deployment, scheduling, alerting, observability, and agentic workflows for teams that want a fully supported experience. A standout direction is AI-driven pipeline creation: dltHub Context provides a knowledge base spanning thousands of data sources so AI agents can generate working connectors automatically. The company reports that agent-created pipelines have grown explosively, becoming responsible for the large majority of new pipeline creation on its platform.

dltHub raised an $8 million seed round in August 2025 led by Bessemer Venture Partners, with participation from Dig Ventures and Firestreak Ventures, bringing total funding to roughly $14 million across rounds. With millions of monthly PyPI downloads and thousands of production users, dltHub competes with managed connectors like Fivetran and Airbyte by being code-first, open-source, and increasingly agent-native.

Key capabilities

Open-source dlt Python library for building data pipelines

Automatic schema inference, evolution, and normalization

Incremental loading with built-in state management

Connectors to warehouses and lakes (BigQuery, Snowflake, DuckDB, etc.)

dltHub Pro managed deployment, scheduling, and observability

Agentic pipeline generation via dltHub Context knowledge base

Coverage of 10,000+ data sources for AI-assisted connector building

Retries, pagination, and error handling out of the box

Agent readiness

10/100

Early

MCP server

Public API

Webhooks

OAuth 2.0

SDKs

No public agent surfaces detected yet.

Funding history

1 · $8M

— Seed $8M incl. Bessemer Venture Partners +2

Capital network

$8M raised ·3 backers·10 network links

Backers3
Bessemer Venture Partners1 round Dig Ventures1 round Firestreak Ventures1 round
Shared portfoliocompanies these backers also fund
Moonvalley1 ChipAgents1 Torq1 Rosebud1 Jasper1
Extended networkfunds that co-invest alongside them
General Catalyst2 Khosla Ventures2 Y Combinator1 Insight Partners1 Initialized Capital1

Key operators

Marcin Rudolf

Founder

Matthaus Krzykowski

Founder

Alternatives

6 All →

Tigris Data

Globally distributed, S3-compatible object storage built for AI

AI InfrastructureAI Data Engineering

Onehouse

Fully managed universal data lakehouse built on Apache Hudi, Iceberg and Delta Lake

AI InfrastructureAI Data Engineering

Revefi

Zero-touch platform that monitors data quality, warehouse spend, performance and usage

AI ObservabilityAI Data Engineering

Euno

Data model governance that pulls business logic out of BI tools and back into the data layer

AI Data EngineeringAI Governance

Prophecy

Agentic AI platform that turns plain-English goals into editable visual data pipelines

AI AnalyticsAI Data Engineering

Bruin

End-to-end data platform combining ingestion, SQL and Python pipelines and an AI data analyst

AI AnalyticsAI Data Engineering

Frequently asked

What is dlt?: dlt is dltHub's open-source Python library for building data pipelines, with automatic schema inference, normalization, and incremental loading into warehouses and lakes.
Is dlt free?: Yes. The dlt library is free and open-source under the Apache 2.0 license. dltHub also offers a commercial managed platform, dltHub Pro, for teams.
How does dltHub use AI?: dltHub Context provides a knowledge base of thousands of data sources so AI agents can automatically generate working data pipelines and connectors.
How is dltHub different from Fivetran or Airbyte?: dltHub is code-first and open-source, embedding directly into Python workflows, and is increasingly agent-native, versus the managed connector model of Fivetran and Airbyte.
Which destinations does dlt support?: dlt loads into destinations including BigQuery, Snowflake, Redshift, DuckDB, and Postgres, among others.

Discussion

Watching

Get dltHub updates

New funding, product launches, and team changes — to your inbox.

Follow startup

Claim ownership

Verify with your work email to manage this listing.

Explore more around dltHub

Contextual paths to related AI startups, deals and rankings.

Similar to dltHub

Compare

Alternatives

All alternatives to dltHub

dltHub

Claim dltHub

Enter your code

Claim approved

Claim received

Claim dltHub

Enter your code

Claim approved

Claim received

About dltHub

Key capabilities

Agent readiness

Funding history

Capital network

Key operators

Marcin Rudolf

Matthaus Krzykowski

Alternatives

Tigris Data

Onehouse

Revefi

Euno

Prophecy

Bruin

Frequently asked

Explore more around dltHub

Similar to dltHub

Categories

Compare

Alternatives

Rankings