LMNT is an AI text-to-speech platform that delivers studio-quality voice synthesis with low-latency streaming, reported at around 150-200ms. It is optimized for real-time applications such as conversational apps, voice agents, and games.

Does LMNT support voice cloning?

Yes, LMNT offers instant voice cloning that the company says can work from just 5 seconds of audio. This lets developers create custom voices quickly for their applications.

How many languages does LMNT support?

LMNT supports 24 languages for speech synthesis. Combined with its low-latency streaming, this makes it suitable for multilingual conversational and interactive use cases.

LMNT is built for developers building conversational apps, agents, and games that need fast, lifelike speech. The company lists organizations such as Khan Academy, HeyGen, Vapi, Vercel, and Unity among those it works with.

LMNT positions itself as an affordable text-to-speech option, typically charged based on usage, but specific rates are not listed in this directory entry. See lmnt.com for current pricing tiers and any free trial or developer allowances.

Startups AI Voice & Speech LMNT

LMNT

Active

Fast, lifelike, and affordable AI text-to-speech with low-latency streaming and instant voice cloning.

📍 United States 📅 Founded 2021 👥 11-50 🏷 AI Voice & Speech

Visit website

Total raised

—

Stage

Seed

Team

11-50

since 2021

Pricing

Freemium

from $10/mo

Founded

2021

United States

Agent-ready

—

About LMNT

What LMNT does

LMNT is an AI text-to-speech platform that converts written text into natural-sounding speech. It is built for low-latency, real-time applications, delivering streaming audio with around 150-200ms latency, and supports instant voice cloning from short recordings. The service is offered through a developer API and a free web playground for testing.

Key capabilities

Lifelike AI text-to-speech with low-latency streaming
Voice cloning from a short voice sample
Multilingual support across many languages, with mid-sentence language switching
Developer-friendly API suitable for conversational AI and real-time voice apps

Who it's for

LMNT is aimed at developers and AI engineers building voice-enabled applications, plus companies in gaming, video and avatar generation, education, and media. Its low latency makes it a fit for conversational agents and other real-time voice experiences where responsiveness matters.

Key capabilities

Low-latency streaming TTS (150-200ms)

Instant voice cloning from 5-second recordings

24-language support with mid-sentence switching

WebSocket-based real-time speech sessions

REST API for synchronous speech synthesis

Built-in voices with varied personas (Leah, Vesper, Natalie, Tyler, Brandon)

Blizzard model for expressive, natural prosody

No concurrency or rate limits on API

Technology stack

2detected May 30, 2026

Est. monthly stack spend ~$100/mo

Framework

webpack

Infra

Vercel

Agent readiness

65/100

Developing

MCP server

Public API

Webhooks

OAuth 2.0

SDKs · Python, JavaScript

API docs ↗

Alternatives

6 All →

Wispr

Effortless voice dictation powered by AI

AI WritingAI Voice & Speech

ElevenLabs

The most realistic AI voices for creators, developers, and enterprise.

AI ChatbotsAI Voice & Speech

Quo

AI-driven business phone and front office for growing teams

AI Voice & SpeechAI Customer Support

Wondercraft

The Canva of audio — generative AI podcasts and ads

AI VideoAI Voice & Speech

Kyutai

An open-science AI lab dedicated to building and democratizing Artificial General Intelligence through open research.

AI Voice & SpeechFoundation Models

David AI

The data layer for next-generation audio and voice AI models

AI Voice & SpeechAI Audio

Frequently asked

What does LMNT do?: LMNT is an AI text-to-speech platform that delivers studio-quality voice synthesis with low-latency streaming, reported at around 150-200ms. It is optimized for real-time applications such as conversational apps, voice agents, and games.
Does LMNT support voice cloning?: Yes, LMNT offers instant voice cloning that the company says can work from just 5 seconds of audio. This lets developers create custom voices quickly for their applications.
How many languages does LMNT support?: LMNT supports 24 languages for speech synthesis. Combined with its low-latency streaming, this makes it suitable for multilingual conversational and interactive use cases.
Who uses LMNT?: LMNT is built for developers building conversational apps, agents, and games that need fast, lifelike speech. The company lists organizations such as Khan Academy, HeyGen, Vapi, Vercel, and Unity among those it works with.
How is LMNT priced?: LMNT positions itself as an affordable text-to-speech option, typically charged based on usage, but specific rates are not listed in this directory entry. See lmnt.com for current pricing tiers and any free trial or developer allowances.

Discussion

Watching

Get LMNT updates

New funding, product launches, and team changes — to your inbox.

Follow startup

Claim ownership

Verify with your work email to manage this listing.

Explore more around LMNT

Contextual paths to related AI startups, deals and rankings.

Similar to LMNT

Country

United States AI startups

Compare

Alternatives

All alternatives to LMNT

LMNT

Claim LMNT

Enter your code

Claim approved

Claim received

Claim LMNT

Enter your code

Claim approved

Claim received

About LMNT

What LMNT does

Key capabilities

Who it's for

Key capabilities

Technology stack

Agent readiness

Alternatives

Wispr

ElevenLabs

Quo

Wondercraft

Kyutai

David AI

Frequently asked

Explore more around LMNT

Similar to LMNT

Categories

Country

Compare

Alternatives

Rankings