Moss delivers sub-10ms lookup times by running search and embeddings inside the application process with no network hop on the hot path.

Do I need a vector database?

No. Moss embeds retrieval and embeddings directly in your app, so no separate vector database is required.

What kind of retrieval does Moss support?

It offers hybrid retrieval combining semantic and keyword search, plus built-in embeddings and metadata filtering, all from one SDK.

The single SDK runs in browsers, on-device, at the edge, and in the cloud, making it suitable for latency-sensitive conversational AI.

Startups AI Search Moss

Moss

Active

Real-time semantic search for Conversational AI

📍 San Francisco, United States 📅 Founded 2025 👥 1-10 🏷 AI Search

Visit website

Total raised

$500K

1 round

Stage

Seed

Jan 2025

Team

1-10

since 2025

Pricing

—

Founded

2025

San Francisco, United States

Agent-ready

—

About Moss

What Moss does

Moss is a real-time semantic search runtime built for conversational AI, developed by InferEdge Inc. It provides a local-first search engine that embeds directly inside AI applications, giving agents and copilots fast access to relevant context during conversations without the latency of remote database queries.

Key capabilities

Moss delivers sub-10ms lookup latency with instant index updates and a local-first architecture that keeps data on-device by default, with optional cloud sync. It supports hybrid search combining semantic similarity and keyword matching, ships built-in embedding models (moss-minilm and moss-mediumlm) alongside custom embeddings, and is built in Rust and WebAssembly for performance. It deploys across browsers, edge devices, and the cloud through a unified API, with JavaScript/TypeScript and Python libraries.

Who it's for

Moss serves teams building AI agents, voice assistants, and copilots that need fast conversational context retrieval. Common use cases include voice agents and copilots requiring real-time grounding, knowledge base and help-center search, on-device and offline applications, and edge deployments where a lightweight runtime matters. It is backed by Y Combinator.

Key capabilities

Real-time semantic search runtime for conversational AI

Built in Rust and WebAssembly for high performance

Sub-10ms lookup times

Runs search and embeddings inside the app process with no network hop

No external vector database required

Hybrid retrieval combining semantic and keyword search

Built-in embeddings and metadata filtering

Single SDK runs in browsers, on-device, at the edge, and in the cloud

Agent readiness

12/100

Early

MCP server

Public API

Webhooks

OAuth 2.0

SDKs

No public agent surfaces detected yet.

Funding history

1 · $500K

Jan 2025 Seed $500K ● Y Combinator

Capital network

$500K raised ·1 backer·10 network links

Backers1
Y CombinatorLead investorLead
Shared portfoliocompanies these backers also fund
Moonvalley1 Onyx1 Raycast1 Prosper AI1 Latent1
Extended networkfunds that co-invest alongside them
General Catalyst3 Khosla Ventures3 Andreessen Horowitz2 Accel2 Bessemer Venture Partners1

Key operators

Harsha Nalluru

Co-founder & CTO

Sri Raghu Malireddi

Co-founder & CEO

Alternatives

6 All →

Perplexity

AI-powered answer engine delivering real-time, cited responses to complex queries.

AI SearchAI Productivity

Jina AI

Search foundation models: embeddings, rerankers, and a web reader API

AI SearchEmbeddings & RAG

Onyx

Onyx, formerly Danswer, is an open-source enterprise AI search and assistant that connects to

AI Search

Raycast

The extensible AI launcher that puts every app, command, and model a keystroke away

AI SearchAI Productivity

Vetted

AI shopping research agent that reads Reddit, reviews, and experts so you don't have to

AI SearchAI E-commerce

Profound

AI search visibility platform for Answer Engine Optimization across ChatGPT and Perplexity

AI SearchAI Marketing

Frequently asked

How fast is Moss?: Moss delivers sub-10ms lookup times by running search and embeddings inside the application process with no network hop on the hot path.
Do I need a vector database?: No. Moss embeds retrieval and embeddings directly in your app, so no separate vector database is required.
What kind of retrieval does Moss support?: It offers hybrid retrieval combining semantic and keyword search, plus built-in embeddings and metadata filtering, all from one SDK.
Where can Moss run?: The single SDK runs in browsers, on-device, at the edge, and in the cloud, making it suitable for latency-sensitive conversational AI.

Discussion

Watching

Get Moss updates

New funding, product launches, and team changes — to your inbox.

Follow startup

Claim ownership

Verify with your work email to manage this listing.

Explore more around Moss

Contextual paths to related AI startups, deals and rankings.

Similar to Moss

Country

United States AI startups

Compare

Alternatives

All alternatives to Moss

Moss

Claim Moss

Enter your code

Claim approved

Claim received

Claim Moss

Enter your code

Claim approved

Claim received

About Moss

What Moss does

Key capabilities

Who it's for

Key capabilities

Agent readiness

Funding history

Capital network

Key operators

Harsha Nalluru

Sri Raghu Malireddi

Alternatives

Perplexity

Jina AI

Onyx

Raycast

Vetted

Profound

Frequently asked

Explore more around Moss

Similar to Moss

Categories

Country

Compare

Alternatives

Rankings