68 AI Infrastructure startups tracked. Also see the full AI Infrastructure directory.
The Essential Cloud for AI, purpose-built to power pioneers' most complex workloads with next-generation infrastructure and intelligent tools.
The AI-powered data foundation
The data + AI company
European sovereign AI infrastructure
Wafer-scale AI chips for unprecedented speed
Fastest AI inference on the planet
GPU cloud for AI teams
Enterprise AI for language understanding
General-purpose humanoid robots
Sustainable AI cloud infrastructure
An AI research company building a superlearner to achieve superintelligence through reinforcement learning
The AI community building the future of machine learning.
Rebellions provides high-performance AI inference infrastructure, enabling efficient and scalable AI deployment for real-world applications.
Advanced language AI for enterprise applications
Efficient general-purpose foundation models built for edge, on-device, and cloud deployment at every scale.
Production-grade generative AI serving
The open AI cloud
Production-scale AI infrastructure powered by Ray for distributed training, data curation, and batch inference.
The vector database for AI
Open-source AI-native vector database
High-performance AI infrastructure with sub-second cold starts and instant autoscaling.
Run AI models in the cloud
High-performance vector search engine built in Rust for production-grade AI retrieval.
AI-native inference cloud built for production AI workloads on H100, H200 and Blackwell GPUs.
Video understanding AI for developers
The productivity search engine with AI agents.
Deploy and scale AI models in production with the fastest inference infrastructure.
AI-native multimodal lakehouse for vector search, training data, and retrieval.
Search for AI applications
Inference hosting for AI teams who ship fast and scale faster.
AI infrastructure that adapts as you grow, orchestrating across any cloud or hardware for faster deployment and optimal cost-performance.
Preferred Networks develops and provides all four layers of the AI technology value chain: AI semiconductors, computing infrastructure, generative AI foundation models, and AI products/solutions.
Incubating the future by understanding and creating intelligence from first principles to solve challenging problems.
The enterprise platform for AI workloads and GPU orchestration.
India's AI-first cloud platform where India builds, offering industry-leading, future-ready, and developer-first infrastructure.
India's full-stack sovereign AI platform, built on sovereign compute and powered by frontier-class models for population-scale impact.
Tenstorrent provides high-performance AI compute solutions, from cards and workstations to scale-out servers, powered by flexible, open-source IP.
A unified AI inference platform for high-performance, portable compute, enabling full optimisations from GPU kernel to API endpoint.
Browserbase makes the web as reliable and programmable as APIs for your AI agents.
Etched is building custom hardware to accelerate large language models and achieve superintelligence.
The fastest AI inference platform, purpose-built for agentic AI with custom dataflow technology and three-tier memory architecture.
Rearchitecting entire industries for frontier technologies with production AI systems that deliver measurable impact in weeks.
Orchestrate AI and backend workflows at any scale, making any code durable by default.
The Enterprise AI Agent Cloud provides open-source, secure environments with real-world tools for enterprise-grade AI agents.
Orchestrate AI agents with enterprise-grade security, bringing secure AI to work for IT teams.
The AI Control Plane for unifying ML and GenAI workflows across your fragmented stack.
The Unified Context Layer for building AI agents across SaaS, VPC, and On-Prem environments.
The fastest way to deploy and scale AI workloads with on-demand GPUs and serverless infrastructure.
Agent-ready GPU infrastructure for AI, offering API-native provisioning, real-time pricing, and per-second billing.
G42 is an Abu Dhabi-born AI and cloud computing company building globally to push AI to do more for everyone.