What\u2019s new in AI tools
Automated weekly roundup of release notes and changelogs across every AI tool we track. Currently indexing 86 updates.
This week \u2014 2026-W18
AI Tools This Week: 42 updates across 7 products
-
January Drop 2026
Introduced specialized AI agents for marketing, advanced code search and writing for engineers, and enhanced Assistant capabilities.
-
February Drop 2026
Released 85+ new agent actions, conversational deployment analytics, and specialized engineering agents for workflow automation.
-
ChatGPT app, Dynamic voice speed, A/B testing, and more
Introduced ChatGPT app for building voice agents, dynamic voice speed that adapts to caller pace, and A/B testing capabilities for splitting call traffic across multiple agents.
-
Prompt Caching added
Prompt caching automatically reuses computation from recent requests when they share a common prefix, delivering 50% cost savings for cached portions and improved response times. The feature works automatically with no code changes required and data expires within hours for privacy.
-
Moonshot AI Kimi K2 Instruct 0905 added
Kimi K2-0905 brings Moonshot AI's cutting-edge model to GroqCloud with day zero support, featuring a 256K context window and enhanced agentic coding capabilities. The model offers prompt caching with up to 50% cost savings and runs at 200+ tokens/second.
-
Figure Signs Agreement with Catalyst Brands to Scale Humanoid Operations
Figure AI announced a strategic agreement with Catalyst Brands to expand humanoid robot operations and scaling capabilities.
-
GitHub Copilot CLI - Agentic coding agent for terminal
GitHub Copilot CLI enables users to build, debug, and deploy code directly from the terminal with an AI coding agent. The tool includes GitHub MCP by default and supports extending with additional servers for enhanced context.
-
Observability for any agent, anywhere: Production-ready tracing with OpenTelemetry & Unity Catalog on Databricks
Databricks launches production-ready AI tracing capabilities using OpenTelemetry and Unity Catalog for comprehensive agent observability. The solution addresses traditional observability challenges in AI applications moving to production.
-
January Drop 2026
Released specialized AI agents for marketing, advanced code search and writing for engineers, and enhanced Assistant capabilities.
-
February Drop 2026
Introduced 85+ new agent actions, conversational deployment analytics, and specialized engineering agents for workflow automation.
-
March Drop 2026
Launched remote MCP server to ground ChatGPT and Claude in company context, expanded Assistant and Agents with third-party MCP tools.
-
April Drop 2026
Added slide and interactive HTML page generation capabilities, plus embedded MCP Apps in Glean Assistant for cross-tool actions.
-
ChatGPT app, Dynamic voice speed, A/B testing, and more
Launched ChatGPT app for building voice agents, dynamic voice speed that adapts to caller pace, and A/B testing for splitting call traffic across multiple agents.
-
Prompt Caching
Automatic prompt caching feature that reuses computation from recent requests with common prefixes, delivering 50% cost savings on cached tokens and improved response times. Initially available for Kimi K2 model.
-
Groq Compound and Compound Mini
Production-ready agentic AI systems moved from beta to general availability, integrating web search, code execution, and browser automation in a single API call. Built on GPT-OSS-120B and Llama models.
-
Moonshot AI Kimi K2 Instruct 0905
New model release with 256K context window, prompt caching support, and enhanced agentic coding capabilities. Offers 200+ t/s performance at $1.50/M tokens blended pricing.
-
Remote Model Context Protocol (MCP)
Beta release of Remote MCP server integration on GroqCloud, enabling AI models to connect to thousands of external tools through Anthropic's open MCP standard. Fully compatible with OpenAI APIs for seamless migration.
-
Introducing Forge
Forge is a new system that allows enterprises to build frontier-grade AI models grounded in their proprietary knowledge.
-
Speaking of Voxtral
Voxtral TTS is a new open-weights text-to-speech model that produces fast, adaptable, and lifelike speech for voice agents.
-
Connect the dots: Build with built-in and custom MCPs in Studio
New connectors in Studio allow enterprises to connect data to AI applications with reusable connectors, tool calling, and approval controls.
-
Remote agents in Vibe. Powered by Mistral Medium 3.5.
Mistral AI introduced Mistral Medium 3.5 model with remote coding agents in Vibe and a new Work mode in Le Chat for complex tasks.
-
Expanded interoperability with Unity Catalog Open APIs
Unity Catalog introduces expanded Open APIs to improve interoperability across different data platforms and tools.
-
AI QA Analyst, Custom Alert Notifications, Node Search, and everything else from this month
Introduced AI QA Analyst for automatic call review and feedback, custom alerting system for monitoring agent performance, and conversation flow node search functionality.
-
Agent Guardrails, Webhook Testing, Auto-Failover, Billing Transparency & more
Added safety guardrails to block harmful content and jailbreaks, webhook testing tools, global node return paths, and billing transparency improvements with separate TTS line items.
-
ChatGPT app, Dynamic voice speed, A/B testing, and more
Retell AI launched a ChatGPT app for building voice agents, dynamic voice speed that adapts to caller pace, and A/B testing capabilities for splitting call traffic across multiple agents.
-
Groq Compound and Compound Mini added
Compound and Compound Mini are Groq's production-ready agentic AI systems that integrate web search, code execution, and browser automation into a single API call. Moving from beta to general availability, these systems deliver ~25% higher accuracy and ~50% fewer mistakes across benchmarks.
-
Moonshot AI Kimi K2 Instruct 0905 added
Kimi K2-0905 brings Moonshot AI's cutting-edge model to GroqCloud with enhanced agentic coding capabilities and a 256K context window. The model delivers improved frontend development performance and includes prompt caching for up to 50% cost savings.
-
Remote Model Context Protocol (MCP) added
Remote Model Context Protocol (MCP) server integration is now available in Beta on GroqCloud, connecting AI models to thousands of external tools through Anthropic's open MCP standard. Developers can connect any remote MCP server to models hosted on GroqCloud with zero code changes from OpenAI.
-
EvenUp Introduces a New Era of PI Operations with Pre-Litigation as a Service (PLAAS)
EvenUp launches Pre-Litigation as a Service (PLAAS), combining purpose-built AI with expert case managers to help personal injury firms scale quality outcomes.
-
3.5: Notion Developer Platform
Notion launched a comprehensive developer platform allowing teams to sync any data source, build custom agent tools, and orchestrate external agents like Claude and Codex. The platform includes Workers for hosted code execution, a CLI for developers, and APIs for database sync and webhook triggers.
-
3.5: Notion Developer Platform
Notion launched a comprehensive developer platform allowing users to sync any data source, build custom agent tools, and orchestrate external agents like Claude and Codex. The platform includes Workers for custom code execution, CLI tools, and APIs for extending Notion without managing infrastructure.
-
EvenUp Introduces a New Era of PI Operations with Pre-Litigation as a Service (PLAAS)
EvenUp launches Pre-Litigation as a Service (PLAAS), combining purpose-built AI with expert case managers to help firms scale quality outcomes.
-
Pushing the Frontier for Data Agents with Genie
Databricks enhanced Genie, their state-of-the-art data agent designed for answering complex questions. The improvements advance the capabilities of data agents in handling sophisticated analytical queries.
-
MCP Marketplace brings real-time intelligence to agentic applications
Databricks launched MCP Marketplace to provide real-time intelligence capabilities for agentic AI applications. The marketplace enables AI systems to access business context and reasoning capabilities.
-
How Superhuman and Databricks built a 200K QPS inference platform together
Superhuman partnered with Databricks to build a high-performance inference platform capable of handling 200,000 queries per second. The collaboration demonstrates real-time AI inference capabilities at scale.
-
How AI Communication Agents Expanded Operational Capacity 2.5x in the First 90 Days
EvenUp's AI Communication Agents demonstrated significant operational leverage by expanding capacity 2.5x within 90 days of launch.
-
How AI Communication Agents Expanded Operational Capacity 2.5x in the First 90 Days
EvenUp's AI Communication Agents proved operational leverage within 90 days post-launch, expanding capacity by 2.5x.
-
ChatGPT app, Dynamic voice speed, A/B testing, and more
Retell AI launched a ChatGPT app for building voice agents, dynamic voice speed matching caller pace, and A/B testing for splitting call traffic across multiple agents.
-
Groq Compound and Compound Mini
Compound and Compound Mini are Groq's production-ready agentic AI systems that integrate web search, code execution, and browser automation into a single API call. Moving from beta to general availability, these systems deliver ~25% higher accuracy and ~50% fewer mistakes across benchmarks.
-
Moonshot AI Kimi K2 Instruct 0905
Kimi K2-0905 brings Moonshot AI's cutting-edge model to GroqCloud with day zero support, featuring a 256K context window and enhanced agentic coding capabilities. The model offers prompt caching with up to 50% cost savings and runs at 200+ t/s at $1.50/M tokens blended pricing.
-
Remote Model Context Protocol (MCP)
Remote Model Context Protocol (MCP) server integration is now available in Beta on GroqCloud, connecting AI models to thousands of external tools through Anthropic's open MCP standard. Developers can connect any remote MCP server to models hosted on GroqCloud with zero code changes from OpenAI.
-
Ramping Figure 03 Production
Figure AI announced the scaling up of production for their Figure 03 humanoid robot model.
-
Remote agents in Vibe. Powered by Mistral Medium 3.5.
Mistral AI introduces Mistral Medium 3.5 model, remote coding agents in Vibe, and a new Work mode in Le Chat for handling complex tasks.
-
Ramping Figure 03 Production
Figure AI scales up manufacturing of their Figure 03 humanoid robot model.
-
Remote agents in Vibe. Powered by Mistral Medium 3.5.
Mistral AI introduced Mistral Medium 3.5 with remote coding agents in Vibe and a new Work mode in Le Chat for complex tasks.
-
Workflows for work that runs the business
Mistral AI launches Workflows in public preview, enabling automated business processes.
-
Workflows for work that runs the business
Workflows feature is now available in public preview for business automation.
-
Langfuse v3.171.0
Allows source=ANNOTATION on the public scores API endpoint and adds model pricing for GPT-5.5. Includes ClickHouse performance improvements and PostHog export window fixes.
-
The next generation of Databricks Genie
Databricks announced the next generation of its Genie product with new capabilities for data interaction. The update was published as a platform and product release.
-
OpenAI GPT-5.5 + Codex, now available and fully-governed on Databricks
OpenAI GPT-5.5 and Codex are now available on Databricks with governance through Unity AI Gateway. The integration targets agentic enterprise workflows and complex tasks.