What Integrates with Vapi AI?
Find out what Vapi AI integrations exist in 2026. Learn what software and services currently integrate with Vapi AI, and sort them by reviews, cost, features, and more. Below is a list of products that Vapi AI currently integrates with:
1
Speechmatics
Speechmatics
$0 per month
Best-in-Market Speech-to-Text & Voice AI for Enterprises. Speechmatics delivers industry-leading Speech-to-Text and Voice AI for enterprises needing unrivaled accuracy, security, and flexibility. Our enterprise-grade APIs provide real-time and batch transcription with exceptional precision across the widest range of languages, dialects, and accents. Powered by Foundational Speech Technology, Speechmatics supports mission-critical voice applications in media, contact centers, finance, healthcare, and more. With on-prem, cloud, and hybrid deployment, businesses maintain full control over data security while unlocking voice insights. Trusted by global leaders, Speechmatics is the top choice for best-in-class transcription and voice intelligence.
🔹 Unmatched Accuracy: superior transcription across languages & accents
🔹 Flexible Deployment: cloud, on-prem, and hybrid
🔹 Enterprise-Grade Security: full data control
🔹 Real-Time & Batch Processing: scalable transcription
🚀 Power your Speech-to-Text and Voice AI with Speechmatics today!
2
Cloudonix
Cloudonix
$39 per month
Cloudonix operates as a CPaaS (Communications Platform as a Service) provider that specializes in voice and text APIs/SDKs, catering to developers, agencies, telecom companies/MSPs, and enterprises seeking programmable voice communication solutions, AI-driven voice agents, and efficient SIP trunking. Their services feature agentic voice trunking, enabling users to integrate voice-agent platforms with any phone system, whether cloud-based or on-premise, through an easy plug-in approach. They also provide highly flexible SIP trunking along with built-in SBC capabilities (including transcoding and negotiation for TLS/TCP/UDP) to facilitate the connection of any SIP carrier or PBX with ease. For developers working on voice applications, they offer a comprehensive suite of programmable voice APIs, mobile/web voice SDKs, audio streaming options, and call control functionalities such as transfers and IVR management, enhanced by a scripting language for call flow design. Additionally, Cloudonix features low-code tools within their platform, empowering non-technical users to create IVR menus, automated call flows, outbound dialing systems, and sophisticated AI-enabled voice receptionists, broadening accessibility for various stakeholders in the communications landscape. This combination of powerful tools and user-friendly interfaces makes Cloudonix a versatile choice for businesses aiming to enhance their communication capabilities.
3
Gladia
Gladia
10 hours free
Gladia is an advanced audio transcription and intelligence solution that provides a cohesive API, accommodating both asynchronous (for pre-recorded content) and real-time transcription, thereby allowing developers to convert speech into text across more than 100 languages. This platform boasts features such as word-level timestamps, language recognition, code-switching capabilities, speaker identification, translation, summarization, a customizable vocabulary, and entity extraction. With its real-time engine, Gladia maintains latencies below 300 milliseconds while ensuring a high level of accuracy, and it offers “partials” or intermediate transcripts to enhance responsiveness during live events. Overall, Gladia stands out as a versatile tool for developers looking to integrate comprehensive audio transcription capabilities into their applications.
4
Mercury Edit 2
Inception
$0.25 per 1M input tokens
Mercury Edit 2 is a cutting-edge AI model from Inception Labs, part of the Mercury suite, specifically crafted for rapid reasoning, coding, and editing by employing a novel architecture distinctly different from typical large language models. It enhances the capabilities of Mercury 2, a diffusion-based model that generates and refines complete outputs simultaneously, rather than the conventional method of creating text one token at a time, which results in markedly improved speeds and more agile editing processes. Rather than functioning as a linear “typewriter,” this system operates as a dynamic editor, beginning with a rough draft and methodically enhancing it across multiple tokens simultaneously, facilitating real-time engagement and swift iterations in various tasks such as code editing, content creation, and agent-based workflows. This innovative framework achieves an impressive throughput of up to approximately 1,000 tokens per second, significantly outpacing traditional models while still upholding competitive reasoning abilities across various benchmarks. Its unique design not only transforms the way users interact with AI but also sets a new standard for performance in the field of artificial intelligence.
5
Inworld TTS
Inworld
$0.005 per minute
Inworld TTS stands out as a cutting-edge text-to-speech solution that provides exceptionally realistic and context-aware speech synthesis alongside advanced voice-cloning features, all at an incredibly affordable price. Its leading model, TTS-1, is tailored for real-time usage, boasting low-latency streaming capabilities (the first audio segment is available in about 200 milliseconds), and supports a wide array of languages such as English, Spanish, French, Korean, Chinese, and several others. Developers have the flexibility to utilize instant zero-shot voice cloning, requiring only 5 to 15 seconds of audio input, or opt for more detailed fine-tuned cloning, enabling the addition of voice-tags that convey emotion, style, and non-verbal cues, while also allowing for language switching without losing the unique voice identity. For those seeking even greater expressiveness and multilingual capabilities, the TTS-1-Max model is currently in preview, offering enhanced features. The platform accommodates various access methods, including API and portal options, and can operate in either streaming or batch modes, making it suitable for a diverse range of applications such as interactive voice agents, gaming characters, and bespoke audio branding experiences. With its versatility and advanced technology, Inworld TTS is poised to revolutionize how we interact with synthetic voices.
6
Operata
Operata
$0.0060 per agent minute
Operata is a cutting-edge platform designed specifically for cloud contact centers, leveraging artificial intelligence to enhance customer experience observability by continuously gathering and analyzing real-time data from all aspects of interactions, including calls, agent environments, networks, CCaaS, and AI engagements. This comprehensive approach offers teams a complete understanding of both customer and agent experiences, enabling them to identify not only the events that occurred but also the underlying reasons and to respond promptly. Among its standout features are a consolidated CX Insights Graph that aligns various technical, operational, and experiential signals, as well as CX Copilot and Agent Copilot, intelligent assistants powered by Tenor AI that facilitate natural language queries and provide instant recommendations. Additionally, the platform includes Customer Journey Trace for visualizing full interaction sequences across diverse channels, pre-configured playbooks and dynamic dashboards for gaining timely insights, readiness testing and assurance tools for performance benchmarking, seamless compatibility with over 50 CX and voice systems, and an MCP Server that integrates observability data into broader enterprise AI frameworks. With such a robust suite of tools, Operata empowers organizations to enhance their customer service strategies effectively.
7
Paygent
Paygent
Paygent serves as a cutting-edge profitability and monetization infrastructure specifically designed for businesses that rely on AI technologies. Unlike traditional billing systems that merely account for revenue, Paygent focuses on the critical metrics that AI companies prioritize, such as the margin generated by each agent, the real gross profit associated with each customer, and the instantaneous costs tied to every LLM call, API request, and computational event. Among its notable features are:
- Immediate cost attribution for LLM usage based on agent, customer, and workflow
- Simulation tools for predictive pricing that allow businesses to strategize pricing models prior to launching into production
- Automation of billing processes for various pricing models, including usage-based, outcome-based, hybrid, and digital employee frameworks
- Automated invoicing coupled with cost alert notifications to identify and mitigate runaway agent loops that could harm profitability
With seamless integration through Node.js, Python, and Go SDKs, Paygent adds no latency to agent operations. Eliminate uncertainty regarding your margins and transform your AI agents into a thriving business venture. By leveraging Paygent, companies can gain a clearer understanding of their financial landscape and make informed decisions that drive profitability.
8
Hamming
Hamming
Automated voice testing, monitoring and more. Test your AI voice agent with 1000s of simulated users within minutes. It's hard to get AI voice agents right. LLM outputs can be affected by a small change in the prompts, function calls or model providers. We are the only platform that can support you from development through to production. Hamming allows you to store, manage, update and sync your prompts with your voice infra provider. This is 1000x faster than testing voice agents manually. Use our prompt playground for testing LLM outputs against a dataset of inputs. Our LLM judges the quality of generated outputs. Save 80% on manual prompt engineering. Monitor your app in more than one way. We actively track, score and flag cases where you need to pay attention. Convert calls and traces to test cases, and add them to the golden dataset.
9
AI Agents Directory
AI Agents Directory
The AI Agents Directory stands as the largest marketplace and database for AI agents globally, showcasing more than 1,300 AI agents ready for enterprise use in over 64 categories. This extensive platform allows users to discover, compare, and implement AI agents customized to meet diverse business requirements. Visitors can delve into numerous categories such as productivity, sales, customer service, coding, and voice, with each section featuring specialized agents aimed at automating processes and boosting efficiency. In addition, the directory offers thorough details on each agent, equipping users with the necessary information to make well-informed choices based on features, pricing, and user reviews. Furthermore, the platform includes tools for requesting bespoke AI agents and submitting new entries, thereby promoting a vibrant ecosystem for both businesses and developers eager to harness the power of AI solutions. By continuously expanding its offerings, the AI Agents Directory remains a vital resource in the evolving landscape of artificial intelligence.
10
VerbaFlo
VerbaFlo
VerbaFlo is a conversational platform powered by AI that consolidates and automates communication across various channels including voice, chat, email, SMS, WhatsApp, and web, enhancing real-time interactions with prospects, clients, and residents, especially in the real estate sector. Leveraging natural language processing and machine learning, it ensures intelligent handling of conversations around the clock, facilitating lead qualification, scheduling, follow-ups, and tailored responses at every interaction point—all without the issue of script fatigue. Additionally, it seamlessly integrates with existing solutions like CRM or property management systems to centralize workflows and analytics. The platform also accommodates multilingual communication and offers real-time dashboards, conversational memory, and automated outbound initiatives such as renewals, rent reminders, and maintenance updates. Moreover, it provides valuable insights into occupancy trends and client behavior while tracking performance metrics, which empowers property teams to respond more swiftly, boost conversion and retention rates, and minimize overall operational costs. By harnessing such comprehensive features, VerbaFlo significantly elevates the efficiency and effectiveness of communication within the real estate market.