Roo Code Integrations in 2026

Visual Studio Code

Microsoft

Free

See Software

Visual Studio Code is a highly extensible AI-powered code editor built for developers who demand flexibility and performance. It combines intelligent coding assistance, modern debugging tools, and collaboration features in one lightweight package. With Agent Mode, VS Code reads your codebase, runs terminal commands, and edits across files automatically until tasks are complete. Its Next Edit Suggestions feature predicts and completes your next move as you type, enhancing speed and code accuracy. The Model Context Protocol (MCP) enables developers to connect their favorite AI models—from OpenAI, Anthropic, Azure, or Google—and extend functionality through custom servers. Developers can work in any language, from JavaScript and Python to C#, Java, and Go, while leveraging over 75,000 extensions for added productivity. Seamless integration with GitHub Codespaces, cloud storage, and CI/CD tools allows teams to code, collaborate, and deploy anywhere. Open-source at its core, VS Code empowers both individuals and enterprises to innovate without limits.

OpenAI

3 Ratings

See Software

OpenAI aims to guarantee that artificial general intelligence (AGI)—defined as highly autonomous systems excelling beyond human capabilities in most economically significant tasks—serves the interests of all humanity. While we intend to develop safe and advantageous AGI directly, we consider our mission successful if our efforts support others in achieving this goal. You can utilize our API for a variety of language-related tasks, including semantic search, summarization, sentiment analysis, content creation, translation, and beyond, all with just a few examples or by clearly stating your task in English. A straightforward integration provides you with access to our continuously advancing AI technology, allowing you to explore the API’s capabilities through these illustrative completions and discover numerous potential applications.

Grok Code Fast 1

xAI

$0.20 per million input tokens

See Software

Grok Code Fast 1 introduces a new class of coding-focused AI models that prioritize responsiveness, affordability, and real-world usability. Tailored for agentic coding platforms, it eliminates the lag developers often experience with reasoning loops and tool calls, creating a smoother workflow in IDEs. Its architecture was trained on a carefully curated mix of programming content and fine-tuned on real pull requests to reflect authentic development practices. With proficiency across multiple languages, including Python, Rust, TypeScript, C++, Java, and Go, it adapts to full-stack development scenarios. Grok Code Fast 1 excels in speed, processing nearly 190 tokens per second while maintaining reliable performance across bug fixes, code reviews, and project generation. Pricing makes it widely accessible at $0.20 per million input tokens, $1.50 per million output tokens, and just $0.02 for cached inputs. Early testers, including GitHub Copilot and Cursor users, praise its responsiveness and quality. For developers seeking a reliable coding assistant that’s both fast and cost-effective, Grok Code Fast 1 is a daily driver built for practical software engineering needs.

GLM-5

Zhipu AI

Free

See Software

GLM-5 is a next-generation open-source foundation model from Z.ai designed to push the boundaries of agentic engineering and complex task execution. Compared to earlier versions, it significantly expands parameter count and training data, while introducing DeepSeek Sparse Attention to optimize inference efficiency. The model leverages a novel asynchronous reinforcement learning framework called slime, which enhances training throughput and enables more effective post-training alignment. GLM-5 delivers leading performance among open-source models in reasoning, coding, and general agent benchmarks, with strong results on SWE-bench, BrowseComp, and Vending Bench 2. Its ability to manage long-horizon simulations highlights advanced planning, resource allocation, and operational decision-making skills. Beyond benchmark performance, GLM-5 supports real-world productivity by generating fully formatted documents such as .docx, .pdf, and .xlsx files. It integrates with coding agents like Claude Code and OpenClaw, enabling cross-application automation and collaborative agent workflows. Developers can access GLM-5 via Z.ai’s API, deploy it locally with frameworks like vLLM or SGLang, or use it through an interactive GUI environment. The model is released under the MIT License, encouraging broad experimentation and adoption. Overall, GLM-5 represents a major step toward practical, work-oriented AI systems that move beyond chat into full task execution.

GLM-5.1

Zhipu AI

Free

See Software

GLM-5.1 represents the latest advancement in Z.ai’s GLM series, crafted as a cutting-edge, agent-focused AI model tailored for coding, reasoning, and managing long-term workflows. This iteration builds upon the framework of GLM-5, which employs a Mixture-of-Experts (MoE) architecture to achieve high performance without incurring excessive inference expenses, aligning with a larger initiative towards open-weight models that are accessible to developers. A significant emphasis of GLM-5.1 is on fostering agentic behavior, allowing it to plan, execute, and refine multi-step tasks instead of merely reacting to isolated prompts. Its capabilities are specifically engineered to manage intricate workflows, such as debugging code, exploring repositories, and performing sequential operations while maintaining context over time. In comparison to its predecessors, GLM-5.1 enhances reliability during lengthy interactions, ensuring coherence throughout extended sessions and minimizing failures in multi-step reasoning processes. Overall, this model signifies a leap forward in AI development, particularly in its ability to support complex task management seamlessly.

Model Context Protocol (MCP)

Anthropic

Free

See Software

The Model Context Protocol (MCP) is a flexible, open-source framework that streamlines the interaction between AI models and external data sources. It enables developers to create complex workflows by connecting LLMs with databases, files, and web services, offering a standardized approach for AI applications. MCP’s client-server architecture ensures seamless integration, while its growing list of integrations makes it easy to connect with different LLM providers. The protocol is ideal for those looking to build scalable AI agents with strong data security practices.

GLM-4.6

Zhipu AI

Free

See Software

GLM-4.6 builds upon the foundations laid by its predecessor, showcasing enhanced reasoning, coding, and agent capabilities, resulting in notable advancements in inferential accuracy, improved tool usage during reasoning tasks, and a more seamless integration within agent frameworks. In comprehensive benchmark evaluations that assess reasoning, coding, and agent performance, GLM-4.6 surpasses GLM-4.5 and competes robustly against other models like DeepSeek-V3.2-Exp and Claude Sonnet 4, although it still lags behind Claude Sonnet 4.5 in terms of coding capabilities. Furthermore, when subjected to practical tests utilizing an extensive “CC-Bench” suite that includes tasks in front-end development, tool creation, data analysis, and algorithmic challenges, GLM-4.6 outperforms GLM-4.5 while nearing parity with Claude Sonnet 4, achieving victory in approximately 48.6% of direct comparisons and demonstrating around 15% improved token efficiency. This latest model is accessible through the Z.ai API, providing developers the flexibility to implement it as either an LLM backend or as the core of an agent within the platform's API ecosystem. In addition, its advancements could significantly enhance productivity in various application domains, making it an attractive option for developers looking to leverage cutting-edge AI technology.

GLM-4.6V

Zhipu AI

Free

See Software

The GLM-4.6V is an advanced, open-source multimodal vision-language model that belongs to the Z.ai (GLM-V) family, specifically engineered for tasks involving reasoning, perception, and action. It is available in two configurations: a comprehensive version with 106 billion parameters suitable for cloud environments or high-performance computing clusters, and a streamlined “Flash” variant featuring 9 billion parameters, which is tailored for local implementation or scenarios requiring low latency. With a remarkable native context window that accommodates up to 128,000 tokens during its training phase, GLM-4.6V can effectively manage extensive documents or multimodal data inputs. One of its standout features is the built-in Function Calling capability, allowing the model to accept various forms of visual media — such as images, screenshots, and documents — as inputs directly, eliminating the need for manual text conversion. This functionality not only facilitates reasoning about the visual content but also enables the model to initiate tool calls, effectively merging visual perception with actionable results. The versatility of GLM-4.6V opens the door to a wide array of applications, including the generation of interleaved image-and-text content, which can seamlessly integrate document comprehension with text summarization or the creation of responses that include image annotations, thereby greatly enhancing user interaction and output quality.

GLM-4.1V

Zhipu AI

Free

See Software

GLM-4.1V is an advanced vision-language model that offers a robust and streamlined multimodal capability for reasoning and understanding across various forms of media, including images, text, and documents. The 9-billion-parameter version, known as GLM-4.1V-9B-Thinking, is developed on the foundation of GLM-4-9B and has been improved through a unique training approach that employs Reinforcement Learning with Curriculum Sampling (RLCS). This model accommodates a context window of 64k tokens and can process high-resolution inputs, supporting images up to 4K resolution with any aspect ratio, which allows it to tackle intricate tasks such as optical character recognition, image captioning, chart and document parsing, video analysis, scene comprehension, and GUI-agent workflows, including the interpretation of screenshots and recognition of UI elements. In benchmark tests conducted at the 10 B-parameter scale, GLM-4.1V-9B-Thinking demonstrated exceptional capabilities, achieving the highest performance on 23 out of 28 evaluated tasks. Its advancements signify a substantial leap forward in the integration of visual and textual data, setting a new standard for multimodal models in various applications.

GLM-4.5V-Flash

Zhipu AI

Free

See Software

GLM-4.5V-Flash is a vision-language model that is open source and specifically crafted to integrate robust multimodal functionalities into a compact and easily deployable framework. It accommodates various types of inputs including images, videos, documents, and graphical user interfaces, facilitating a range of tasks such as understanding scenes, parsing charts and documents, reading screens, and analyzing multiple images. In contrast to its larger counterparts, GLM-4.5V-Flash maintains a smaller footprint while still embodying essential visual language model features such as visual reasoning, video comprehension, handling GUI tasks, and parsing complex documents. This model can be utilized within “GUI agent” workflows, allowing it to interpret screenshots or desktop captures, identify icons or UI components, and assist with both automated desktop and web tasks. While it may not achieve the performance enhancements seen in the largest models, GLM-4.5V-Flash is highly adaptable for practical multimodal applications where efficiency, reduced resource requirements, and extensive modality support are key considerations. Its design ensures that users can harness powerful functionalities without sacrificing speed or accessibility.

GLM-4.5V

Zhipu AI

Free

See Software

GLM-4.5V is an evolution of the GLM-4.5-Air model, incorporating a Mixture-of-Experts (MoE) framework that boasts a remarkable total of 106 billion parameters, with 12 billion specifically dedicated to activation. This model stands out by delivering top-tier performance among open-source vision-language models (VLMs) of comparable scale, demonstrating exceptional capabilities across 42 public benchmarks in diverse contexts such as images, videos, documents, and GUI interactions. It offers an extensive array of multimodal functionalities, encompassing image reasoning tasks like scene understanding, spatial recognition, and multi-image analysis, alongside video comprehension tasks that include segmentation and event recognition. Furthermore, it excels in parsing complex charts and lengthy documents, facilitating GUI-agent workflows through tasks like screen reading and desktop automation, while also providing accurate visual grounding by locating objects and generating bounding boxes. Additionally, the introduction of a "Thinking Mode" switch enhances user experience by allowing the selection of either rapid responses or more thoughtful reasoning based on the situation at hand. This innovative feature makes GLM-4.5V not only versatile but also adaptable to various user needs.

GLM-4.7

Zhipu AI

Free

See Software

GLM-4.7 is a next-generation AI model built to serve as a powerful coding and reasoning partner. It improves significantly on its predecessor across software engineering, multilingual coding, and terminal interaction benchmarks. GLM-4.7 introduces enhanced agentic behavior by thinking before tool use or execution, improving reliability in long and complex tasks. The model demonstrates strong performance in real-world coding environments and popular coding agents. GLM-4.7 also advances visual and frontend generation, producing modern UI designs and well-structured presentation slides. Its improved tool-use capabilities allow it to browse, analyze, and interact with external systems more effectively. Mathematical and logical reasoning have been strengthened through higher benchmark performance on challenging exams. The model supports flexible reasoning modes, allowing users to trade latency for accuracy. GLM-4.7 can be accessed via Z.ai, OpenRouter, and agent-based coding tools. It is designed for developers who need high performance without excessive cost.

MiniMax-M2.1

MiniMax

Free

See Software

MiniMax-M2.1 is a state-of-the-art open-source AI model built specifically for agent-based development and real-world automation. It focuses on delivering strong performance in coding, tool calling, and long-term task execution. Unlike closed models, MiniMax-M2.1 is fully transparent and can be deployed locally or integrated through APIs. The model excels in multilingual software engineering tasks and complex workflow automation. It demonstrates strong generalization across different agent frameworks and development environments. MiniMax-M2.1 supports advanced use cases such as autonomous coding, application building, and office task automation. Benchmarks show significant improvements over previous MiniMax versions. The model balances high reasoning ability with stability and control. Developers can fine-tune or extend it for specialized agent workflows. MiniMax-M2.1 empowers teams to build reliable AI agents without vendor lock-in.

MiniMax M2.5

MiniMax

Free

See Software

MiniMax M2.5 is a next-generation foundation model built to power complex, economically valuable tasks with speed and cost efficiency. Trained using large-scale reinforcement learning across hundreds of thousands of real-world task environments, it excels in coding, tool use, search, and professional office workflows. In programming benchmarks such as SWE-Bench Verified and Multi-SWE-Bench, M2.5 reaches state-of-the-art levels while demonstrating improved multilingual coding performance. The model exhibits architect-level reasoning, planning system structure and feature decomposition before writing code. With throughput speeds of up to 100 tokens per second, it completes complex evaluations significantly faster than earlier versions. Reinforcement learning optimizations enable more precise search rounds and fewer reasoning steps, improving overall efficiency. M2.5 is available in two variants—standard and Lightning—offering identical capabilities with different speed configurations. Pricing is designed to be dramatically lower than competing frontier models, reducing cost barriers for large-scale agent deployment. Integrated into MiniMax Agent, the model supports advanced office skills including Word formatting, Excel financial modeling, and PowerPoint editing. By combining high performance, efficiency, and affordability, MiniMax M2.5 aims to make agent-powered productivity accessible at scale.

InsForge

$25 per month

See Software

InsForge is an innovative backend platform designed specifically for AI-driven development, offering all necessary tools to create, oversee, and launch comprehensive applications via AI coding agents. As a Backend-as-a-Service, it comes equipped with essential features such as a managed PostgreSQL database, OAuth and JWT authentication, cloud storage, serverless functions, real-time updates, and AI integration, all presented through a well-structured interface that supports agent interaction. In contrast to traditional backends tailored for human developers, InsForge provides its services through a semantic layer and an MCP server, enabling AI agents to comprehend, reason about, and fully manage backend infrastructure autonomously. This unique approach empowers agents to configure databases, oversee schemas, direct authentication processes, deploy application logic, and sustain applications with minimal human input. Furthermore, this platform promotes efficiency and innovation, allowing developers to focus on higher-level tasks while AI handles routine backend operations seamlessly.

MiniMax M2.7

MiniMax

Free

See Software

MiniMax M2.7 is a powerful AI model built to drive real-world productivity across coding, search, and office-based workflows. It is trained using reinforcement learning across a wide range of real-world environments, enabling it to execute complex, multi-step tasks with precision and efficiency. The model demonstrates strong problem-solving capabilities by breaking down challenges into structured steps before generating solutions across multiple programming languages. It delivers high-speed performance with rapid token output, ensuring faster completion of demanding tasks. With optimized reasoning, it reduces token usage and execution time, making it more efficient than previous models. M2.7 also achieves state-of-the-art results in software engineering benchmarks, significantly improving response times for technical issues. Its advanced agentic capabilities allow it to work seamlessly with tools and support complex workflows with high skill accuracy. The model is designed to handle professional tasks, including multi-turn interactions and high-quality document editing. It also provides strong support for office productivity, enabling efficient handling of structured data and business tasks. With competitive pricing, it delivers high performance while remaining cost-effective. Overall, it combines speed, intelligence, and versatility to meet the needs of modern professionals and teams.

MiMo-V2-Pro

Xiaomi Technology

$1/million tokens

See Software

Xiaomi MiMo-V2-Pro is an advanced AI foundation model engineered to support real-world agentic workloads and complex workflow orchestration. It serves as the central intelligence for agent systems, enabling seamless coordination of coding, search, and multi-step task execution. The model is built on a large-scale architecture with over a trillion parameters, supporting extended context lengths for handling complex scenarios. It demonstrates strong benchmark performance, particularly in coding and agent-based evaluations, placing it among top-tier global models. MiMo-V2-Pro is optimized for real-world usability, focusing on reliability, efficiency, and practical task completion rather than just theoretical performance. It features improved tool-calling accuracy and stability, making it suitable for integration into production environments. The model also excels in software engineering tasks, offering structured reasoning and high-quality code generation. With its ability to handle long-context interactions, it supports advanced workflows across development and automation use cases. Its API accessibility and competitive pricing make it attractive for developers and enterprises. Overall, MiMo-V2-Pro delivers a balance of scale, intelligence, and real-world performance for modern AI applications.

OpenSpec

Fission AI

Free

See Software

OpenSpec is an open-source framework designed to enhance AI-assisted development through a structured, spec-driven approach. It provides a system for defining requirements before coding, ensuring alignment between developers and AI tools. The platform organizes work into clear artifacts, including proposals, specifications, design documents, and task checklists. It integrates with more than 20 AI coding assistants, making it compatible with a wide range of tools and workflows. OpenSpec promotes an iterative and flexible process, allowing teams to refine specifications as projects evolve. Its command-based interface enables users to propose features, implement changes, and archive completed work efficiently. By introducing structure, it reduces the unpredictability often associated with AI-generated code. The framework supports both individual developers and large teams, scaling across different project sizes. It also emphasizes context management to improve the accuracy and relevance of AI outputs. Ultimately, OpenSpec helps teams build software more reliably by combining human intent with AI execution in a structured workflow.

Mercury Edit 2

Inception

$0.25 per 1M input tokens

See Software

Mercury Edit 2 is a cutting-edge AI model from Inception Labs, part of the Mercury suite, specifically crafted for rapid reasoning, coding, and editing by employing a novel architecture distinctly different from typical large language models. It enhances the capabilities of Mercury 2, a diffusion-based model that generates and refines complete outputs simultaneously, rather than the conventional method of creating text one token at a time, which results in markedly improved speeds and more agile editing processes. Rather than functioning as a linear “typewriter,” this system operates as a dynamic editor, beginning with a rough draft and methodically enhancing it across multiple tokens simultaneously, facilitating real-time engagement and swift iterations in various tasks such as code editing, content creation, and agent-based workflows. This innovative framework achieves an impressive throughput of up to approximately 1,000 tokens per second, significantly outpacing traditional models while still upholding competitive reasoning abilities across various benchmarks. Its unique design not only transforms the way users interact with AI but also sets a new standard for performance in the field of artificial intelligence.

VSCodium

See Software

The source code for Microsoft's Visual Studio Code (VSCode) is available as open source under the MIT license, yet the downloadable version of the product is subject to a different license that is not considered free and open-source software (FLOSS) and includes telemetry features. To provide an alternative, the VSCodium project was created, allowing users to avoid the hassle of downloading and building from the source code by offering pre-built binaries. This project utilizes specialized build scripts that fetch Microsoft's VSCode repository, execute the necessary build commands, and subsequently upload the finished binaries to GitHub releases, all of which are licensed under the MIT license and have telemetry features disabled. For users on Mac OS X Mojave, if you encounter the message “App can’t be opened because Apple cannot check it for malicious software” when you first try to open VSCodium, you can simply right-click the application and select Open, which should only be necessary the first time you launch it on that operating system. Additionally, comprehensive documentation is available for anyone looking to transition from Visual Studio Code, as well as for addressing various issues that may arise during use of VSCodium. This ensures users have access to all the guidance they need for a smooth experience.

Kimi Code

Kimi

$15 per month

See Software

Kimi Code is an AI-driven coding assistant tailored for developers, available through the Kimi Membership, that aims to enhance efficiency by automating various software development processes and integrating effortlessly with widely-used workflows. It provides robust command-line interface (CLI) tools and is compatible with terminal environments and integrated development environments (IDEs) such as VS Code, empowering developers to read and modify code, obtain insights about codebases, create new features, resolve bugs, refactor existing code, and validate modifications through a user-friendly natural-language interface. The platform includes a specialized console that displays real-time logs, manages request quotas, and allows for pace adjustments, enabling users to set up API keys for applications like Kimi CLI, Claude Code, and Roo Code, which facilitates expedited coding with AI assistance while working within commits and ongoing workflows. In the VS Code environment, Kimi Code enhances the user experience with a built-in chat panel that supports slash commands, references to files and folders, diff views, and integration with external tools, providing context-aware coding help. Overall, Kimi Code represents a significant advancement in coding efficiency, making the software development process more intuitive and streamlined for developers at all levels.

Requesty

See Software

Requesty is an innovative platform tailored to enhance AI workloads by smartly directing requests to the best-suited model for each specific task. It boasts sophisticated capabilities like automatic fallback systems and queuing processes, guaranteeing seamless service continuity even when certain models are temporarily unavailable. Supporting an extensive array of models, including GPT-4, Claude 3.5, and DeepSeek, Requesty also provides AI application observability, enabling users to monitor model performance and fine-tune their application usage effectively. By lowering API expenses and boosting operational efficiency, Requesty equips developers with the tools to create more intelligent and dependable AI solutions. This platform not only optimizes performance but also fosters innovation in AI development, paving the way for groundbreaking applications.

MiMo-V2-Omni

Xiaomi Technology

See Software

MiMo-V2-Omni is a powerful multimodal AI model engineered to process and understand multiple types of data, including text, code, and structured inputs, within a unified system. It is designed to power agent-based workflows, enabling the execution of complex, multi-step tasks with improved accuracy and efficiency. The model combines advanced reasoning capabilities with strong tool integration, allowing it to interact with external systems and automate workflows effectively. It supports a wide range of applications, from software development and data analysis to enterprise automation and research tasks. With enhanced contextual understanding, it can maintain coherence across long interactions and complex scenarios. MiMo-V2-Omni is optimized for real-world performance, ensuring reliability in practical use cases rather than just benchmark results. Its architecture enables efficient handling of large-scale tasks while maintaining speed and responsiveness. The model also supports seamless integration into existing platforms and workflows. By combining multimodal understanding with agentic execution, it provides a flexible and scalable solution for modern AI applications. Overall, it delivers a balance of intelligence, versatility, and efficiency for diverse use cases.

Roo Code Integrations

What Integrates with Roo Code?

Visual Studio Code

OpenAI

Grok Code Fast 1

GLM-5

GLM-5.1

Model Context Protocol (MCP)

GLM-4.6

GLM-4.6V

GLM-4.1V

GLM-4.5V-Flash

GLM-4.5V

GLM-4.7

MiniMax-M2.1

MiniMax M2.5

InsForge

MiniMax M2.7

MiMo-V2-Pro

OpenSpec

Mercury Edit 2

VSCodium

Kimi Code

Requesty

MiMo-V2-Omni

Relevant Categories

Category Integrations