Agent-Fusion
Agent Fusion is a local RAG semantic search engine that gives AI agents instant access to your code, documentation (Markdown, Word, PDF). Query your codebase from code agents without hallucinations. Runs 100% locally, includes a lightweight embedding model, and optional multi-agent task orchestration. Deploy with a single JAR
How to use
Agent-Fusion is a powerful local RAG (Retrieve, Augment, Generate) semantic search engine designed to provide AI agents with direct access to your code and documentation, including formats such as Markdown, Word, and PDF. Developers can use this tool to query their codebases without the risk of hallucinations, enhancing productivity and efficiency in code exploration and documentation retrieval. With its lightweight embedding model and optional multi-agent task orchestration, Agent-Fusion streamlines the interaction between AI agents and your development resources.
Once connected to the Agent-Fusion server, you can interact with it by sending natural language queries that reference your code and documentation. For optimal results, focus on specific questions about your codebase or documentation content. You can also leverage the multi-agent task orchestration feature to execute complex queries that require the collaboration of multiple agents. Keep your queries clear and concise to maximize the relevance of the results returned.
How to install
Prerequisites
- Java 11 or higher (for running the JAR file)
- Optional: Docker (for containerized deployment)
Option A: Quick Start with JAR
You can quickly start using Agent-Fusion by downloading the JAR file from the repository. Use the following command in your terminal:
java -jar path/to/Agent-Fusion.jar
Option B: Global Install Alternative
If you prefer to run it in a global context, you can download the JAR file and place it in a directory included in your system’s PATH. Then run the JAR file as mentioned above.
Additional notes
To maximize the performance of Agent-Fusion, consider configuring environment variables such as JAVA_OPTS to allocate more memory if you're working with large codebases. Make sure to check the documentation for any specific configuration settings related to your embedding model. A common gotcha is ensuring that the required file formats are properly indexed before querying, so verify that your documentation is accessible to the server.
Related MCP Servers
archestra
Secure cloud-native MCP registry, gateway & orchestrator
cui
A web UI for Claude Code agents
better-chatbot
Just a Better Chatbot. Powered by Agent & MCP & Workflows.
OpenContext
A personal context store for AI agents and assistants—reuse your existing coding agent CLI (Codex/Claude/OpenCode) with built‑in Skills/tools and a desktop GUI to capture, search, and reuse project knowledge across agents and repos.
kindly-web-search
Kindly Web Search MCP Server: Web search + robust content retrieval for AI coding tools (Claude Code, Codex, Cursor, GitHub Copilot, Gemini, etc.) and AI agents (Claude Desktop, OpenClaw, etc.). Supports Serper, Tavily, and SearXNG.
vsync
Sync MCP servers, Skills, Agents & Commands across Claude Code, Cursor, OpenCode, Codex. One config, all tools.