ItMeDiaTech/rag-cli

45 stars · Last commit 2026-03-22

Local Retrieval-Augmented Generation (RAG) plugin for Claude Code that combines Chroma db vector embeddings with intelligent info retrieval with Multi-Agent Framework (MAF) orchestration for context-aware development assistance. Uses Open Source / Free frameworks. Implements bridge to Claude Code CLI so no token use. And it's easy to setup.

README preview

# RAG-CLI v2.0

# DO NOT USE THIS TOOL FOR ANTHROPIC / CLAUDE - SEE BELOW

Just a heads-up, turns out Anthropic / Claude does not like it when you avoid token usage cost by routing traffic to the CLI tool from them. This shadow banned me from their platform when I was on their $200 a month plan. They refuse to respond after months of submitting an appeal, etc, and no project I worked on violated any aspect of their Terms. After research, I see many people have been banned on similar cases. You have been warned.

**Local Retrieval-Augmented Generation system for Claude Code with Multi-Agent Framework integration.**

A production-ready Claude Code plugin that combines ChromaDB vector embeddings with intelligent document retrieval and Multi-Agent Framework (MAF) orchestration for context-aware development assistance.

## Project Status

**Current Version**: 2.0.0
**Status**: Production Ready (with known limitations documented in KNOWN_ISSUES.md)

**Key Features:**
- ChromaDB-based vector storage with HNSW indexing
- Hybrid search combining semantic and keyword matching
- Multi-Agent Framework for intelligent query routing
- Zero external API costs for document processing

View full repository on GitHub →