RAG Document Intelligence

UPLOAD ANYDOCUMENT.ASK ANYTHING.

DocMind is a RAG-based document Q&A platform. Upload PDFs, DOCX, or text files — DocMind chunks, embeds, and indexes your content into ChromaDB. Ask questions in natural language and get cited answers grounded in your source material. Zero hallucination tolerance, evaluated with RAGAS.

Chunk Size
512tokens
Faithfulness
0.91RAGAS
Processing
AsyncCelery
DocMind / Live Session
Streaming

What attention mechanism does the Transformer use and how many heads?

The Transformer uses Multi-Head Attention with 8 parallel heads1. Each head operates on a d_k dimension of 64, with a model dimension d_model of 5122

attention_paper.pdf
p. 5
[1]
attention_paper.pdf
p. 7
[2]
1 source · 5 chunks scannedRAGAS 0.91

Built on production-grade infrastructure

FA
FastAPI
NX
Next.js
CH
ChromaDB
LC
LangChain
GM
Gemini
CL
Celery
RD
Redis
DK
Docker
The Pipeline

From upload to insight.

A three-stage RAG pipeline engineered for grounded, traceable answers. Every step is observable, every chunk is cited.

01
Step 01

Upload

Drag any PDF, DOCX, or text file. Async ingestion via Celery — your dashboard stays responsive.

PyPDF · python-docx · 20MB max
02
Step 02

Index

Documents are chunked semantically, embedded with Gemini, and stored in your isolated ChromaDB namespace.

512-token chunks · cosine similarity
03
Step 03

Chat

Multi-query expansion + cross-encoder reranking deliver cited answers streamed token-by-token.

Top-5 reranked · SSE streaming
Capabilities

Built for grounded answers.

Every feature engineered with one goal: zero hallucinations. Cite the source or refuse to answer.

Real-time

Token-by-token streaming.

Server-Sent Events deliver each token the moment Gemini generates it. No waiting for full responses — feel the model think.

▸ data: {"type":"token"}
▸ data: {"type":"sources"}
▸ data: {"type":"done"}
Grounded

Citation-first.

Every claim traces back to a chunk. If the document doesn't answer it, neither does the model.

Universal

Any document.

PDFs with tables, DOCX with formatting, raw text. We handle the parsing chaos so you don't have to.

Isolated

Multi-tenant ready.

Per-user ChromaDB namespaces. Your documents never touch another tenant's vector space.

Open source

Run it anywhere.

Docker Compose ships everything: FastAPI, Celery, Redis, ChromaDB, Postgres, and the Next.js client. One command, full stack.

$ docker compose up
Try it now

See it in action.

Sample document: 'Attention Is All You Need' — the original Transformer paper. Pick a question to watch the streaming response.

attention_paper.pdf
1 doc · 47 chunks · ready
Demo Mode

Pick a sample question below to start

Try these questions
Pricing

Simple plans. No surprises.

Start free. Upgrade when you ship. Cancel any time.

For exploring

Free

$0/forever
  • 5 documents
  • 100 queries/month
  • Gemini Flash model
  • Public deploy
  • Community support
Most Popular
For builders

Pro

$19/per month
  • Unlimited documents
  • 10K queries/month
  • Gemini Pro + GPT-4o
  • Private deployments
  • RAGAS eval reports
  • Priority support
For teams

Enterprise

Custom/annual
  • Everything in Pro
  • Self-hosted option
  • SSO + audit logs
  • SLA + dedicated support
  • Custom integrations
  • Volume pricing
Ready when you are

START BUILDING
IN 60 SECONDS.

No credit card. Just clone, compose, and chat. Your first cited answer is one upload away.

✓ Open source✓ Self-hostable✓ Production-ready✓ MIT licensed