Django RAG System

A Django-based Retrieval-Augmented Generation (RAG) app for uploading .txt and .pdf files, storing embeddings in PostgreSQL with pgvector, and answering questions with Groq's OpenAI-compatible chat API.

What It Does

Upload text and PDF documents through the web UI or JSON API.
Extract raw text from each file.
Split text into chunks with overlap for retrieval.
Generate 384-dimensional embeddings with sentence-transformers/all-MiniLM-L6-v2.
Store chunk embeddings in PostgreSQL using pgvector.
Retrieve the closest chunks with vector similarity search.
Send retrieved context to Groq's llama-3.1-8b-instant model to generate an answer.
Save query history and returned sources for later viewing.

Stack

Django 4.2
PostgreSQL
pgvector
langgraph
sentence-transformers
langchain-text-splitters
Groq via the OpenAI Python client
PyPDF2
python-dotenv

Note: LangGraph currently orchestrates only the query path. The rest of the service layer and the Django views are unchanged.

Project Layout

DjangoRag/
├── django_rag/                         # Django project config
│   ├── settings.py
│   ├── urls.py
│   ├── asgi.py
│   └── wsgi.py
├── rag_app/                            # Main app
│   ├── management/commands/
│   │   └── ingest_documents.py         # Batch ingestion command
│   ├── migrations/
│   │   ├── 0000_enable_pgvector.py     # Enables PostgreSQL vector extension
│   │   └── 0001_initial.py             # Creates core tables
│   ├── services/
│   │   └── rag_service.py              # Text processing, embeddings, retrieval, LLM calls
│   ├── forms.py
│   ├── models.py
│   ├── urls.py
│   └── views.py
├── templates/
│   ├── base.html
│   └── rag_app/                        # Document, query, and result templates
├── documents/                          # Uploaded files and media root
├── requirements.txt
├── README.md
└── Architecture.md

Data Model

`Document`

Stores uploaded file metadata, extracted text, and processing state.

title
file
content
uploaded_at
processed

`DocumentChunk`

Stores chunked document content plus a 384-dimensional embedding.

document
content
chunk_index
embedding
created_at

`QueryHistory`

Stores user questions, generated answers, and source snippets returned to the UI/API.

query
answer
sources
created_at

Request Flow

Document ingestion

A user uploads a .txt or .pdf file.
DocumentUploadView saves the file as a Document.
RAGService.ingest_document() reads the file from disk.
TextProcessor extracts raw text.
Text is chunked using RecursiveCharacterTextSplitter.
EmbeddingService generates an embedding for each chunk.
Each chunk is saved as a DocumentChunk.
The document is marked as processed.
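
The chunking step above can be illustrated with a simplified fixed-width sliding window. This is a sketch only, not the RecursiveCharacterTextSplitter algorithm, which prefers to split on separators such as paragraph and line breaks. It just shows how chunk size and overlap interact. The function name chunk_text is hypothetical, and the default values mirror CHUNK_SIZE and CHUNK_OVERLAP from the configuration defaults.

```python
from __future__ import annotations


def chunk_text(text: str, chunk_size: int = 1000, chunk_overlap: int = 200) -> list[str]:
    """Split text into fixed-width windows that share chunk_overlap characters.

    Hypothetical helper for illustration; the app itself uses
    RecursiveCharacterTextSplitter, which is separator-aware.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= chunk_overlap < chunk_size:
        raise ValueError("chunk_overlap must be in [0, chunk_size)")

    stride = chunk_size - chunk_overlap  # how far each window advances
    chunks: list[str] = []
    for start in range(0, len(text), stride):
        chunks.append(text[start:start + chunk_size])
        # Stop once a window has reached the end of the text.
        if start + chunk_size >= len(text):
            break
    return chunks
```

With the defaults, each window starts 800 characters after the previous one, so the last 200 characters of a chunk reappear at the start of the next chunk.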

Query answering

A user submits a question from the UI or API.
RAGService.retrieve_similar_chunks() embeds the query.
pgvector L2Distance ranks the stored chunks by distance to the query embedding.
A LangGraph workflow routes either to a no-results response or to answer generation.
Top chunks are concatenated into a prompt.
Groq generates the answer with llama-3.1-8b-instant.
The answer and sources are saved in QueryHistory.
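
The ranking and routing steps above can be sketched in plain Python. In the app, the ranking happens inside PostgreSQL through pgvector, and the routing is a LangGraph workflow; the in-memory version below only illustrates the logic. The function names and the route labels "no_results" and "generate_answer" are hypothetical.

```python
import math


def l2_distance(a, b):
    """Euclidean (L2) distance between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def top_k_chunks(query_embedding, chunks, k=5):
    """Return the k closest (distance, content) pairs, nearest first.

    chunks is a list of (content, embedding) tuples.
    """
    scored = [(l2_distance(query_embedding, emb), content) for content, emb in chunks]
    scored.sort(key=lambda pair: pair[0])  # smaller distance means closer
    return scored[:k]


def route(results):
    """Pick the next step: answer generation only if something was retrieved."""
    return "generate_answer" if results else "no_results"
```

A smaller L2 distance means a closer match, so the list is sorted in ascending order before the top k entries are taken.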

Setup

1. Install dependencies

python -m venv venv
source venv/bin/activate
pip install -r requirements.txt

2. Create PostgreSQL database and enable `pgvector`

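Create a PostgreSQL database, then make sure migrations can enable the vector extension: pgvector must be installed on the PostgreSQL server, and the database user needs permission to run CREATE EXTENSION.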

Example:

CREATE DATABASE rag_db;

The app migration rag_app/migrations/0000_enable_pgvector.py runs:

CREATE EXTENSION IF NOT EXISTS vector;

3. Configure environment variables

Copy .env.template to .env and set values:

DB_NAME=rag_db
DB_USER=postgres
DB_PASSWORD=postgres
DB_HOST=localhost
DB_PORT=5432

GROQ_API_KEY=your_groq_api_key_here

SECRET_KEY=your_django_secret_key_here
DEBUG=True
ALLOWED_HOSTS=localhost,127.0.0.1
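
For orientation, here is a minimal sketch of how variables like these are commonly turned into Django settings. It is an assumption about the general pattern, not a copy of django_rag/settings.py, which may read them differently. The helper names and the fallback values are hypothetical.

```python
import os


def database_settings(env=os.environ):
    """Build a Django DATABASES dict from DB_* variables (fallbacks are assumptions)."""
    return {
        "default": {
            "ENGINE": "django.db.backends.postgresql",
            "NAME": env.get("DB_NAME", "rag_db"),
            "USER": env.get("DB_USER", "postgres"),
            "PASSWORD": env.get("DB_PASSWORD", ""),
            "HOST": env.get("DB_HOST", "localhost"),
            "PORT": env.get("DB_PORT", "5432"),
        }
    }


def parse_allowed_hosts(raw):
    """Turn 'localhost,127.0.0.1' into ['localhost', '127.0.0.1']."""
    return [host.strip() for host in raw.split(",") if host.strip()]


def parse_debug(raw):
    """Environment values are strings, so 'True' must be converted explicitly."""
    return raw.strip().lower() in {"1", "true", "yes"}
```

Environment variables are always strings, which is why DEBUG and ALLOWED_HOSTS need explicit parsing rather than being used as-is.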

4. Run migrations

python manage.py migrate

5. Start the server

python manage.py runserver

Open http://127.0.0.1:8000.

Docker

You can run the app locally with Docker Compose and a dedicated PostgreSQL + pgvector container.

1. Create your `.env`

Copy .env.template to .env and set at least:

DB_NAME=rag_db
DB_USER=postgres
DB_PASSWORD=postgres
DB_HOST=localhost
DB_PORT=5432
GROQ_API_KEY=your_groq_api_key_here
SECRET_KEY=your_django_secret_key_here
DEBUG=True
ALLOWED_HOSTS=localhost,127.0.0.1

Note: in Docker, docker-compose.yml overrides DB_HOST to db, so you can keep localhost in your local .env for non-Docker runs.

2. Build and start the containers

docker compose up --build

This starts:

web on http://127.0.0.1:8000
db on localhost:5432

The web container runs migrations automatically on startup.

3. Stop the stack

docker compose down

To also remove the Postgres data volume:

docker compose down -v

4. Useful Docker commands

Run Django tests inside the app container:

docker compose exec web python manage.py test

Open a Django shell:

docker compose exec web python manage.py shell

Inspect the database with psql:

docker compose exec db psql -U postgres -d rag_db

Docker files

Dockerfile builds the Django app image.
docker-compose.yml defines the app and database services.
.dockerignore keeps the Docker build context small.

Usage

Web UI

/ shows uploaded documents and processing status.
/documents/upload/ uploads and processes a document.
/rag/query/ asks a question and shows recent query history.
/rag/answer/<id>/ shows a saved result.

API

Upload a document

POST /api/documents/upload/

Multipart form fields:

title
file

Returns:

{
  "success": true,
  "document_id": 1,
  "title": "Example",
  "chunks": 4
}

Ask a question

POST /api/rag/query/

Request body:

{
  "query": "What does the document say about refunds?"
}

Returns:

{
  "query_id": 1,
  "answer": "...",
  "sources": [
    {
      "document": "Policy",
      "content": "...",
      "similarity": 0.8123
    }
  ]
}
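
A minimal client sketch for the query endpoint, using only the Python standard library. It assumes the endpoint accepts a JSON body with a Content-Type of application/json and needs no authentication or CSRF token; neither is confirmed above. The function names and the timeout value are hypothetical.

```python
import json
import urllib.request


def build_query_request(base_url: str, query: str) -> urllib.request.Request:
    """Prepare a POST to /api/rag/query/ without sending it."""
    body = json.dumps({"query": query}).encode("utf-8")
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/api/rag/query/",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(base_url: str, query: str, timeout: float = 30.0) -> dict:
    """Send the question and return the decoded JSON response."""
    request = build_query_request(base_url, query)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the development server running, ask("http://127.0.0.1:8000", "What does the document say about refunds?") would be expected to return a dictionary with the query_id, answer, and sources keys shown above.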

Batch Ingestion

You can ingest files from disk with the management command:

python manage.py ingest_documents
python manage.py ingest_documents --directory documents
python manage.py ingest_documents --directory /absolute/path/to/files --reprocess

Supported file types are .txt and .pdf.
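
As an illustration of the file-type filter, the sketch below lists the files in a directory that have a supported extension. It is not the code of ingest_documents. Whether the real command recurses into subdirectories or matches extensions case-insensitively is not stated here, so both behaviors below are assumptions, and the function name is hypothetical.

```python
from pathlib import Path

SUPPORTED_SUFFIXES = {".txt", ".pdf"}


def find_ingestable_files(directory):
    """Return supported files directly inside directory, sorted by path.

    Assumptions: no recursion, case-insensitive extension match.
    """
    root = Path(directory)
    return sorted(
        path
        for path in root.iterdir()
        if path.is_file() and path.suffix.lower() in SUPPORTED_SUFFIXES
    )
```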

Configuration Defaults

Current defaults from django_rag/settings.py:

EMBEDDING_MODEL: sentence-transformers/all-MiniLM-L6-v2
CHUNK_SIZE: 1000
CHUNK_OVERLAP: 200
MEDIA_ROOT: documents/
TIME_ZONE: UTC

Important Implementation Notes

Ingestion runs synchronously inside the upload request. Large files will make the upload request slow.
Query answering makes a live external API call to Groq.
The initial LangGraph rollout covers the query orchestration path only.
Query results use the top 5 nearest chunks by L2 distance.
The Docker setup is intended for local development and testing, not production hardening.
Uploaded files are stored under the local documents/ directory.
Document.content stores the full extracted text, so database size grows with file volume.
The current tests are lightweight and do not mock external embedding or LLM calls.
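
If the tests are later changed to avoid loading the real embedding model, one option is a deterministic stand-in that returns vectors of the right size. The sketch below is a suggestion rather than part of the codebase. The name fake_embedding is hypothetical, and the vectors carry no semantic meaning; they are only stable and 384-dimensional.

```python
from __future__ import annotations

import hashlib

EMBEDDING_DIM = 384  # matches the all-MiniLM-L6-v2 output size


def fake_embedding(text: str, dim: int = EMBEDDING_DIM) -> list[float]:
    """Deterministic test double: the same text always yields the same vector."""
    values: list[float] = []
    counter = 0
    while len(values) < dim:
        # Each SHA-256 digest contributes 32 bytes, scaled into [0, 1].
        digest = hashlib.sha256(f"{counter}:{text}".encode("utf-8")).digest()
        values.extend(byte / 255.0 for byte in digest)
        counter += 1
    return values[:dim]
```

A test could substitute this for the real embedding call with unittest.mock.patch. The exact method to patch on EmbeddingService depends on the service code in rag_app/services/rag_service.py.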

Development Notes

Main service logic lives in rag_app/services/rag_service.py.
UI templates live in templates/rag_app.
URL routing starts in django_rag/urls.py and delegates to rag_app/urls.py.

For a deeper system breakdown, see Architecture.md.
