Mutation Browser

Mutation Browser is a full-stack web application for storing, querying, and interactively exploring structured genomic mutation data. The system combines a relational database, a backend web API, a browser-based user interface, and asynchronous query logging, with deployment options via Docker Compose or Kubernetes (Minikube).

It is designed to demonstrate a clean, modular architecture for data ingestion, persistence, asynchronous processing, and downstream consumption.

Tech Stack

Frontend: React, Vite
Backend: Python, Flask, Flask-CORS, Gunicorn, Celery
Database: PostgreSQL
Data Access: psycopg2 (PostgreSQL driver)
Infra/DevOps: Docker, Docker Compose, RabbitMQ, Kubernetes (Minikube)
Architecture: React frontend + Flask REST API + PostgreSQL database + async logging with Celery/RabbitMQ

Overview

The Mutation Browser system provides:

A PostgreSQL-backed data model for mutation records
A batch ingestion mechanism for loading mutation data into the database
A Flask-based web API for querying mutations
A React-based frontend for interactive exploration
Asynchronous logging of query events using Celery and RabbitMQ
Background worker for audit and analytics tasks
Containerized deployment using Docker and Docker Compose
Local Kubernetes deployment using Minikube and Kubernetes manifests

The architecture cleanly separates data storage, application logic, and presentation, and is suitable for local development as well as cloud deployment.

Architecture

Mutation records are stored in a PostgreSQL database.
A Flask API queries the database using parameterized SQL.
A React frontend consumes API responses over HTTP.
A Celery worker consumes query-log tasks from RabbitMQ and writes to PostgreSQL.
Services are orchestrated locally using Docker Compose.
Kubernetes manifests mirror the Compose services for local orchestration.

This diagram illustrates the Kubernetes (Minikube) deployment view, showing services, deployments, and the worker/broker flow.

This diagram illustrates the flow of mutation data from ingestion through persistent storage and into a stateless web API, which is then consumed by a browser-based client.

Project structure

backend/
Flask web service implementing the API layer
backend/celery_app.py
Celery app configuration and async task definitions
frontend/
React application providing an interactive user interface
db/
Database schema definitions
data/
Data preparation and ingestion utilities
scripts/
Operational and maintenance scripts
docker/
Containerization documentation and assets
k8s/
Kubernetes manifests for local Minikube deployment

Database schema

The database contains tables for mutation records and query logging:

variants

sample_id
gene
variant
vaf
tumor_type

query_events

request_id
gene
requested_at
status

The schema is defined in db/schema.sql.

API endpoints

GET /health
Health check endpoint
GET /variants?gene=<GENE>
Returns mutation records for the specified gene and a request ID
GET /status/<REQUEST_ID>
Returns the async query-log status for a request ID

User interface

The frontend provides a browser-based interface for querying mutation records by gene name and inspecting results in tabular form, including a count of matching samples.

The frontend communicates directly with the backend API and demonstrates end-to-end data flow from database to user-facing application.

Running the system locally

The backend API and database are run using Docker Compose, while the frontend is served via a local development server.

At a high level:

Start backend services (API + Postgres + RabbitMQ + Celery):
- docker compose up -d
Start the frontend dev server:
- cd frontend && npm run dev
Open the app:
- Frontend: http://localhost:5173
- API: http://localhost:8000/health

Detailed run instructions are provided in the respective component directories.

Run with Minikube (Kubernetes)

Prerequisites:

brew install kubectl minikube
kubectl version --client
minikube version

Steps:

Stop Docker Compose (if running):
- docker compose down
Start Minikube (Docker driver):
- minikube start --driver=docker
Point Docker to Minikube:
- eval $(minikube docker-env)
Build the backend image inside Minikube:
- docker build -t mutation-browser-api:local .
Apply Kubernetes manifests:
- kubectl apply -f k8s/
Wait for pods:
- kubectl get pods
Create query_events table:
- kubectl exec -i postgres-0 -- psql -U mutation_user -d mutation_browser <<'SQL' CREATE TABLE IF NOT EXISTS query_events ( id SERIAL PRIMARY KEY, request_id TEXT NOT NULL UNIQUE, gene TEXT NOT NULL, requested_at TIMESTAMPTZ NOT NULL, status TEXT NOT NULL ); SQL
Create variants table:
- kubectl exec -i postgres-0 -- psql -U mutation_user -d mutation_browser <<'SQL' CREATE TABLE IF NOT EXISTS variants ( id SERIAL PRIMARY KEY, sample_id TEXT NOT NULL, gene TEXT NOT NULL, variant TEXT NOT NULL, vaf FLOAT, tumor_type TEXT ); SQL
Load sample data:
- kubectl cp data/mutations.csv postgres-0:/tmp/mutations.csv
- kubectl exec -i postgres-0 -- psql -U mutation_user -d mutation_browser <<'SQL' \copy variants(sample_id, gene, variant, vaf, tumor_type) FROM '/tmp/mutations.csv' WITH (FORMAT csv, HEADER true); SQL
Port-forward API:

kubectl port-forward svc/api 8000:8000

Test:

curl "http://localhost:8000/health"
curl "http://localhost:8000/variants?gene=TP53"

(Optional) run frontend:

cd frontend && npm run dev

Switch Docker back to host:

eval $(minikube docker-env -u)

Stop and resume (Minikube)

Stop everything:
- minikube stop
- eval $(minikube docker-env -u)
Resume later (no code changes):
- minikube start --driver=docker
- kubectl port-forward svc/api 8000:8000
If you rebuilt code:
- eval $(minikube docker-env)
- docker build -t mutation-browser-api:local .
- kubectl apply -f k8s/
- kubectl port-forward svc/api 8000:8000
Restore host Docker context:
- eval $(minikube docker-env -u)

Check Kubernetes resources

Pods (running containers): kubectl get pods
Services (network endpoints): kubectl get svc
Deployments (desired vs actual pods): kubectl get deployments

Check query log data

Minikube (Kubernetes):
- kubectl exec -i postgres-0 -- psql -U mutation_user -d mutation_browser -c "SELECT * FROM query_events ORDER BY id DESC LIMIT 10;"
Docker Compose:
- docker compose exec -T postgres psql -U mutation_user -d mutation_browser -c "SELECT * FROM query_events ORDER BY id DESC LIMIT 10;"

RabbitMQ Management UI

Docker Compose:
- http://localhost:15672 (user: guest, password: guest)
Minikube (Kubernetes):
- kubectl port-forward svc/rabbitmq 15672:15672
- http://localhost:15672 (user: guest, password: guest)

Optional: migrate data from Docker Compose Postgres

Export from Docker Compose:
- docker compose exec -T postgres pg_dump -U mutation_user -d mutation_browser > dump.sql
Import into Minikube Postgres:
- kubectl exec -i postgres-0 -- psql -U mutation_user -d mutation_browser < dump.sql

Async Request Flow (Query Logging)

User enters a gene in the React UI and clicks Search.
React calls GET /variants?gene=<GENE>.
Flask generates a request_id and enqueues a log task.
Flask queries Postgres and returns results immediately.
RabbitMQ delivers the queued task to the Celery worker.
The Celery worker writes a log row to query_events.
(Optional) GET /status/<REQUEST_ID> returns the log status.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
backend		backend
data		data
db		db
docker		docker
docs		docs
frontend		frontend
k8s		k8s
scripts		scripts
tests		tests
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Mutation Browser

Table of contents

Tech Stack

Overview

Architecture

Project structure

Database schema

API endpoints

User interface

Running the system locally

Run with Minikube (Kubernetes)

Stop and resume (Minikube)

Check Kubernetes resources

Check query log data

RabbitMQ Management UI

Optional: migrate data from Docker Compose Postgres

Async Request Flow (Query Logging)

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Mutation Browser

Table of contents

Tech Stack

Overview

Architecture

Project structure

Database schema

API endpoints

User interface

Running the system locally

Run with Minikube (Kubernetes)

Stop and resume (Minikube)

Check Kubernetes resources

Check query log data

RabbitMQ Management UI

Optional: migrate data from Docker Compose Postgres

Async Request Flow (Query Logging)

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages