findajob

Self-hosted infrastructure for a sane job search.

The modern job search grinds people down — hundreds of listings per day, most irrelevant; the same cover letter rewritten at midnight; no memory of which companies went silent weeks ago; no signal about whether the rejections mean "wrong skill," "wrong level," or "wrong field."

Burnout is the default. findajob absorbs the triage, the tailoring, and the tracking so your attention goes to the few applications actually worth sending. It's a pre-1.0 personal project — used daily by the operator and a small wave of beta testers, not a polished product yet.

LinkedIn, Indeed, Greenhouse, and Gmail flow in; a local LLM filters out the noise; a web UI lets you triage, prep, and track. Runs as a Docker container on any Linux host. ~$0.50–2/day in API usage.

What it does

The pipeline narrows the funnel at every step where a human would otherwise waste attention — LLM triage on the way in, human triage on the way to prep, prep only for jobs worth applying to. Thirty days on the operator's instance looks like this:

Funnel snapshot (30 days, except where noted)	Count	Conversion at this step
Listings ingested	12,824	—
Scored ≥7 (surfaced to operator)	393	3.1% of ingested
Prepped (resume + cover letter + briefing)	160	41% of surfaced
Applications sent	60	38% of prepped
Interviews (lifetime, not 30d)	6	—

Pass rate at each step:
Surfaced   ▓░░░░░░░░░░░░░░░░░░░░░░░░   3.1%   ← LLM triage does the heaviest cut
Prepped    ▓▓▓▓▓▓▓▓▓▓░░░░░░░░░░░░░░░   41%
Applied    ▓▓▓▓▓▓▓▓▓░░░░░░░░░░░░░░░░   38%
Interview  ▓▓░░░░░░░░░░░░░░░░░░░░░░░   10%

12,824 listings narrowed to 60 applications — triage cuts the noise so attention goes to the few worth sending. The reject-with-reason flow (323 rejected with feedback, 54 waitlisted in the same 30 days) feeds back into the scorer so its cuts keep improving. Prep is LLM-assisted but user-gated: you never apply to a job the system chose for you.

Is this for you?

If your search feels like 11pm cover letters, spreadsheet sprawl, and bot-rejection silence — this is built for exactly that.
If you want polished consumer SaaS — not yet. It's self-hosted, rough at the edges, opinionated, and used daily by the operator.
If you're technical and want to read the code — see docs/architecture.md.

Roadmap

Live status of every issue and milestone is on the project board — issues move through Backlog → Up Next → In Progress → Done as work happens. The summary below is a snapshot.

Milestone	What it means	Status	Target
General Availability	A second non-technical user runs their own instance end-to-end. Config layer fully externalized, user docs written, onboarding flow exists.	35 closed / 13 open	2026-05-31
v1.1 — Cost + Credentials Hardening	You see per-job and per-week LLM spend in-app, and no plaintext API key lives on disk.	0 closed / 7 open	2026-07-30
v1.2 — Tuning Loop + Stats	The pipeline recommends scorer tunes from your behavior, and `/stats/*` dashboards show precision, outcome, recall, and cost trends over time.	0 closed / 19 open	2026-09-29
v1.3 — Ops Hardening	Fresh-install smoke is CI-gated, log rotation works, DB migrates cleanly across versions.	1 closed / 12 open	2026-10-30
v1.4 — Funnel + Triage UX	Every candidate row in the daily triage loop is actionable in one click, with prior-application context inline.	7 closed / 13 open	2026-08-30

Counts above are approximate snapshots — for the live state, follow the project board.

How it works

0. Onboarding (one-time, on first visit to the web UI) — produces your candidate profile, target-companies list, prefilter rules, and search queries from a structured interview. See Quick start below for the two onboarding paths and what you'll need.

1. Daily triage (00:00, scheduler-driven) — fetches 100–500 listings from RapidAPI (LinkedIn), direct ATS feeds (Greenhouse, Ashby, Lever), and Gmail job alerts (LinkedIn + Indeed); cleans + deduplicates; enriches with JD text; scores each against your candidate profile (the profile.md produced in step 0) using an LLM. Results land in SQLite.

2. Dashboard triage — the web UI shows every scored job that cleared the threshold, with relevance/fit/probability scores, known contacts, and AI notes. You flag the ones worth prepping.

3. Prep (on-flag) — launches prep_application.py, which generates a folder per job containing a tailored resume, cover letter, company briefing, and network-outreach drafts. Uses Claude Opus for writing, Perplexity for company research.

4. Apply + track — you submit the application, mark the job Applied. The Applied tab color-codes by days-since-submission so you can see at a glance which applications have gone silent too long.

5. Reject with reason — jobs that don't work out get rejected with a reason (Skills Mismatch, Too Senior, Comp Too Low, Geography/Onsite, etc.). Those reasons feed back into the next day's scorer as negative examples.

6. Learn — stats dashboards make the funnel and the rejection mix legible, so you can tell whether the scorer is drifting or whether a particular reason is spiking — a signal to tune the profile or retarget.

Screenshots are from a fresh-install demo database seeded with fictional jobs across data center operations, social work, and K–12 education. No real employer or candidate data.

What you get out of it

No more switching between Linear, Notion, Gmail, and three browser tabs. Dashboard, Applied, Waitlist, Review, Rejected, Archive are all filtered views of the same SQLite table. Sort, filter, density toggles are URL query params — any view is bookmarkable and shareable.
Your tailored resumes and cover letters stay yours. Generated folders sit on your Docker host as plain .docx and .md files; the web UI renders Markdown inline and serves the docs as downloads. Nothing is locked behind a vendor login.
When you reject a job, you tell it why — and tomorrow's scorer remembers. Every rejection is a labeled training example. Every manual-review flag points at the part of your profile the LLM found ambiguous, so you know exactly where to tune.
Built by a data center ops candidate; designed to work for a social worker, teacher, accountant, or trades professional too. Same pipeline, same setup — only profile.md changes. See docs/maintainers/generalization.md for the state of the field-agnostic work.
Your data stays local. SQLite on your Docker host. The only outbound calls are to the LLM providers you've configured; the repo contains zero personal data.

Stack

Component	Choice
Scoring	DeepSeek v3.2 via OpenRouter
Resume + cover letter + outreach	Claude Opus / Sonnet 4.6 via OpenRouter
Company research	Perplexity Sonar Pro via OpenRouter
LLM transport	`findajob.llm.openrouter` — stdlib HTTP wrapper with prompt caching (Anthropic) and provider pinning
Storage	SQLite
Job sources	RapidAPI jobs-api14, Greenhouse JSON, Gmail IMAP/app-password (configurable; see `docs/getting-started/gmail.md`)
Web UI	FastAPI + HTMX + Tailwind + Chart.js
Push notifications	ntfy.sh
Scheduler	supercronic (in-container)

Quick start

The pipeline ships as ghcr.io/brockamer/findajob pulled via Docker Compose.

What you'll need

Two API keys before you start. Sign-up walkthroughs + cost expectations are in docs/getting-started/api-keys.md:

Provider	Required?	What you'll spend	What findajob uses it for
OpenRouter	yes	pay-as-you-go from $0; ~$0.50/day triage-only, $1.50–3.00 per fully-prepped job, ~$3-6 per in-app onboarding interview	All LLM calls (scoring, prep writing, in-app onboarding)
RapidAPI (jobs-api14)	optional	BASIC plan: 150 req/month free, no credit card	LinkedIn + Indeed search ingestion

Skipping RapidAPI means LinkedIn/Indeed search is inactive — Greenhouse/Ashby/Lever feeds and Gmail alerts still work, so the daily pipeline runs identically without it. The "What it costs to run" section near the bottom of this README breaks the OpenRouter spend down by component if you want a more granular budget. You collect both keys on the onboarding page once your container is up.

Deploy

# On your Docker host
sudo mkdir -p /opt/stacks/findajob-<you>/state/{data,config,candidate_context,companies,logs}
sudo chown -R $(id -u):$(id -g) /opt/stacks/findajob-<you>/
cd /opt/stacks/findajob-<you>

curl -fsSL -o compose.yaml https://raw.githubusercontent.com/brockamer/findajob/main/ops/compose.yaml.example
curl -fsSL -o .env         https://raw.githubusercontent.com/brockamer/findajob/main/ops/stack.env.example

# Edit .env (timezone, port, basic-auth password if internet-exposed)
docker compose up -d

First-run onboarding

Open http://<your-host>:<port>/ in a browser. A fresh stack redirects you straight into onboarding — no need to know to navigate via Tools → Onboarding.

Step 1 — paste your OpenRouter key (required), plus optional RapidAPI and Google keys. The OpenRouter key is smoke-checked against the live API before being saved.

Step 2 — click Start interview. A chat surface opens inside findajob and walks you through a 60–90 minute conversation about your background, target role, exclusions, and writing voice. The session is server-side persistent — close the tab anytime, reload, and the page surfaces a Resume affordance. When the LLM finishes emitting your config blocks, a Finalize button appears; clicking it writes the config files atomically, runs initial company discovery, and lands you on the dashboard. No copy-paste step.

Cost runs ~$3-6 per onboarding (Claude Sonnet 4.6 with prompt caching). The next scheduled triage run (00:00 in your TZ) ingests its first batch of jobs.

Full walkthrough → docs/getting-started/install-docker.md (or start at docs/getting-started/README.md for the guided sequence).

Documentation

Start here:

Getting started — guided sequence for getting your stack running
Daily workflow — what to do each day, tab by tab in the web UI
Troubleshooting — symptom index, log reading, health alerts
Architecture — system design, data flow, component map (for operators who want to read the code)

All documentation (click to expand)

Doc	Contents
docs/getting-started/README.md	Getting started — start here
docs/usage.md	Daily workflow: web UI tab by tab
docs/troubleshooting.md	Symptom index + log reading + health alerts
docs/architecture.md	System design, data flow, component map
docs/getting-started/prerequisites.md	API keys, accounts, tools you need
docs/getting-started/install-docker.md	Docker Compose setup
docs/getting-started/api-keys.md	Getting your three API keys (OpenRouter, RapidAPI, Google AI Studio)
docs/getting-started/configure.md	Profile, resume, search queries, advanced config
docs/operations/README.md	Operator reference: manual commands, monitoring
docs/operations/internet-exposure.md	Exposing the stack to the public internet
docs/operations/restore.md	Restore from backup
docs/getting-started/notifications.md	ntfy.sh setup

For contributors:

Doc	Contents
docs/maintainers/release-process.md	Release ceremony, dogfood gate, CHANGELOG conventions
docs/maintainers/project-board.md	GitHub Projects v2 board conventions and CLI
docs/maintainers/plan-conventions.md	What every implementation plan must contain
docs/maintainers/generalization.md	Making the pipeline work for non-tech fields

What it costs to run

Real-world per-day usage on the operator's instance, ~10k jobs/month scored:

Item	Typical day
Scoring (DeepSeek via OpenRouter)	$0.10–0.30
Company research (Perplexity Sonar Pro)	$0.10–0.20 per prepped job
Prep writing (Claude Opus)	$1.50–3.00 per prepped job
Embeddings rebuild (Gemini)	~$0.01/week

Total: ~$0.50/day when triaging only; ~$5–15 on days you prep a few applications.

Privacy

The repository contains no personal data. All candidate content (resume, profile, writing samples, search queries, API keys) lives in gitignored paths populated from .example templates. The Claude Code session pattern is split: tracked CLAUDE.md carries generic project guidance, gitignored CLAUDE.local.md carries personal identifiers — see the top of CLAUDE.md for the contract.

Stay in touch / contribute

Project board — what's being worked on, what's blocked, what's on the roadmap. The single source of truth for active work.
Issues — file a bug, request a feature, or browse the open ones. New issues land in the board's Backlog and get triaged with a Priority field.
In-app feedback widget — if you're running an instance, the floating "Feedback" button on every page files a GitHub issue directly from the web UI (configure with a fine-grained PAT per docs/getting-started/configure.md).
Discussions — for "how do I..." or "have you considered..." threads that aren't bug reports yet.
Security disclosures — please don't file public issues for security-relevant bugs. See SECURITY.md for the private disclosure path.

This is a personal project, but contributions are welcome. The code is opinionated, the docs are written for an external reader trying it for the first time, and the pre-commit hook will block any PII you accidentally try to commit. Start at CONTRIBUTING.md — it covers dev setup, commit conventions, the migration-required label, and the architectural invariants the code enforces.

License

MIT.

Name		Name	Last commit message	Last commit date
Latest commit History 521 Commits
.github		.github
candidate_context		candidate_context
config		config
data		data
docs		docs
ops		ops
scripts		scripts
src/findajob		src/findajob
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.local.md.example		CLAUDE.local.md.example
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
README.md		README.md
SECURITY.md		SECURITY.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

findajob

What it does

Is this for you?

Roadmap

How it works

What you get out of it

Stack

Quick start

What you'll need

Deploy

First-run onboarding

Documentation

What it costs to run

Privacy

Stay in touch / contribute

License

About

Uh oh!

Releases 44

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

findajob

What it does

Is this for you?

Roadmap

How it works

What you get out of it

Stack

Quick start

What you'll need

Deploy

First-run onboarding

Documentation

What it costs to run

Privacy

Stay in touch / contribute

License

About

Topics

Resources

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 44

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages