speechie

speech to text thing that runs on my home server.

setup

create a .env file (OpenAI compatible API required):

LLM_API_KEY=abc123
LLM_BASE_URL=https://api.example.com/v1/chat/completions
LLM_MODEL=llama-3.1-8b-instant

start the server:

uv run uvicorn main:app --host 0.0.0.0 --port 8000 --env-file .env

curl -X POST "http://localhost:8000/transcribe" \
    -H "Content-Type: multipart/form-data" \
    -F "file=@/path/to/audio.wav"

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
asr.py		asr.py
llm.py		llm.py
main.py		main.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock