Skip to content
View ekhodzitsky's full-sized avatar
  • Russia, Stavropol

Block or report ekhodzitsky

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ekhodzitsky/README.md

Senior Backend · On-Device Speech AI · Agentic Systems


About

Senior backend engineer — 8+ years in production.

  • Node.js / TypeScript (NestJS, Moleculer) — REST APIs from scratch, API gateways, message brokers, payment & telephony integrations, legacy refactors at ~20k LOC scale.
  • Python (FastAPI, Django) — email pipelines on APScheduler + SMTP, document generation (PDF/DOCX) at template-set scale.
  • Rust — current focus: on-device speech AI and agentic developer tools.

Local-first by default: zero cloud APIs, zero vendor lock-in, models that ship inside the binary.

Currently shipping: oh-my-kimi — wire-first Rust agent runtime · localmt — offline Android translation SDK targeting Redmi Note 14 airplane-mode proof.



🚀 Flagship — gigastt

Russian speech recognition on M1 in 700 ms. 11.4% WER. One binary. No cloud.

cargo install gigastt && gigastt serve
# WebSocket: ws://127.0.0.1:9876/v1/ws
# REST API:  http://127.0.0.1:9876/v1/transcribe

GigaAM v3 + ONNX Runtime · INT8 with 0% accuracy loss · WebSocket streaming + REST · Homebrew tap · CoreML / CUDA / CPU · v2.0.3


Open source

Speech AI — Rust + ONNX Runtime, fully offline

  • polyvoice — Speaker diarization without Python · DER ~14% VoxConverse, ~23% AMI · 10× realtime on CPU, ~80% of pyannote accuracy
  • phonex — Generic on-device STT engine · 10+ languages, ~70 ms / 5 s clip, single binary
  • nihostt — Japanese STT (ReazonSpeech-k2-v2) · CER ~1.1% (clean) / 8% (full 309-clip bench) · ~200 ms latency, INT8 ~155 MB
  • phostt — Vietnamese STT (Zipformer-vi RNN-T) · ~75 MB model · crates.io + pypi
  • localmt — Offline Android translation SDK · GGUF + llama.cpp via JNI, arm64-v8a target

Agentic developer tools — Kimi ecosystem

  • oh-my-kimi — Wire-first orchestration for Kimi CLI · scheduler-backed teams, ownership conflict detection, verification gates, proof/failure artifacts
  • cargo-kimi — Cargo subcommand that scores Rust files 0–100 on contract quality (Hoare triples, panic safety, typestate, size, Result discipline) with LSP server · on crates.io
  • kimi-guidelines — Composable configs, instructions, and skills for Kimi K2.6

Telegram Email

Pinned Loading

  1. gigastt gigastt Public

    Local STT server powered by GigaAM v3.

    Rust 6 1

  2. oh-my-kimi oh-my-kimi Public

    Local, Wire-first, proof-driven orchestration for Kimi CLI: scheduler-backed teams, verification gates, run timelines, and proof reports.

    Rust

  3. polyvoice polyvoice Public

    Speaker diarization for Rust — who spoke when, without Python. Silero VAD + WeSpeaker + AHC in a single Pipeline::run() call.

    Rust

  4. cargo-kimi cargo-kimi Public

    Cargo subcommand for structured contracts, Hoare triples, and Rust verification

    Rust

  5. phonex phonex Public

    Generic on-device speech-to-text. Local inference, no cloud APIs, full privacy. Rust + ONNX Runtime.

    Rust

  6. localmt localmt Public

    Offline Android translation SDK in Rust: verified GGUF/Hy-MT model packs, llama.cpp FFI/JNI, no network, no text leaves the phone.

    Rust