kdrkdrkdr/README.md

Hi 👋, I'm Gyeongmin Kim (kdr)

Software engineer working on speech processing, on-device model optimization, and real-time audio systems. Co-founder of El Nino, building Knoc, a real-time translation subtitle service.


📝 Publications

  • Extracting Voice Styles from Frozen TTS Models via Gradient-Based Inverse Optimization. Gyeongmin Kim. Preprint, Apr 2026. To be submitted to ICASSP 2027 / Interspeech 2027. [DOI] [supertonic.embed] [kokoro.embed]

💼 Experience

  • Yonsei University Health System (YUHS) (Seoul, South Korea) - Research Engineer (Mar 2025 – Oct 2025): Led development of the NGS clinical report pipeline and the SICU false-alarm monitoring desktop app.
  • NCSOFT (Seongnam, South Korea) - Audio Data Engineer, Contract (Jul 2024 – Jan 2025): Built end-to-end audio post-processing automation for TTS data pipelines.
  • Axcellworks (Japan, Remote) - Speech Engineer, Freelance (Nov 2023 – Feb 2024): Improved the intelligibility of a real-time voice changer with a chunk-merging algorithm.
  • Taiyaki Studios (USA, Remote) - AI Engineer, Contract (Jan 2023 – Jul 2023): Built a complete TTS training toolkit and production inference pipeline.

🚀 Featured Projects

On-Device Speech Model Optimization (C / WebAssembly)

  • MossTTS-Nano.c - 100M TTS model rewritten in pure C. NEON/SSE SIMD, KV cache, and pthread parallelism give a 30× speedup (68 s → 2.3 s), 1.8× faster than PyTorch CPU, RTF 0.33.
  • DeepFilterNet3.c.wasm - Noise-reduction model in pure C/WASM. ~1 ms on MacBook M2, ~4 ms on Galaxy S23.
  • fastenhancer.c.wasm - Audio enhancement in pure C. 546× size reduction (183 KB), mobile RTF 0.28.
  • LILAC - Zero-shot real-time voice conversion from 3 s of audio. OpenVoice v2 ported from PyTorch to pure C with a streaming HiFi-GAN decoder, RNNoise SIMD, and a 2-thread SPSC audio pipeline. RTF 0.7–0.8 on CPU.
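
For readers unfamiliar with the metrics above: real-time factor (RTF) is processing time divided by audio duration, so anything below 1.0 runs faster than real time. The short Python sketch below recomputes the MossTTS-Nano.c numbers from the figures quoted in the list. The function names are illustrative, and it assumes (this is not stated in the project notes) that the 2.3 s run is the same run measured at RTF 0.33, which would imply roughly 7 s of synthesized audio.

```python
def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """RTF = processing time / audio duration; below 1.0 is faster than real time."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


def speedup(before_seconds: float, after_seconds: float) -> float:
    """How many times faster the optimized run is than the baseline."""
    if after_seconds <= 0:
        raise ValueError("after_seconds must be positive")
    return before_seconds / after_seconds


# Figures quoted for MossTTS-Nano.c: 68 s before, 2.3 s after, RTF 0.33.
mosstts_speedup = speedup(68.0, 2.3)

# Assumption: the 2.3 s run is the one reported at RTF 0.33.
# Under that assumption the utterance length follows from the RTF definition.
implied_audio_seconds = 2.3 / 0.33

print(f"speedup: {mosstts_speedup:.1f}x")
print(f"implied audio length: {implied_audio_seconds:.1f} s")
print(f"RTF check: {real_time_factor(2.3, implied_audio_seconds):.2f}")
```

The computed speedup comes out just under 30, which matches the rounded 30× figure in the list.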

Multilingual TTS & Voice Conversion

  • JA2ML-VITS - Multilingual TTS that produces speech in 19 languages from Japanese-only datasets.
  • JK-VITS - Korean/Japanese bilingual TTS.
  • RVC-VITS - Voice-conversion-based dataset augmentation and TTS training pipeline using RVC.
  • ProsekaTTS - Character TTS service. 2.1M+ visitors (as of Feb 2026).
  • ShirokoTTS - First-ever Blue Archive Shiroko TTS.

G2P Packages (PyPI)

  • g2pk3 - Korean/Japanese/English → Korean pronunciation.
  • ko2kana - Korean/English pronunciation → Katakana.

Japanese Translation Tools

  • novel-reader - Android app translating novels from 7 Japanese sites with a custom proper-noun dictionary system.
  • EhndWebTranslate - Async Japanese web page translator with real-time/document/novel modes.
  • UserDict4Papago - Proper-noun dictionary overlay for Papago KR-JP translation.

🌐 Open Source Contributions

🎓 Education

  • Hanyang University, Seoul, South Korea - B.S. in Computer Science (Mar 2023 – Present, on leave of absence since Jul 2024)

Pinned

  1. supertonic.embed (Public)

    Extract voice style embeddings from any WAV for SupertonicTTS, with no style encoder needed.

    Python · 18 stars · 4 forks

  2. ProsekaTTS (Public)

    Web demo for Project Sekai TTS

    Python · 28 stars · 8 forks

  3. lilac (Public)

    ✨ Real-time voice changer that runs on CPU and needs only 3+ seconds of audio for a custom voice

    C · 20 stars · 3 forks

  4. RVC-VITS (Public archive)

    Few-shot multilingual TTS with RVC and VITS

    Python · 50 stars · 6 forks

  5. JK-VITS (Public)

    Bilingual TTS (Japanese and Korean)

    Jupyter Notebook · 32 stars · 6 forks

  6. JA2ML-VITS (Public)

    Multilingual TTS trained from a Japanese-only dataset

    Jupyter Notebook · 5 stars