Software engineer working on speech processing, on-device model optimization, and real-time audio systems. Co-founder of El Nino, building Knoc β a real-time translation subtitle service.
- π Portfolio / CV
- π’ Team El Nino
- π€ Hugging Face
- π« kdrkdrkdr@hanyang.ac.kr
- Extracting Voice Styles from Frozen TTS Models via Gradient-Based Inverse Optimization Gyeongmin Kim. Preprint, Apr 2026. To be submitted to ICASSP 2027 / Interspeech 2027. [DOI] [supertonic.embed] [kokoro.embed]
- Yonsei University Health System (YUHS) (Seoul, South Korea) β Research Engineer (Mar 2025 β Oct 2025) Led dev for NGS clinical report pipeline & SICU false-alarm monitoring desktop app.
- NCSOFT (Seongnam, South Korea) β Audio Data Engineer, Contract (Jul 2024 β Jan 2025) Built end-to-end audio post-processing automation for TTS data pipelines.
- Axcellworks (Japan, Remote) β Speech Engineer, Freelance (Nov 2023 β Feb 2024) Improved real-time Voice Changer intelligibility via chunk merging algorithm.
- Taiyaki Studios (USA, Remote) β AI Engineer, Contract (Jan 2023 β Jul 2023) Built a complete TTS training toolkit and production inference pipeline.
On-Device Speech Model Optimization (C / WebAssembly)
- MossTTS-Nano.c β 100M TTS model rewritten in pure C. NEON/SSE SIMD, KV cache, pthread parallelism β 30Γ speedup (68s β 2.3s), 1.8Γ faster than PyTorch CPU, RTF 0.33.
- DeepFilterNet3.c.wasm β Noise-reduction model in pure C/WASM. ~1 ms on MacBook M2, ~4 ms on Galaxy S23.
- fastenhancer.c.wasm β Audio enhancement in pure C. 546Γ size reduction (183 KB), mobile RTF 0.28.
- LILAC β Zero-shot real-time voice conversion from 3s audio. OpenVoice v2 ported from PyTorch to pure C with streaming HiFi-GAN decoder, RNNoise SIMD, 2-thread SPSC audio pipeline. RTF 0.7β0.8 on CPU.
Multilingual TTS & Voice Conversion
- JA2ML-VITS β Multilingual TTS inducing 19-language speech from Japanese-only datasets.
- JK-VITS β Korean/Japanese bilingual TTS.
- RVC-VITS β Voice-conversion-based dataset augmentation and TTS training pipeline using RVC.
- ProsekaTTS β Character TTS service. 2.1M+ visitors (Feb 2026).
- ShirokoTTS β First-ever Blue Archive Shiroko TTS.
G2P Packages (PyPI)
- g2pk3 β Korean/Japanese/English β Korean pronunciation.
- ko2kana β Korean/English pronunciation β Katakana.
Japanese Translation Tools
- novel-reader β Android app translating novels from 7 Japanese sites with a custom proper-noun dictionary system.
- EhndWebTranslate β Async Japanese web page translator with real-time/document/novel modes.
- UserDict4Papago β Proper-noun dictionary overlay for Papago KR-JP translation.
- VOICEVOX Engine β Added Korean speech support by mapping Japanese phonemes to Korean phonemes.
- Versatile Audio Super Resolution β Contributed silence-removal utility for upsampled audio.
- Manga Image Translator β Maintained translation module for image translation web service.
- Hanyang University, Seoul, South Korea β B.S. in Computer Science (Mar 2023 β Present, Leave of Absence since Jul 2024)


