End-to-end speech recognition large model: 31 languages, dialects, accents, lyrics, hotwords, timestamps, speaker diarization. Trained on tens of millions of hours.
pytorch speech-recognition speech-to-text transcription asr speaker-diarization chinese-dialects real-time-asr audio-language-model multilingual-asr fun-asr whisper-alternative 31-languages llm-asr
-
Updated
May 26, 2026 - Python