Skip to content

PaperCodex

Subscribe

Automatic Speech Recognition

Omnilingual ASR: Open-Source Speech Recognition for 1,600+ Languages—Including 500 Never Before Supported

Omnilingual ASR: Open-Source Speech Recognition for 1,600+ Languages—Including 500 Never Before Supported 2504

For decades, automatic speech recognition (ASR) has flourished in high-resource languages like English, Spanish, or Mandarin. But for the vast…

01/04/2026Automatic Speech Recognition, Multilingual Speech Processing, Zero-Shot Language Generalization
NeMo: Build Production-Grade Speech, LLM, and Multimodal AI Faster with NVIDIA’s Optimized Framework

NeMo: Build Production-Grade Speech, LLM, and Multimodal AI Faster with NVIDIA’s Optimized Framework 16305

NVIDIA NeMo is a cloud-native, open-source framework designed for developers, research engineers, and technical decision-makers who need to build, customize,…

12/27/2025Automatic Speech Recognition, Large Language Models, Multimodal Learning
FireRedASR: Industrial-Grade Mandarin Speech Recognition with SOTA Accuracy and LLM Integration

FireRedASR: Industrial-Grade Mandarin Speech Recognition with SOTA Accuracy and LLM Integration 1658

FireRedASR is an open-source, industrial-grade automatic speech recognition (ASR) system specifically engineered for Mandarin Chinese—but with strong capabilities in Chinese…

12/17/2025Automatic Speech Recognition, LLM-Integrated Speech Processing, Multilingual ASR
Copyright © 2026 PaperCodex.
  • Facebook
  • YouTube
  • Twitter

PaperCodex