ExKaldi-RT swMATH ID: 38118 Software Authors: Yu Wang, Chee Siang Leow, Akio Kobayashi, Takehito Utsuro, Hiromitsu Nishizaki Description: ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of Kaldi. The availability of open-source software is playing a remarkable role in automatic speech recognition (ASR). Kaldi, for instance, is widely used to develop state-of-the-art offline and online ASR systems. This paper describes the ”ExKaldi-RT,” online ASR toolkit implemented based on Kaldi and Python language. ExKaldi-RT provides tools for providing a real-time audio stream pipeline, extracting acoustic features, transmitting packets with a remote connection, estimating acoustic probabilities with a neural network, and online decoding. While similar functions are available built on Kaldi, a key feature of ExKaldi-RT is completely working on Python language, which has an easy-to-use interface for online ASR system developers to exploit original research, for example, by applying neural network-based signal processing and acoustic model trained with deep learning frameworks. We performed benchmark experiments on the minimum LibriSpeech corpus, and showed that ExKaldi-RT could achieve competitive ASR performance in real-time. Homepage: https://arxiv.org/abs/2104.01384 Keywords: Audio; Speech Processing; arXiv_eess.AS; arXiv_cs.CL; Real-Time; Automatic Speech Recognition; ASR; Kaldi; deep learning; Kaldi; Python Related Software: TensorFlow; Jasper; PyTorch; PyKaldi; LibriSpeech; GStreamer; ExKaldi; PyTorch-Kaldi; Kaldi; Python Cited in: 0 Publications Standard Articles 1 Publication describing the Software Year ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of Kaldi Yu Wang, Chee Siang Leow, Akio Kobayashi, Takehito Utsuro, Hiromitsu Nishizaki 2021