swMATH ID: 21656
Software Authors: Lee A, Kawahara A, Shikano K
Description: Julius: an open source realtime large vocabulary recognition engine. ”Julius” is a high-performance, small-footprint large vocabulary continuous speech recognition (LVCSR) decoder software for speech-related researchers and developers. Based on word N-gram and context-dependent HMM, it can perform real-time decoding on various computers and devices from micro-computer to cloud server. The algorithm is based on 2-pass tree-trellis search, which fully incorporates major decoding techniques such as tree-organized lexicon, 1-best / word-pair context approximation, rank/score pruning, N-gram factoring, cross-word context dependency handling, enveloped beam search, Gaussian pruning, Gaussian selection, etc. Besides search efficiency, it is also modularized to be independent from model structures, and wide variety of HMM structures are supported such as shared-state triphones and tied-mixture models, with any number of mixtures, states, or phone sets. It also can run multi-instance recognition, running dictation, grammar-based recognition or isolated word recognition simultaneously in a single thread. Standard formats are adopted for the models to cope with other speech / language modeling toolkit such as HTK, SRILM, etc. Recent version also supports Deep Neural Network (DNN) based real-time decoding.
Homepage: https://github.com/julius-speech/julius
Source Code:  https://github.com/julius-speech/julius
Related Software: fairseq; Lingvo; Espresso; PyTorch-Kaldi; k2; VoxPopuli; ContextNet; Libri-Light; SentencePiece; SpecAugment; PyTorch Lightning; fastai; Keras; SciPy; Scikit; Transformers; QuartzNet; Jasper; wav2vec; SLURP
Cited in: 0 Documents