
Kaldi ASR
Yack.net
Spok Speech Solutions
Sensory
Speechmatics
Deepgram
LumenVox ASR
Hidden Markov Model Toolkit
Vim Python IDE
Kaldi ASR
Vim Python IDEBased on our record, Kaldi ASR seems to be more popular. It has been mentiond 3 times since March 2021. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
Additionally, C++ may be used for extremely high levels of optimization even for cloud-based ML. Dlib and Kaldi are C++ libraries used as dependencies in Python codebases for computer vision and audio processing, for example. So if your application requires you to customize any functions similar to those libraries, then you'll need C++ knowhow. Source: over 3 years ago
I'm not how sure it stacks with recent state of art, but Kaldi toolkit (https://github.com/kaldi-asr/kaldi) used to be popular for building all kinds of practical integrations and experiments for speech recognition. Source: about 4 years ago
Vosk-api isn't an SST engine itself, it is built using the Kaldi speech recognition toolkit (https://github.com/kaldi-asr/kaldi) and nicely implements and packages an API for Kaldi chain/LF-MMI models. - Source: Hacker News / over 4 years ago
Yack.net - Recorded and transcribed calls for team collaboration
Spok Speech Solutions - Spok Speech Solutions allows organization to process routine phone requests such as transfers, directory assistance, messaging, and paging without live operators, letting to manage call volumes, operator workloads, and keeping calls from dropping.
Sensory - Sensory provides accurate, low-cost embedded voice and biometric AI. Sensoryโs technologies have shipped in over a billion units of consumer products.
Speechmatics - The most accurate and inclusive speech-to-text API ever released.
Deepgram - Search engine for speech
LumenVox ASR - LumenVox Automated Speech Recognizer (ASR) is a software solution that converts spoken audio into text, providing users with a more efficient means of input.