Software Alternatives, Accelerators & Startups

DeepSpeech VS CMU Sphinx

Compare DeepSpeech VS CMU Sphinx and see what are their differences

DeepSpeech logo DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

CMU Sphinx logo CMU Sphinx

CMU Sphinx is a speaker-independent large vocabulary continuous speech recognizer released under...
Not present
  • CMU Sphinx Landing page
    Landing page //
    2022-12-17

DeepSpeech features and specs

No features have been listed yet.

CMU Sphinx features and specs

  • Open Source
    CMU Sphinx is free and open source, allowing developers to use, modify, and distribute the software without any licensing costs.
  • Offline Functionality
    CMU Sphinx can be used for offline speech recognition, making it suitable for applications where internet connectivity is unreliable or unavailable.
  • Flexible and Extensible
    CMU Sphinx provides a variety of tools and libraries that can be extended and customized for specific use cases, such as adapting it to recognize domain-specific vocabulary.
  • Multiple Language Support
    Supports various languages and accents, making it versatile for global applications.
  • Custom Models
    Allows the creation of custom acoustic and language models tailored to specific applications, thereby improving accuracy in specialized environments.

Possible disadvantages of CMU Sphinx

  • Accuracy
    CMU Sphinx often has lower recognition accuracy compared to more modern, deep learning-based speech recognition systems.
  • Complex Setup
    Setting up and configuring CMU Sphinx can be complex and requires a significant understanding of speech recognition technology.
  • Limited Community Support
    The user community and support for CMU Sphinx are not as large or active as those for some commercial or newer open-source alternatives.
  • Resource Intensive
    Running CMU Sphinx, especially with large custom models, can be resource-intensive, requiring significant CPU and memory resources.
  • Lagging Behind in Technology
    CMU Sphinx has not kept pace with recent advancements in speech recognition technology, particularly deep learning innovations employed by newer systems.

DeepSpeech videos

DeepSpeech | Speech to Text | Common Voice | Donate Your Voice

CMU Sphinx videos

Training CMU Sphinx Speech Recognition

Category Popularity

0-100% (relative to DeepSpeech and CMU Sphinx)
Knowledge Sharing
12 12%
88% 88
Speech Recognition And Processing
Knowledge Search
28 28%
72% 72
Tool
0 0%
100% 100

User comments

Share your experience with using DeepSpeech and CMU Sphinx. For example, how are they different and which one is better?
Log in or Post with

What are some alternatives?

When comparing DeepSpeech and CMU Sphinx, you can also consider the following products

Kaldi - Kaldi is a toolkit for speech recognition written in C++ and licensed under the Apache License v2.0.

Express Dictate Digital Dictation Software - Express Dictate software is a voice recording program that works like a dictaphone.

Word Online - Word Online, part of Office Online, is the online version of Microsoft Word.

LipSurf - "Siri for Chrome" Completely control the browser without your hands -- say "google...

ConstEdit - ConstEdit word processor is a Google Chrome web browser extension.

TextFromToSpeech - Free online speech recognition tool that will help you write text with your voice without typing.