It's fast for an API, but it's not the fastest speech-to-text. For a long while I hadn't done any research and simply trusted them. Then I tried Whisper and Picovoice. Cloud APIs can't come close to on-device latency. If latency is important, go with Whisper or Picovoice. If customization is also important, go with Picovoice.
Don't get me wrong, it's still faster than Amazon, Microsoft, or AssemblyAI.
Lastly, we will be using Deepgram Audio Diarization APIs to get speaker details from a sample audio clip. - Source: dev.to / 4 months ago
There are other AI-powered APIs out there to consider, too. For example, Deepgram can be used to transcribe audio (better than Whisper, offered by OpenAI), ElevenLabs can be used to generate speech from text (including using custom voices, which OpenAI's TTS can't currently do), etc. Depending on what you're trying to make, a combination of these services may be what you need. In any case, Python is going to be... Source: 5 months ago
This guide delves deep into the world of YouTube video summarization, harnessing the power of cutting-edge technologies including Deepgram for superior audio transcription, Langchain for harvesting the power of the LLM, and Mistral 7B, a state-of-the-art and open-source LLM. - Source: dev.to / 5 months ago
Historically it's been challenging to provide closed captioning for live experiences, be it a live interview, a sports game with commentary, or a livestream. But Deepgram's AI tooling has changed this, allowing users to easily convert realtime streams of audio into accurate transcripts. - Source: dev.to / 6 months ago
To deliver the best possible output, we’ve tested a wide range of AI models and supporting tools. Today, we’re producing industry-leading clinical notes text by combining Deepgram’s highly capable transcription models, ScienceIO’s healthcare-specific AI for processing medical data, and the Microsoft Azure OpenAI GPT-4 large language model. We’ve worked closely with all three teams to bring this new capability to... - Source: dev.to / 7 months ago
Paid API: If you're technical this is your best option. Personally I'm a fan of deepgram - https://deepgram.com/. They give you $200 in free credit too which in my case worked out to a couple hundred hours of transcription. Source: 10 months ago
2) It runs it through Whisper (an open-source tool that converts audio to text), https://github.com/openai/whisper?ysclid=lhqc2hzssa59016522, or sends it for external processing to an API such as https://deepgram.com/. Source: 12 months ago
I recently tried a number of options for streaming STT. Because my use case was very sensitive to latency, I ultimately went with https://deepgram.com/ - but https://github.com/ggerganov/whisper.cpp provided a great stepping stone while prototyping a streaming use case locally on a laptop. - Source: Hacker News / about 1 year ago
I’ve seen good things with deepgram for speech to text. Source: about 1 year ago
Deepgram is an AI Speech Platform that provides developers with a simple-to-use Speech-to-Text API. Source: about 1 year ago
I've been using Deepgram for videos that have no subtitles at all. You can generate a transcript and use something else to translate it. It's far from perfect and tends to skip words but if it's important I crosscheck it with YouTube's autosubs (this is where your tool will help a lot) and then translate. Source: over 1 year ago
AWS has a service called Transcribe that's good. Another possibility, a more sophisticated service, is https://deepgram.com/. Source: over 1 year ago
In terms of outsourcing, you could also try other services like https://deepgram.com/ or https://www.assemblyai.com/, to name just a few that focus on ASR, or services like https://elsaspeak.com/en/elsa-api/ and https://www.soapboxlabs.com/. Azure has a pronunciation service too. Source: over 1 year ago
When I came across the GitHub Community, this seemed like a good solution for what we're working on at Deepgram. And so the Deepgram Community was born along with accompanying discussions. - Source: dev.to / over 1 year ago
(The company is https://deepgram.com/, and yes we are hiring). Source: almost 2 years ago
As other answers mentioned, in-house STT is prohibitively costly most of the time, for a variety of reasons. You can check other STT offerings from companies like assembly.ai or deepgram.com that may cater for more specific needs like the ones you mentioned (customization for specific industries, better pricing). Source: almost 2 years ago
The Deepgram Hackathon took place from March 10-April 11, 2022. In partnership with our friends over at Deepgram, we were excited to host a fun and truly unique contest that challenged the community to write essays and case studies about Deepgram, build applications utilizing Deepgram, and cheer one another on. - Source: dev.to / almost 2 years ago
As part of making the call experience better for everyone, we have introduced the ability to add live captions to Daily domains with our startTranscription() instance method in partnership with Deepgram. - Source: dev.to / about 2 years ago
In this digital age where virtual conferences are a dime a dozen, we see a large number of them recorded for future records. There are many uses for these records, including sharing with people who were unable to attend live, distributing for use as training, and keeping backups for future reference. One aspect of these recordings that is taken for granted, however, is accessibility. In this blog, we will... - Source: dev.to / about 2 years ago
My submission is Quick, Guess!—a guessing game similar to Pictionary, but you do the guessing while the computer does all the drawing. The computer listens to and evaluates your guesses with the help of Deepgram's Speech-to-Text service and draws pictures using the Quick, Draw! dataset. - Source: dev.to / about 2 years ago
I have built a simple but useful solution using the Deepgram Python SDK and the open-source LanguageTool library. - Source: dev.to / about 2 years ago
This is an informative page about Deepgram. You can review and discuss the product here. The primary details have not been verified within the last quarter, and they might be outdated. If you think we are missing something, please use the means on this page to comment or suggest changes. All reviews and comments are highly encouraged and appreciated, as they help everyone in the community make an informed choice. Please always be kind and objective when evaluating a product and sharing your opinion.