Software Alternatives, Accelerators & Startups

Silero VAD VS Descript

Compare Silero VAD and Descript and see how they differ

Silero VAD

Stellar quality. Highly portable. No strings attached. Supports 8 kHz and 16 kHz. Models < one megabyte in size. Supports 30, 60 and 100 ms chunks. Trained on 100+ languages, generalizes well. One chunk ~ 1 ms on a single thread.
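To make the chunk sizes concrete, here is a minimal sketch that converts the listed chunk durations into sample counts at each listed sampling rate. The helper name `chunk_samples` is hypothetical and not part of the Silero VAD API; it only illustrates the arithmetic.

```python
# Hypothetical helper, not part of Silero VAD: converts a chunk duration
# into the number of audio samples it spans at a given sampling rate.
SAMPLE_RATES_HZ = (8000, 16000)  # sampling rates named in the description
CHUNK_MS = (30, 60, 100)         # chunk durations named in the description


def chunk_samples(sample_rate_hz: int, chunk_ms: int) -> int:
    """Return the number of samples in one chunk."""
    return sample_rate_hz * chunk_ms // 1000


if __name__ == "__main__":
    for rate in SAMPLE_RATES_HZ:
        for ms in CHUNK_MS:
            print(f"{rate} Hz, {ms} ms -> {chunk_samples(rate, ms)} samples")
```

For example, a 30 ms chunk at 16 kHz spans 480 samples, and a 100 ms chunk at 8 kHz spans 800 samples.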

Descript

Text-based audio editor and automated transcription
  • Silero VAD landing page (2023-09-21)
  • Descript landing page (2023-10-20)

Silero VAD videos

No Silero VAD videos yet.

Descript videos

Descript - Hands On: Ultimate Podcast / YouTube Editor

More videos:

  • Review - Descript's Podcast Studio launches: we try it out
  • Review - Audio Editing with Descript Software
  • Demo - Introducing Descript

Category Popularity

0-100% (relative to Silero VAD and Descript)

Category         Silero VAD   Descript
AI               11%          89%
Transcription    0%           100%
Text To Speech   100%         0%
Productivity     0%           100%

Social recommendations and mentions

Based on our records, Descript appears to be more popular than Silero VAD. It has been mentioned 12 times since March 2021. We track product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.

Silero VAD mentions (5)

  • New models and developer products announced at OpenAI DevDay
    >How do you detect speech starting and stopping? https://github.com/snakers4/silero-vad. - Source: Hacker News / 7 months ago
  • [Discussion] Video Translation Task
    You could look into https://github.com/guillaumekln/faster-whisper especially the VAD section (Voice Activity Detector) using https://github.com/snakers4/silero-vad. Source: 11 months ago
  • Using Whisper to transcribe the entire Forensic Files series
    I also had the same synchronization issue, so I wrote a WebUI/CLI that uses Silero-VAD that first splits the audio whenever there's a silent portion (or every 30 seconds), and I haven't experienced it since. Source: about 1 year ago
  • Whisper - A new free AI model from OpenAI that can transcribe Japanese (and many other languages) at up to "human level" accuracy
    By the way, I've updated the WebUI to now also support using Silero VAD to break up the audio into distinct sections, and run Whisper on each section and then combine them into one single transcript/SRT file. Source: over 1 year ago
  • Whisper - A new free AI model from OpenAI that can transcribe Japanese (and many other languages) at up to "human level" accuracy
    And while googling this, I stumbled upon this discussion on the Whisper GitHub repository, which seems to suggest that the issue is that the current VAD (Voice Activity Detection) is quite poor, and that it can be resolved by using another VAD (like silero-vad). This might be something I want to add to my WebUI in the future. Source: over 1 year ago
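
The workflow described in the mentions above (split the audio at silences or every 30 seconds, transcribe each section, then combine the results into one SRT file) can be sketched roughly as follows. This is an illustrative outline, not the code of the WebUI mentioned: the speech segments are assumed to come from a VAD such as Silero VAD as (start, end) pairs in seconds, and `transcribe` is a hypothetical placeholder for a call to a speech-to-text model such as Whisper.

```python
from typing import Callable, List, Tuple

Segment = Tuple[float, float]  # (start_seconds, end_seconds)


def split_sections(speech: List[Segment], max_len: float = 30.0) -> List[Segment]:
    """Cut each speech segment into sections of at most max_len seconds.

    The gaps between input segments are the silences found by the VAD,
    so sections never span a silent portion.
    """
    sections: List[Segment] = []
    for start, end in speech:
        while end - start > max_len:
            sections.append((start, start + max_len))
            start += max_len
        if end > start:
            sections.append((start, end))
    return sections


def srt_time(seconds: float) -> str:
    """Format seconds as an SRT timestamp: HH:MM:SS,mmm."""
    ms = int(round(seconds * 1000))
    h, ms = divmod(ms, 3_600_000)
    m, ms = divmod(ms, 60_000)
    s, ms = divmod(ms, 1000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"


def to_srt(sections: List[Segment],
           transcribe: Callable[[float, float], str]) -> str:
    """Transcribe each section and join the results into one SRT document."""
    blocks = []
    for index, (start, end) in enumerate(sections, start=1):
        text = transcribe(start, end)  # placeholder for the real model call
        blocks.append(f"{index}\n{srt_time(start)} --> {srt_time(end)}\n{text}\n")
    return "\n".join(blocks)
```

A caller would pass the VAD output to `split_sections` and then hand the resulting sections, together with a function that transcribes a given time range, to `to_srt`.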

Descript mentions (12)

  • how do you handle your podcasts transcription and timestamps
    For transcripts, I use Descript. Descript is able to identify all four of our panel members, and I usually spend an hour or so cleaning it up and setting the transcript into a video for YouTube. Source: about 1 year ago
  • looking for video editor that auto cuts conversations
    I don't understand exactly what you are trying to do, but I'm pretty sure Descript can do what you want. Source: over 1 year ago
  • What is your preferred way to make a voice-over?
    I tried to use descript.com but found out that they didn't have a download for Linux and that their online version doesn't allow you to edit your transcript. Source: over 1 year ago
  • Needing some assistances
    Edit your audio with software like Descript or Audacity. Source: about 2 years ago
  • Video snippit of your episode text animation recommendations
    Looks like an 'audiogram' from descript.com - you can make them on their paid service. Source: about 2 years ago

What are some alternatives?

When comparing Silero VAD and Descript, you can also consider the following products

The Parodist App - Super-realistic celebs' voices made by AI

HappyScribe - Happy Scribe automatically transcribes your interviews

Whisper Memos - Whisper Memos turns your ramblings into paragraphed articles, and emails them to you.

Trint - Transcribe spoken words from your video & audio files

Replica - Simple way to save articles, stories and web pages for reading: offline, organized and clean...

Sonix - Automatically convert audio & video to text in minutes