Silero VAD VS Sonix

Silero VAD

Stellar quality.Highly portable.No strings attached.Supports 8 kHz and 16 kHz.Models < one megabyte in size.Supports 30, 60 and 100 ms chunks.Trained on 100+ languages, generalizes well.One chunk ~ 1ms on a single thread.

Sonix

Automatically convert audio & video to text in minutes

Landing page //
2023-09-21

Landing page //
2023-01-31

Silero VAD

Website: github.com
Pricing URL: -

Edit details

Sonix

Website: sonix.ai
Pricing URL: Official Sonix Pricing

Edit details

Silero VAD videos

No Silero VAD videos yet. You could help us improve this page by suggesting one.

+ Add video

Sonix videos

+ Add

Review of Sonix.ai audio transcription service

Category Popularity

0-100% (relative to Silero VAD and Sonix)

Silero VAD

Sonix

AI

17 17%

AI

83% 83

Transcription

2 2%

Transcription

98% 98

Text To Speech

100 100%

Text To Speech

0% 0

Audio Transcription

0 0%

Audio Transcription

100% 100

User comments

Share your experience with using Silero VAD and Sonix. For example, how are they different and which one is better?

Social recommendations and mentions

Based on our record, Sonix should be more popular than Silero VAD. It has been mentiond 11 times since March 2021. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.

Silero VAD mentions (5)

New models and developer products announced at OpenAI DevDay
>How do you detect speech starting and stopping? https://github.com/snakers4/silero-vad. - Source: Hacker News / 7 months ago
[Discussion] Video Translation Task
You could look into https://github.com/guillaumekln/faster-whisper especially the VAD section (Voice Activity Detector) using https://github.com/snakers4/silero-vad. Source: 11 months ago
Using Whisper to transcribe the entire Forensic Files series
I also had the same synchronization issue, so I wrote a WebUI/CLI that uses Silero-VAD that first splits the audio whenever there a silent portion (or every 30 seconds), and I haven't experienced it since:. Source: 12 months ago
Whisper - A new free AI model from OpenAI that can transcribe Japanese (and many other languages) at up to "human level" accuracy
By the way, I've updated the WebUI to now also support using Silero VAD to break up the audio into distinct sections, and run Whisper on each section and then combine them into one single transcript/SRT file. Source: over 1 year ago
Whisper - A new free AI model from OpenAI that can transcribe Japanese (and many other languages) at up to "human level" accuracy
And while googling this, I stumbled upon this discussion on the Whisper GitHub repository, which seems to suggest that the issue is that the current VAD (Voice Activity Detection) is quite poor, and that it can be resolved by using another VAD (like silero-vad). This might be something I want to add to my WebUI in the future. Source: over 1 year ago

Sonix mentions (11)

Help with the translation of a horror story on youtube
There's dozens of tools out there for this these days. I'd recommend sonix.ai they give you 30 minutes free. Source: 11 months ago
What is the quickest/most practical way to make podcast clips? Selecting clips, trimming, exporting etc etc.
Do you have a budget? If so, there's this tool I've worked with called Sonix that generates transcripts of what you feed into it. It's not super accurate, but it's good enough. One of the features is that you can "highlight" chunks of text, and have it spit out an XML that will have a sequence containing only the highlighted text. Source: about 1 year ago
Interview with Kim Golden - Victim of 03/01/2023 Pit Bull Attack - Links in comments
Sonix was the one I used because it had 30 free minutes and the video was only 10-11 minutes long. It seems to have done a really decent job, but not sure if that's because the source audio is pretty clear. Source: about 1 year ago
What do you use to transcribe your data?
Sonix.ai does many languages and is quite good. Source: over 1 year ago
Is there any languages you wish you could learn, but are discouraged by the lack of resources for?
I am struggling with this as well, but one good tool for me has been sonix.ai, which can transcribe pretty well (posted a little while ago about it). Source: about 2 years ago