Whisper Memos VS Silero VAD

Whisper Memos

Whisper Memos turns your ramblings into paragraphed articles, and emails them to you.

Silero VAD

Stellar quality. Highly portable. No strings attached. Supports 8 kHz and 16 kHz. Models are under one megabyte in size. Supports 30, 60 and 100 ms chunks. Trained on 100+ languages, generalizes well. One chunk takes about 1 ms on a single thread.
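
To make the sample-rate and chunk figures above concrete, here is a small sketch of how many audio samples each chunk length holds at each sample rate. It is plain arithmetic, not a call into any library; the name `samples_per_chunk` is hypothetical and used only for illustration.

```python
# Figures taken from the feature list above.
SAMPLE_RATES_HZ = (8000, 16000)
CHUNK_LENGTHS_MS = (30, 60, 100)


def samples_per_chunk(sample_rate_hz: int, chunk_ms: int) -> int:
    """Number of samples in one chunk (hypothetical helper)."""
    return sample_rate_hz * chunk_ms // 1000


for rate in SAMPLE_RATES_HZ:
    for ms in CHUNK_LENGTHS_MS:
        print(f"{rate} Hz, {ms} ms -> {samples_per_chunk(rate, ms)} samples")
```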

Whisper Memos

Website: whispermemos.com
Categories: #Audio #Audio Recording #Email Marketing #AI

Silero VAD

Website: github.com
Categories: #Internet Of Things #AI #GitHub #Knowledge Sharing

Category Popularity

0-100% (relative to Whisper Memos and Silero VAD)

Category: Whisper Memos / Silero VAD
AI: 40% / 60%
Audio: 100% / 0%
Knowledge Sharing: 44% / 56%
Transcription: 0% / 100%

Social recommendations and mentions

Based on our records, Silero VAD should be more popular than Whisper Memos. It has been mentioned 5 times since March 2021. We track product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.

Whisper Memos mentions (3)

What else can it do / what niche things do you use your watch for?
Taking notes by talking with Whisper Memos. Source: 5 months ago
OpenAI releases Whisper v3, new generation open source ASR model
I don't understand how a pop. 10M country - Czech Republic is among the best. And I can confirm - my app Whisper Memos (https://whispermemos.com) is very popular in Czech Republic. It makes perfect sense. Whisper is almost as good as transcribing Czech as English! - Source: Hacker News / 6 months ago
New models and developer products announced at OpenAI DevDay
Too bad they didn't upgrade Whisper API yet. Can't wait to make it available in https://whispermemos.com. - Source: Hacker News / 6 months ago

Silero VAD mentions (5)

New models and developer products announced at OpenAI DevDay
>How do you detect speech starting and stopping? https://github.com/snakers4/silero-vad. - Source: Hacker News / 6 months ago
[Discussion] Video Translation Task
You could look into https://github.com/guillaumekln/faster-whisper especially the VAD section (Voice Activity Detector) using https://github.com/snakers4/silero-vad. Source: 10 months ago
Using Whisper to transcribe the entire Forensic Files series
I also had the same synchronization issue, so I wrote a WebUI/CLI that uses Silero-VAD that first splits the audio whenever there a silent portion (or every 30 seconds), and I haven't experienced it since:. Source: 11 months ago
Whisper - A new free AI model from OpenAI that can transcribe Japanese (and many other languages) at up to "human level" accuracy
By the way, I've updated the WebUI to now also support using Silero VAD to break up the audio into distinct sections, and run Whisper on each section and then combine them into one single transcript/SRT file. Source: over 1 year ago
Whisper - A new free AI model from OpenAI that can transcribe Japanese (and many other languages) at up to "human level" accuracy
And while googling this, I stumbled upon this discussion on the Whisper GitHub repository, which seems to suggest that the issue is that the current VAD (Voice Activity Detection) is quite poor, and that it can be resolved by using another VAD (like silero-vad). This might be something I want to add to my WebUI in the future. Source: over 1 year ago
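
Several of the mentions above describe the same workflow: split the audio at silent portions (or every 30 seconds), transcribe each section, then combine the results. A minimal sketch of the splitting step follows, assuming a voice activity detector has already produced speech intervals in seconds. The `plan_sections` function and the sample intervals are hypothetical; this is not the code of Silero VAD or of the WebUI mentioned by the commenters.

```python
from typing import List, Tuple

Interval = Tuple[float, float]  # (start_seconds, end_seconds)


def plan_sections(speech: List[Interval], max_len: float = 30.0) -> List[Interval]:
    """Turn detected speech intervals into sections of at most max_len seconds.

    Silence between intervals is dropped; any interval longer than
    max_len is cut into consecutive pieces.
    """
    sections: List[Interval] = []
    for start, end in speech:
        cursor = start
        # Cut off full-length pieces while more than max_len remains.
        while end - cursor > max_len:
            sections.append((cursor, cursor + max_len))
            cursor += max_len
        # Keep whatever is left of this speech interval.
        if end > cursor:
            sections.append((cursor, end))
    return sections


# Assumed detector output: two speech runs separated by two seconds of silence.
speech = [(0.0, 12.0), (14.0, 80.0)]
for start, end in plan_sections(speech):
    print(f"transcribe {start:.1f}s to {end:.1f}s")
```

Each planned section would then be passed to the transcriber separately, with its start time added back to the resulting timestamps before the pieces are merged into one transcript.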