No Silero VAD videos yet. You could help us improve this page by suggesting one.
Based on our record, MacWhisper should be more popular than Silero VAD. It has been mentiond 24 times since March 2021. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
>How do you detect speech starting and stopping? https://github.com/snakers4/silero-vad. - Source: Hacker News / 6 months ago
You could look into https://github.com/guillaumekln/faster-whisper especially the VAD section (Voice Activity Detector) using https://github.com/snakers4/silero-vad. Source: 10 months ago
I also had the same synchronization issue, so I wrote a WebUI/CLI that uses Silero-VAD that first splits the audio whenever there a silent portion (or every 30 seconds), and I haven't experienced it since:. Source: 12 months ago
By the way, I've updated the WebUI to now also support using Silero VAD to break up the audio into distinct sections, and run Whisper on each section and then combine them into one single transcript/SRT file. Source: over 1 year ago
And while googling this, I stumbled upon this discussion on the Whisper GitHub repository, which seems to suggest that the issue is that the current VAD (Voice Activity Detection) is quite poor, and that it can be resolved by using another VAD (like silero-vad). This might be something I want to add to my WebUI in the future. Source: over 1 year ago
I've really enjoyed this macOS Whisper GUI[1]. It doesn't use MLX, but does use Metal. 1. https://goodsnooze.gumroad.com/l/macwhisper. - Source: Hacker News / 5 months ago
Finding a OOS business model is non-trivial. Maybe you should talk to https://goodsnooze.gumroad.com/l/macwhisper to get some inspiration? People are paying for convenience. As for the technology itself: the B2B market is super-super early and I understand everybody is in goldrush mode, however 98% of all startups will not survive the next 3-5 years. From the demand site: Companies are still sleeping, you can see... - Source: Hacker News / 6 months ago
Because MacWhisper is free for the tiny and base models and the paid version only costs $25 - https://goodsnooze.gumroad.com/l/macwhisper. - Source: Hacker News / 9 months ago
What was the fine tune? How does this compare to what is possible using https://goodsnooze.gumroad.com/l/macwhisper for example? Thanks! - Source: Hacker News / 9 months ago
You can use Whisper to transcribe the audio to text locally on the mac. You have a great Open-source implementation named whisper.cpp and a few graphical user interfaces for it: https://github.com/ggerganov/whisper.cpp https://goodsnooze.gumroad.com/l/macwhisper Personally I use MacWhisper pro because it’s very convenient. - Source: Hacker News / 10 months ago
Whisper Memos - Whisper Memos turns your ramblings into paragraphed articles, and emails them to you.
Whisper.sh - Whisper is the best place to express yourself online. Connect with likeminded individuals and discover the unseen world around you.
The Parodist App - Super-realistic celebs' voices made by AI
AudioPen - The easiest way to convert messy thoughts into clear text
WOMBO - Make your selfies sing
Otter Voice Notes - Remember, search, and share your voice conversations