Silero VAD might be a bit more popular than Typing Mind. We know about 5 links to it since March 2021 and only 4 links to Typing Mind. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
>How do you detect speech starting and stopping? https://github.com/snakers4/silero-vad. - Source: Hacker News / 7 months ago
You could look into https://github.com/guillaumekln/faster-whisper especially the VAD section (Voice Activity Detector) using https://github.com/snakers4/silero-vad. Source: 11 months ago
I also had the same synchronization issue, so I wrote a WebUI/CLI that uses Silero-VAD that first splits the audio whenever there a silent portion (or every 30 seconds), and I haven't experienced it since:. Source: 12 months ago
By the way, I've updated the WebUI to now also support using Silero VAD to break up the audio into distinct sections, and run Whisper on each section and then combine them into one single transcript/SRT file. Source: over 1 year ago
And while googling this, I stumbled upon this discussion on the Whisper GitHub repository, which seems to suggest that the issue is that the current VAD (Voice Activity Detection) is quite poor, and that it can be resolved by using another VAD (like silero-vad). This might be something I want to add to my WebUI in the future. Source: over 1 year ago
Have you tried using typindmind.com? The interface is amazing but I find the quality of the answers it provides to not be as detailed as ChatGPT. Which I find strange. Source: 11 months ago
In comparison, typingmind.com charges you and chatfriday.com is not open source. Source: about 1 year ago
You mean like a frontend for it? I use https://typingmind.com/, it's pretty nifty. I've since upgraded to plus for GPT-4 so I don't use it as much, but the UI is actually better than the ChatGPT. Source: about 1 year ago
Can use your api key with typingmind.com or chatfriday.com. Source: about 1 year ago
The Parodist App - Super-realistic celebs' voices made by AI
ChatGPT - ChatGPT is a powerful, open-source language model.
Whisper Memos - Whisper Memos turns your ramblings into paragraphed articles, and emails them to you.
DapperGPT - Better UI for ChatGPT with Customize Chat, Notes & Extension
Replica - Simple way for save articles, stories and web pages for reading: offline, organized and clean...
Writesonic - If you’ve ever been stuck for words or experienced writer’s block when it comes to coming up with copy, you know how frustrating it is.