Poe might be a bit more popular than Silero VAD. We know about 7 links to it since March 2021 and only 5 links to Silero VAD. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
Poe, is a tool that allows you to try lots of different LLMs like ChatGPT, Anthropic, LLama and more. It is a great sandbox to get started and see the difference in the tools. Note; not all the interface/features are available all the time. In ChatGPT 4.0 ability to search the web or generate an image from in-line hasn't yet been updated. Still, it's not a bad tool to try everything. [via Poe]. Source: 7 months ago
Now, the placement of the "Definition (Advanced)" section is less clear. Given its large size (up to 32000 characters), it's doubtful that it's part of the initial prompt. It's more likely that it's maintained as a separate reference knowledge base. This is a feature that other AI chatbots, like Poe's bots, can also utilize. You can visit the Poe website here to check it out. Source: 7 months ago
I saw a post earlier this week saying that they feel the outputs from ChatGPT have been declining, and a lot of people agreed. There are a good amount of quality AI chat alternatives out there besides ChatGPT and some even offer GPT-4 for free! Here's a list of alternative chatbots to try out (I've tried all of these not some bs list): Perplexity: "The first conversational search engine" (GPT-3.5 Free / GPT-4... Source: almost 1 year ago
Poe: Quora's AI app with multiple models (GPT-3.5 Free / GPT-4 free with 'limited access'). Source: almost 1 year ago
Is poe.com completely free? Is there an API access? I haven't looked much into it but it feels like it's just a relay service to openai? Source: over 1 year ago
>How do you detect speech starting and stopping? https://github.com/snakers4/silero-vad. - Source: Hacker News / 8 months ago
You could look into https://github.com/guillaumekln/faster-whisper especially the VAD section (Voice Activity Detector) using https://github.com/snakers4/silero-vad. Source: 11 months ago
I also had the same synchronization issue, so I wrote a WebUI/CLI that uses Silero-VAD that first splits the audio whenever there a silent portion (or every 30 seconds), and I haven't experienced it since:. Source: about 1 year ago
By the way, I've updated the WebUI to now also support using Silero VAD to break up the audio into distinct sections, and run Whisper on each section and then combine them into one single transcript/SRT file. Source: over 1 year ago
And while googling this, I stumbled upon this discussion on the Whisper GitHub repository, which seems to suggest that the issue is that the current VAD (Voice Activity Detection) is quite poor, and that it can be resolved by using another VAD (like silero-vad). This might be something I want to add to my WebUI in the future. Source: over 1 year ago
ChatGPT - ChatGPT is a powerful, open-source language model.
Whisper Memos - Whisper Memos turns your ramblings into paragraphed articles, and emails them to you.
Perplexity.ai - Ask anything
MacWhisper - High Quality Text Transcription with OpenAI's Whisper on Mac
HuggingChat - Open source alternative to ChatGPT. Making the best open source AI chat models available to everyone.
WOMBO - Make your selfies sing