Based on our records, Bard AI seems to be a lot more popular than Silero VAD. While we know about 111 links to Bard AI, we've tracked only 5 mentions of Silero VAD. We track product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
Gemini: Developed by Google, Gemini is a powerful conversational AI platform that leverages the tech giant's vast resources and expertise. With its cutting-edge algorithms and scalable infrastructure, Gemini enables businesses to build intelligent chatbots and virtual assistants that deliver seamless experiences across platforms. - Source: dev.to / 16 days ago
Google AI Studio serves as a hub for integrating Google's advanced AI models, including the newly launched Gemini series. It offers a streamlined interface and features like the Gemini 1.5 model, which boasts a 1-million-token context window for handling complex datasets and queries efficiently. - Source: dev.to / 28 days ago
At this point, probably everyone has heard about OpenAI, GPT-4, Claude or any of the popular Large Language Models (LLMs). However, using these LLMs in a production environment can be expensive or nondeterministic regarding their results. I guess that is the downside of being good at everything; you could be better at performing one specific task. This is where HuggingFace can be utilized. HuggingFace provides... - Source: dev.to / about 2 months ago
https://archive.md/gkMOo *In business. Summarized in one paragraph via: https://gemini.google.com. - Source: Hacker News / 2 months ago
I'm currently trying to find proof of concept around this idea I have. It's pretty early and any feedback is appreciated. Currently in email marketing you can personalize marketing emails based on tags and group them into, e.g., a segment. This allows the marketer to personalize the email, but only to the extent allowed by the segmentation, the tags of the user and the usage of those tags in the emails. But what if you could fully... - Source: Hacker News / 3 months ago
>How do you detect speech starting and stopping? https://github.com/snakers4/silero-vad. - Source: Hacker News / 7 months ago
You could look into https://github.com/guillaumekln/faster-whisper especially the VAD section (Voice Activity Detector) using https://github.com/snakers4/silero-vad. Source: 11 months ago
I also had the same synchronization issue, so I wrote a WebUI/CLI that uses Silero-VAD and first splits the audio whenever there is a silent portion (or every 30 seconds), and I haven't experienced it since. Source: 12 months ago
By the way, I've updated the WebUI to now also support using Silero VAD to break up the audio into distinct sections, and run Whisper on each section and then combine them into one single transcript/SRT file. Source: over 1 year ago
And while googling this, I stumbled upon this discussion on the Whisper GitHub repository, which seems to suggest that the issue is that the current VAD (Voice Activity Detection) is quite poor, and that it can be resolved by using another VAD (like silero-vad). This might be something I want to add to my WebUI in the future. Source: over 1 year ago
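The workflow these mentions describe (split the audio at silent portions or every 30 seconds, transcribe each section, then combine the results into one SRT file) can be sketched roughly as follows. This is a minimal illustration, not the code of the WebUI mentioned above: the speech segments are assumed to come from a VAD such as Silero VAD, and `cap_segments`, `srt_time`, `build_srt` and the `transcribe` callback are hypothetical names introduced here.

```python
from typing import Callable, List, Tuple

# (start_seconds, end_seconds) of one speech region, assumed to come from a VAD.
Segment = Tuple[float, float]


def cap_segments(segments: List[Segment], max_len: float = 30.0) -> List[Segment]:
    """Split any segment longer than max_len into consecutive chunks."""
    capped: List[Segment] = []
    for start, end in segments:
        while end - start > max_len:
            capped.append((start, start + max_len))
            start += max_len
        if end > start:
            capped.append((start, end))
    return capped


def srt_time(seconds: float) -> str:
    """Format seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    ms = int(round(seconds * 1000))
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def build_srt(segments: List[Segment],
              transcribe: Callable[[float, float], str]) -> str:
    """Transcribe each segment and join the results into one SRT document.

    `transcribe` is a hypothetical callback; in practice it would run a
    speech-to-text model on the audio between the two timestamps.
    """
    blocks: List[str] = []
    for start, end in segments:
        text = transcribe(start, end).strip()
        if not text:
            continue  # skip sections with no recognized speech
        index = len(blocks) + 1
        blocks.append(f"{index}\n{srt_time(start)} --> {srt_time(end)}\n{text}\n")
    return "\n".join(blocks)


if __name__ == "__main__":
    # Made-up segments and a dummy transcriber, for illustration only.
    sections = cap_segments([(0.0, 70.0), (80.0, 85.0)])
    print(build_srt(sections, lambda s, e: f"speech from {s:.0f}s to {e:.0f}s"))
```

Because each section keeps its original start and end times, the combined subtitles stay aligned with the source audio even though the sections are transcribed independently.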
ChatGPT - ChatGPT is a powerful conversational language model developed by OpenAI.
The Parodist App - Super-realistic celebs' voices made by AI
Perplexity.ai - Ask anything
Whisper Memos - Whisper Memos turns your ramblings into paragraphed articles, and emails them to you.
HuggingChat - Open source alternative to ChatGPT. Making the best open source AI chat models available to everyone.
Replica - A simple way to save articles, stories and web pages for reading: offline, organized and clean...