Dub AI is designed to help creators translate and dub their audio and videos using artificial intelligence.
It supports more than 25+ languages currently with voice cloning and multi-speaker capabilities. The multi-speaker feature can support up to ten speakers simultaneously and comes with an automatic detection function. The tool also provides access to translated transcripts and audio clips, adding value for post-processing use.
Dub AI is very user-friendly, allowing for quick uploading of audio and video files or YouTube URLs to start dubbing. It takes 3 simple steps to dub a video and allows users to download all the assets including transcripts and audio segments.
No Silero VAD videos yet. You could help us improve this page by suggesting one.
Based on our record, Silero VAD seems to be more popular. It has been mentiond 5 times since March 2021. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
>How do you detect speech starting and stopping? https://github.com/snakers4/silero-vad. - Source: Hacker News / 7 months ago
You could look into https://github.com/guillaumekln/faster-whisper especially the VAD section (Voice Activity Detector) using https://github.com/snakers4/silero-vad. Source: 11 months ago
I also had the same synchronization issue, so I wrote a WebUI/CLI that uses Silero-VAD that first splits the audio whenever there a silent portion (or every 30 seconds), and I haven't experienced it since:. Source: about 1 year ago
By the way, I've updated the WebUI to now also support using Silero VAD to break up the audio into distinct sections, and run Whisper on each section and then combine them into one single transcript/SRT file. Source: over 1 year ago
And while googling this, I stumbled upon this discussion on the Whisper GitHub repository, which seems to suggest that the issue is that the current VAD (Voice Activity Detection) is quite poor, and that it can be resolved by using another VAD (like silero-vad). This might be something I want to add to my WebUI in the future. Source: over 1 year ago
The Parodist App - Super-realistic celebs' voices made by AI
Rask AI - Say goodbye to expensive translators. Our goal is to provide a dubbing and translation experience with AI that is as good as a human
Whisper Memos - Whisper Memos turns your ramblings into paragraphed articles, and emails them to you.
Dubverse - Dubverse enables the content creators to dub the video from one language to another, in real-time for 1/5th the cost using Deep Learning and AI.
Replica - Simple way for save articles, stories and web pages for reading: offline, organized and clean...
Transcri.io - Online Sofware (SaaS) for Audio Transcription and Subtitle Generation | Powered by Artificial Intelligence