Software Alternatives & Reviews

Voysis VS Silero VAD

Compare Voysis VS Silero VAD and see what are their differences

Voysis logo Voysis

The complete independent Voice AI platform

Silero VAD logo Silero VAD

Stellar quality.Highly portable.No strings attached.Supports 8 kHz and 16 kHz.Models < one megabyte in size.Supports 30, 60 and 100 ms chunks.Trained on 100+ languages, generalizes well.One chunk ~ 1ms on a single thread.
  • Voysis Landing page
    Landing page //
    2021-10-16
  • Silero VAD Landing page
    Landing page //
    2023-09-21

Voysis videos

Voysis acquired by Apple | VUX World

More videos:

  • Review - Why Apple acquired Voysis and what it means for Siri | VUX World

Silero VAD videos

No Silero VAD videos yet. You could help us improve this page by suggesting one.

+ Add video

Category Popularity

0-100% (relative to Voysis and Silero VAD)
AI
39 39%
61% 61
Text To Speech
100 100%
0% 0
Transcription
0 0%
100% 100
Productivity
100 100%
0% 0

User comments

Share your experience with using Voysis and Silero VAD. For example, how are they different and which one is better?
Log in or Post with

Social recommendations and mentions

Based on our record, Silero VAD seems to be more popular. It has been mentiond 5 times since March 2021. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.

Voysis mentions (0)

We have not tracked any mentions of Voysis yet. Tracking of Voysis recommendations started around Mar 2021.

Silero VAD mentions (5)

  • New models and developer products announced at OpenAI DevDay
    >How do you detect speech starting and stopping? https://github.com/snakers4/silero-vad. - Source: Hacker News / 6 months ago
  • [Discussion] Video Translation Task
    You could look into https://github.com/guillaumekln/faster-whisper especially the VAD section (Voice Activity Detector) using https://github.com/snakers4/silero-vad. Source: 10 months ago
  • Using Whisper to transcribe the entire Forensic Files series
    I also had the same synchronization issue, so I wrote a WebUI/CLI that uses Silero-VAD that first splits the audio whenever there a silent portion (or every 30 seconds), and I haven't experienced it since:. Source: 11 months ago
  • Whisper - A new free AI model from OpenAI that can transcribe Japanese (and many other languages) at up to "human level" accuracy
    By the way, I've updated the WebUI to now also support using Silero VAD to break up the audio into distinct sections, and run Whisper on each section and then combine them into one single transcript/SRT file. Source: over 1 year ago
  • Whisper - A new free AI model from OpenAI that can transcribe Japanese (and many other languages) at up to "human level" accuracy
    And while googling this, I stumbled upon this discussion on the Whisper GitHub repository, which seems to suggest that the issue is that the current VAD (Voice Activity Detection) is quite poor, and that it can be resolved by using another VAD (like silero-vad). This might be something I want to add to my WebUI in the future. Source: over 1 year ago

What are some alternatives?

When comparing Voysis and Silero VAD, you can also consider the following products

Snips Voice Platform - The first AI-powered voice assistant with privacy

Whisper Memos - Whisper Memos turns your ramblings into paragraphed articles, and emails them to you.

Amazon Polly - Named for a parrot, Amazon Polly is a text-to-speech (TTS) software that makes your text come to life in a natural, authentic way. The software has many lifelike voices, both male and female, and in a variety of languages.

The Parodist App - Super-realistic celebs' voices made by AI

Lyrebird - Copy the voice of anyone, using a voice imitation algorithm

MacWhisper - High Quality Text Transcription with OpenAI's Whisper on Mac