Software Alternatives, Accelerators & Startups

Silero VAD VS Otter Voice Notes

Compare Silero VAD VS Otter Voice Notes and see how they differ

Silero VAD

Stellar quality. Highly portable. No strings attached. Supports 8 kHz and 16 kHz. Models < one megabyte in size. Supports 30, 60 and 100 ms chunks. Trained on 100+ languages, generalizes well. One chunk ~ 1 ms on a single thread.

Otter Voice Notes

Remember, search, and share your voice conversations
  • Silero VAD landing page (2023-09-21)
  • Otter Voice Notes landing page (2023-10-10)

Category Popularity

0-100% (relative to Silero VAD and Otter Voice Notes)

Category: Silero VAD / Otter Voice Notes
AI: 100% / 0%
Productivity: 0% / 100%
Knowledge Sharing: 100% / 0%
Transcription: 5% / 95%

Social recommendations and mentions

Based on our records, Otter Voice Notes appears to be far more popular than Silero VAD: we know of 232 links to Otter Voice Notes but have tracked only 5 mentions of Silero VAD. We track product recommendations and mentions on public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.

Silero VAD mentions (5)

  • New models and developer products announced at OpenAI DevDay
    >How do you detect speech starting and stopping? https://github.com/snakers4/silero-vad. - Source: Hacker News / 6 months ago
  • [Discussion] Video Translation Task
    You could look into https://github.com/guillaumekln/faster-whisper especially the VAD section (Voice Activity Detector) using https://github.com/snakers4/silero-vad. Source: 10 months ago
  • Using Whisper to transcribe the entire Forensic Files series
    I also had the same synchronization issue, so I wrote a WebUI/CLI that uses Silero-VAD to first split the audio whenever there is a silent portion (or every 30 seconds), and I haven't experienced it since. Source: 12 months ago
  • Whisper - A new free AI model from OpenAI that can transcribe Japanese (and many other languages) at up to "human level" accuracy
    By the way, I've updated the WebUI to now also support using Silero VAD to break up the audio into distinct sections, and run Whisper on each section and then combine them into one single transcript/SRT file. Source: over 1 year ago
  • Whisper - A new free AI model from OpenAI that can transcribe Japanese (and many other languages) at up to "human level" accuracy
    And while googling this, I stumbled upon this discussion on the Whisper GitHub repository, which seems to suggest that the issue is that the current VAD (Voice Activity Detection) is quite poor, and that it can be resolved by using another VAD (like silero-vad). This might be something I want to add to my WebUI in the future. Source: over 1 year ago
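
Several of the mentions above describe the same pattern: use a VAD to cut the audio at silent portions (or after at most 30 seconds), then transcribe each section separately. A minimal sketch of that splitting step follows. It assumes per-chunk speech probabilities have already been produced by some VAD. The function names, the 0.5 threshold, and the input list are illustrative assumptions, not Silero VAD's actual API.

```python
from typing import List, Tuple


def samples_per_chunk(sample_rate_hz: int, chunk_ms: int) -> int:
    """Number of audio samples in one chunk, e.g. 16 kHz and 30 ms chunks."""
    return sample_rate_hz * chunk_ms // 1000


def split_on_silence(
    speech_probs: List[float],      # hypothetical per-chunk VAD output in [0, 1]
    chunk_ms: int = 30,             # chunk length in milliseconds
    threshold: float = 0.5,         # assumed cut-off between silence and speech
    max_segment_ms: int = 30_000,   # force a cut after this much continuous speech
) -> List[Tuple[int, int]]:
    """Return (start_ms, end_ms) speech segments, cut at silence or at the cap."""
    max_chunks = max_segment_ms // chunk_ms
    segments: List[Tuple[int, int]] = []
    start = None  # index of the first chunk of the currently open segment
    for i, prob in enumerate(speech_probs):
        is_speech = prob >= threshold
        if is_speech and start is None:
            start = i  # speech begins: open a segment
        elif start is not None and (not is_speech or i - start >= max_chunks):
            # Silence reached, or the open segment hit the length cap: close it.
            segments.append((start * chunk_ms, i * chunk_ms))
            start = i if is_speech else None
    if start is not None:
        # Audio ended while a segment was still open.
        segments.append((start * chunk_ms, len(speech_probs) * chunk_ms))
    return segments


if __name__ == "__main__":
    probs = [0.1, 0.9, 0.8, 0.2, 0.1, 0.7]
    print(samples_per_chunk(16_000, 30))
    print(split_on_silence(probs))
```

Each returned segment could then be passed to a transcriber on its own and the results merged into a single transcript, as the commenters describe. Working in integer milliseconds avoids floating-point drift in the segment boundaries.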

Otter Voice Notes mentions (232)

  • How to build a Google Meet AI assistant app in 10 minutes without coding
    Of course, there are many existing solutions like Otter.ai or Fathom in the market. But in case you want to build a tool yourself and customize the output of it, then you are on the same page as me. To develop this application, we will use Unbody to convert input video transcriptions into intelligence/generative content and Appsmith to make it easy to design and build the UI of our app without extensive front-end... - Source: dev.to / 5 months ago
  • I'm looking for a particular type of software and I don't know where else to go.
    This is weird but I wonder if you could use something like https://otter.ai/. Record your notes as you are going. That should give you at least text of all of your welds. You’d still have to punch it later. Seems like there’s got to be a better way to do this. Stopping every time to break your flow sounds like a huge pain in the ass. Curious what you come up with. Source: 5 months ago
  • How I tackled more than 2+ conflicting meetings
    Is there any app from otter.ai that you run on personal machine? How does otter.ai process 4 different audio streams? Source: 5 months ago
  • How I tackled more than 2+ conflicting meetings
    Job laptop -> 3.5mm aux (this turns into speaker output) -> 3.5mm mic/audio splitter (this turns into microphone input) -> 3.5mm to usb-c adapter (cause my macbook only has 1 3.5mm aux) --> now the personal macbook has a new "mic input" from the job laptop. Which you can use to pipe audio into otter.ai to transcribe audio. You have to manually name them, but they learn in subsequent meetings. Source: 5 months ago
  • How I tackled more than 2+ conflicting meetings
    I recently started to use AI transcription services (you can use any.. But I'm currently using otter.ai) , but I don't have the service join the meetings. I use a 3.5mm aux splitter to pipe the audio out of my job laptop (so one pipe goes to the audio mixer), and use the other split to be passed into a microphone adapter to 3.5mm aux and then to my personal laptop. The personal laptop runs otter.ai to transcribe... Source: 5 months ago

What are some alternatives?

When comparing Silero VAD and Otter Voice Notes, you can also consider the following products

Whisper Memos - Whisper Memos turns your ramblings into paragraphed articles, and emails them to you.

AudioPen - The easiest way to convert messy thoughts into clear text

The Parodist App - Super-realistic celebs' voices made by AI

Whisper.sh - Whisper is the best place to express yourself online. Connect with likeminded individuals and discover the unseen world around you.

MacWhisper - High Quality Text Transcription with OpenAI's Whisper on Mac

Taped.ai - Taped.ai is an AI tool that quickly transcribes and summarizes audio, images, and text. It reimagines note-taking with AI by transforming messy thoughts into organized notes.