Local vad service #2502

thestumonkey · 2025-06-03T14:51:14Z

In a local environment, speech detection was not working, so the "teach omi your voice" prompt was shown all the time.

The Silero model was not returning segments because it needs 16000 Hz audio, not 8000 Hz.
The code was hardcoded to look for a separate VAD service, which can be configured in backend/modal.

This PR:

Fixes Silero model use
Adds logic to fall back to Silero if HOSTED_VAD_API_URL is not set
Simplifies the Docker container to use the NVIDIA CUDA image directly instead of building from scratch
Changes the default VAD port so it doesn't conflict with the server
Updates the code to call the correct /v1/vad endpoint instead of the root
Adds logic to accept the Google service account as either a file or a JSON string
Updates docs
Updates a couple of libraries that were failing on the iOS app build
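
For reviewers, the "file or JSON string" handling of the Google service account could look roughly like the sketch below. This is not the code from this PR: the function name `load_service_account_info` is hypothetical, and it assumes the configured value is either inline JSON (starting with `{`) or a filesystem path to a JSON key file.

```python
import json
import os


def load_service_account_info(value: str) -> dict:
    """Return service-account info from inline JSON or from a path to a JSON file.

    Hypothetical helper for illustration; the PR's actual implementation may differ.
    """
    text = value.strip()
    if text.startswith("{"):
        # The value looks like a JSON object, so parse it directly.
        return json.loads(text)
    # Otherwise treat the value as a path to a key file on disk.
    with open(os.path.expanduser(text), "r", encoding="utf-8") as fh:
        return json.load(fh)
```

The resulting dict is the kind of input that `google.oauth2.service_account.Credentials.from_service_account_info` accepts, so a single code path can serve both local key files and secrets injected as strings.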

Changed port so it doesn't run on the same one as the server
Changed to using a Docker image to build CUDA
Updated VAD to have the correct Modal endpoint and fixed the Silero fallback if it is not set
Added flag to disable translation, as it is often not needed in dev
Updated docs for the VAD service
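
The fallback behaviour described above (use the hosted VAD service when HOSTED_VAD_API_URL is set, otherwise run Silero locally) can be sketched as follows. This is a minimal illustration, not the PR's code: `resolve_vad_backend` and the backend labels are hypothetical, and it assumes HOSTED_VAD_API_URL may hold either a base URL or the full /v1/vad URL.

```python
import os


def resolve_vad_backend(env=None):
    """Pick the VAD backend from the environment.

    Returns ("hosted", url) when HOSTED_VAD_API_URL is set, else ("local_silero", None).
    Hypothetical helper for illustration only.
    """
    env = os.environ if env is None else env
    base_url = (env.get("HOSTED_VAD_API_URL") or "").strip()
    if not base_url:
        # No hosted service configured: fall back to the local Silero model.
        return ("local_silero", None)
    url = base_url.rstrip("/")
    if not url.endswith("/v1/vad"):
        # Assumption: the variable holds a base URL, so append the endpoint path
        # rather than posting to the root.
        url += "/v1/vad"
    return ("hosted", url)
```

Passing a dict instead of reading `os.environ` directly keeps the selection logic easy to test; the port in any example URL is arbitrary and not the default chosen in this PR.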

vercel · 2025-06-03T14:51:19Z

@thestumonkey is attempting to deploy a commit to the kodjima33's projects Team on Vercel.

A member of the Team first needs to authorize it.

thestumonkey added 2 commits June 3, 2025 15:31

updated intl and skeletoniser which were failing on build

28a7613
