Transcription for Live Streamed Event

The video below shows an example of Voicegain Live Transcribe used to provide transcription for an event streamed over video.
‍

‍

Here are some details about this particular setup:

the video part is streamed using BoxCast
the audio for transcription is tapped live at the source on site
audio is streamed to Voicegain Cloud for processing using a small Java client running on raspberry pi computer
the audio client was downloaded pre-configured from the Voicegain portal and reads audio directly from USB audio device plugged into raspberry pi
speech is transcribed in the Cloud using Voicegain semi-real-time mode which delivers results in about 30 seconds (the real-time mode delivers results will less than 1 second delay))
the transcription output goes via a delay component that allows us to dial in the precise delay to match the streaming video delay - in this case the delay was 35.5 seconds
the transcribed words are sent to a Web Client over websocket - each word is sent with the set delay
the words are displayed with the gray font shade corresponding to the confidence in the words and the gap proportional to the gap between the spoken words
the Acoustic Model used here has been custom trained with additional 200h+ hours from this particular speaker
custom training data consisted simply of previously transcribed speeches by the speaker that were readily available on the website
we are also using a custom Language Model (on top of the base NLM) that was created from user provided corpus

Voicegain: Voice AI Under Your Control

Voicegain: Build Voice AI apps with our Speech-to-Text and LLM-powered NLU APIs. Record & Transcribe meetings, contact center calls, videos, etc. Get LLM-powered Summary, Sentiment and more. Build Conversational Voice Bots that integrate with your On-prem or cloud CCaaS platform. Get started today.

See how Voicegain works — get a demo of Voicegain today.

Tell us what you are building!

We love talking with you about generative AI, speech & transcription, & privacy—whether you're a startup, a Fortune 500 company, or anywhere in between.

Thank you for reaching us!
We will be in touch with you shortly.

Back to Home Write a New Message

Oops! Something went wrong while submitting the form. Please, try again!

Casey

AI Voice Agent Platform

Transcribe

Transcription for Live Streamed Event - an example

Voicegain: Voice AI Under Your Control

Tell us what you are building!