
Speech-to-Text that learns

Accurate, low-cost Speech-to-Text Platform and APIs for Bots, IVR, Speech Analytics, Transcription, and more
PUT VOICEGAIN TO WORK
 

Free Tier available

Build Custom models trained for your domain

  • Significantly enhance recognition accuracy by customizing the core acoustic DNN model.
  • Optimize speech to text to your audio quality, industry jargon, background noise, accents, etc.

Transcribe Audio and Extract insights

  • Convert audio into text, both in real time and offline
  • Extract analytics (keywords, topics, sentiment) from calls, meetings, podcasts (both live and recorded) 
  • Integrate with Contact Center and Unified Communication systems - both cloud & on-premise

Build Voice Bots, Assistants or IVR

  • Build a voice bot or speech IVR using any backend programming language with our APIs
  • Support for large vocabulary and grammar based speech recognition
  • Stream audio in real time over TCP-based protocols (gRPC, WebSockets, HTTP/2) and UDP-based protocols (WebRTC, SIP/RTP, MRCP)

Deploy on the Edge or access via Cloud  

  • Access Speech-to-text as a cloud service on our Google Cloud GPU infrastructure.
  • Install Voicegain as a containerized application on your own GPU infrastructure, in a data center or private cloud.


Amazing accuracy at an awesome price 

Voicegain is the most improved speech to text engine in the market.

Based on an accuracy benchmark conducted in September 2020, Voicegain matches the accuracy of Amazon Transcribe and is more accurate than Google Standard.

 

We do this at an amazingly competitive price of 1 cent/minute.

Most importantly, we can build a custom acoustic model that can perform even better.

You can read more about the benchmark dataset and our continuous improvements in accuracy in our blog post.

 

 
 
Technology

Voicegain’s Speech to Text engine utilizes multiple Deep Neural Network models running on modern GPUs to achieve high recognition accuracy. 

 

Voicegain can be deployed either as a containerized application on the Edge or accessed using our APIs on our modern cloud infrastructure. 


Applications

Voicegain supports both large-vocabulary speech-to-text and recognition using context-free grammars, enabling applications such as:

  • Speech analytics and transcription – both real-time and offline 

  • Voice Bots/Assistants that allow users to speak to the application

  • Live Agent Assist and Speech IVR in call centers or help desks 

  • Embedding Speech-to-Text into products


Difference
  1. Full featured APIs that cover most scenarios for your apps

  2. Enhanced RTC APIs for Bots, Speech IVRs and Agent Assist

  3. Privacy and Control with Edge deployment 

  4. Custom Acoustic Model to improve accuracy 

  5. Stream audio over gRPC, SIP, WebRTC, and WebSockets


 
PRICING
Pricing is based on usage of platform resources.
Volume discounts are available above 500K minutes/month. Edge pricing has minimum revenue commitments. Click here for more information.

VOICEGAIN CLOUD

  • Offline Speech-to-Text: 1.00 cent per minute
  • Realtime Speech-to-Text: 1.25 cents per minute
  • RTC session time (WebRTC/SIP): 0.25 cents per minute

VOICEGAIN EDGE

Subject to minimum monthly commitments

  • Standard Speech-to-Text: 1.00 cent per minute
  • Custom Speech-to-Text: 1.50 cents per minute
  • RTC session (SIP/WebRTC): 0.25 cents per minute
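
As a rough illustration of usage-based pricing, the sketch below estimates a monthly Voicegain Cloud bill from the cloud rates listed above. The function name, the resource keys, and the idea that Realtime STT and RTC session charges simply add up are assumptions made for this example; it also assumes the rates are in US cents and ignores volume discounts and the Free Tier.

```python
# Cloud rates in cents per minute, copied from the pricing list above.
# The dictionary keys are hypothetical labels chosen for this sketch.
CLOUD_RATES_CENTS = {
    "offline_stt": 1.00,
    "realtime_stt": 1.25,
    "rtc_session": 0.25,
}


def estimate_monthly_cost_usd(minutes_by_resource):
    """Return an estimated monthly charge in US dollars.

    Assumes each resource is billed independently per minute and that
    the charges are summed; discounts and free minutes are not modeled.
    """
    total_cents = 0.0
    for resource, minutes in minutes_by_resource.items():
        if minutes < 0:
            raise ValueError(f"negative minutes for {resource}")
        total_cents += CLOUD_RATES_CENTS[resource] * minutes
    return round(total_cents / 100, 2)


# Example: 10,000 offline minutes plus 2,000 minutes of realtime
# transcription carried over WebRTC/SIP sessions.
usage = {"offline_stt": 10_000, "realtime_stt": 2_000, "rtc_session": 2_000}
print(estimate_monthly_cost_usd(usage))  # 130.0
```

Under these assumptions, a realtime call carried over an RTC session costs 1.25 + 0.25 = 1.50 cents per minute.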
 
VOICEGAIN SIGNUP

Build something awesome with the Voicegain platform today!

Free Tier Offer: For a limited time, new accounts on Voicegain cloud receive 600 free minutes* of platform use each month. No credit card is required if you sign up today. Use Voicegain APIs to build your amazing app, whether that is a unique voice bot, a Speech IVR, or an app that analyzes your audio, such as call recordings, podcasts, or meetings.

Features available immediately after sign-up

  • Full set of RESTful Speech-to-Text (STT) APIs

  • Realtime STT support with gRPC, WebSockets, and MRCP

  • Offline Transcription from Voicegain Web Portal

  • Live Transcription with broadcast via WebSockets

Early access Alpha features

  • Speech Analytics UI - for contact centers 

  • RTC Callback APIs for Voice Bots & Speech IVRs (supports SIP/RTP and WebRTC)

Have questions? Visit the Free Tier FAQs on our support website.

 

If you are interested in Edge Deployment for Speech IVR, check out our $9999 offer.

* The 600 free minutes can be used for either Offline or Realtime STT.

1505 LBJ Fwy, Ste 255

Dallas, TX 75234

Contact: 972-518-0863

Contact Us
Figure: September 2020 accuracy benchmark (44+20 files). Summarized results of the speech-to-text accuracy benchmark comparing Voicegain to Google and Amazon.