Voicegain
Speech Analytics API
INGEST AUDIO FROM DIVERSE SOURCES
Voicegain Speech Analytics can ingest audio from diverse sources like files, object storage (e.g. S3), Web APIs, SIP/SIPREC.
SPEECH ANALYTICS AT THE EDGE
Voicegain platform can offer control, data privacy and security by providing analytics at the Edge (i.e. on client’s infrastructure). Edge also offers lower pricing.
CUSTOM ACOUSTIC MODELS
Voicegain can train the acoustic model to include utterances related to industry jargon, speaker style and accents, etc. Custom trained models cut the word-error-rate in half.
AFFORDABLE PRICING
Voicegain is disruptively priced to analyze 100% of recordings. Why sample, when you can know it all.
Pricing is based on usage, so there are no fixed costs.
BOTH AUDIO AND TEXT
ANALYZED BY AI
Our AI algorithms are applied both to call audio and the transcript in order to extract features like sentiment throughout the call, relevant entities, etc.
CONTACT CENTER
SUPPORT
Use with Voicegain Speech-to-Text APIs during the IVR and live agent interaction to enhance automation with Voice Bots
KEY FEATURES
Voicegain
Speech-to-Text APIs
SUPPORT FOR STREAMING
Real-time streaming input is supported over web-sockets, gRPC and SDK. We also directly integrate with Twilio Media Streams.
DEPLOY AT THE EDGE OR ON CLOUD
Access APIs on our Cloud infrastructure or through containerized deployment on the Edge
AFFORDABLE PRICE AT GREAT ACCURACY
Our disruptive pricing enables mass adoption of speech. We have a free tier that can help you get started with your app immediately.
RESTFUL API
Voicegain provides standard RESTful APIs. Documentation is provided in Open API 3.0 format and has been verified to work with code generation tools.
COMPREHENSIVE SET OF APIs
We provide a comprehensive set of APIs for a wide array of use cases. For example, a word-tree output for large vocabulary and n-best results for grammar-based speech-to-text
CUSTOM ACOUSTIC MODELS
Voicegain provides APIs and tools to train the acoustic model to their specific needs. Training does not require time annotation in transcripts.
KEY FEATURES
Voicegain Telephony
Bot APIs
SIMPLICITY OF INTEGRATION
Telephony Bot APIs are simple to integrate. You invite us to a single session for as long you need to communicate with a caller.
SUPPORT BOTH NLU AND DIRECTED DIALOG
API is easy to integrate with popular NLU engines like RASA and Dialog-flow. We also offer easy options to specify directed dialog logic.
LARGE VOCABULARY, SPEECH GRAMMARS
We support both large vocabulary and speech grammars. You can use grammars to constrain the recognizer for high accuracy & simpler intent capture.
INTEGRATE WITH YOUR CPAAS
The API integrates with CPaas Platforms like Twilio, SignalWire, Amazon Voice Connector. Basically any platform from which you can do SIP INVITE
CALLBACK APIs
The callback API makes requests to your app logic upon significant events. In response you may invoke commands to play a prompt, ask a question, or convert an utterance to text.
APP/BOT LOGIC IN ANY LANGUAGE
Developers can write app logic in a programming language of their choice – Python, Node.js or Java.
KEY FEATURES
Voicegain Transcribe
& Captioning
KEY FEATURES
TIMESTAMP INFORMATION
Voicegain provides a timestamp for each word transcribed. Time data is retained when editing /correcting the transcript and can be included in exported files
TRANSCRIPTION AT THE EDGE
Voicegain platform offers control, data privacy and security by providing transcription at the Edge (i.e. on client’s infrastructure).
CUSTOM ACOUSTIC MODELS
Voicegain can train the acoustic model to include utterances related to industry jargon, Speaker style and accents, etc.
CUSTOM VOCABULARY
Voicegain provides the ability to add new words (that may be specific to the client’s domain) to the language model.
SUPPORT FOR A VARIETY OF AUDIO INPUTS
We provide multiple ways to submit audio and receive text output that cover most common use cases.
PUNCTUATION
We add capitalization and punctuation automatically using deep learning, so that the output is more intelligible and can be used with minimal editing.
Voicegain MRCP ASR
VOICEXML/MRCP SUPPORT
Voicegain ASR is invoked from any VoiceXML IVR platform over MRCP. We are compatible with VXML platforms like Avaya, Genesys, Cisco, etc.
DEPLOY AT THE EDGE OR ON CLOUD
The software is available on the Cloud or deployable On-Premise/at the Edge (i.e. on client’s infrastructure).
CUSTOM ACOUSTIC MODELS
Clients can train the acoustic model to include utterances related to industry jargon, Speaker style and accents, etc.
TUNING &
TESING TOOLS
Voicegain provides tools for tuning and testing grammars. Same tools can be used to collect data for acoustic model training.
LICENSING BASED ON USAGE
Voicegain ASR is licensed based on usage. This helps clients avoid significant upfront capital outlays.
SPEECH GRAMMARS +
LARGE VOCABULARY
We provide full support for grammars like GRXML and JSGF. We include a library of built-in grammars and support large vocabulary models too.