Pay-as-you-go, usage-based pricing with no commitments. $50 in credits is provided on signup, and no credit card is required to start. Rate limits apply; custom rate limits are available with revenue commits. Please contact us for details.
1. Platform usage is measured and billed per second, but our billing system displays usage in hours.
2. Each API request is subject to a minimum billing of 6 seconds, with 1-second increments after that. For example, a real-time request of 4 seconds is billed for 6 seconds, or $0.0012 ($0.00020*6), and a real-time request of 7 seconds is billed for 7 seconds, or $0.0014 ($0.00020*7).
3. STT Offline-Basic offers STT on a mono channel with no Diarization and no PII Redaction. Voicegain Whisper-small is provided at the Basic price. STT Offline-Enhanced offers Diarization and PII Redaction in addition to transcription. Voicegain Whisper-medium is provided at the Enhanced price. It also supports 2-channel audio for call center recordings where the Agent and the Caller are on separate channels. STT-Offline - Multi-Channel is for meeting recordings from Zoom or any other meeting platform where each speaker is in a separate audio file.
4. STT Realtime-Transcription is for Voicegain's streaming Speech-to-Text over WebSockets. The price in the table is per channel. We provide a 50% discount to call center customers where the Agent channel and the Caller channel are streamed separately.
5. Custom Speech-to-Text model is built by training our standard model with additional client data (using transfer learning). Please contact us for pricing.
6. STT-Realtime with MRCP or Telephony Bot API is the price for use of our Speech-to-Text/ASR as part of an MRCP or Telephony Bot API Session. This price is applicable for the entire duration of the MRCP or Telephony Bot/SIP Session. It does not include 100% whole-call recording of sessions.
7. Rate Limits apply for pay as you go. We offer higher rate limits and lower pricing with volume & term commits. Please contact us at sales@voicegain.ai to get the details.
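The minimum-billing rule in note 2 can be sketched in a few lines of Python. This is only an illustration, not Voicegain's billing code: the function names are hypothetical, the $0.00020 per-second rate is the real-time figure used in the examples above, and rounding fractional seconds up to the next whole second is an assumption (the notes only state 1-second increments).

```python
import math

MIN_BILLED_SECONDS = 6      # minimum billing per API request (note 2)
RATE_PER_SECOND = 0.00020   # USD; real-time rate used in the examples above


def billed_seconds(duration_seconds: float) -> int:
    """Return the number of seconds billed for one request.

    Assumption: fractional seconds are rounded up to the next whole second.
    """
    return max(MIN_BILLED_SECONDS, math.ceil(duration_seconds))


def request_cost(duration_seconds: float,
                 rate_per_second: float = RATE_PER_SECOND) -> float:
    """Return the cost in USD of one request at the given per-second rate."""
    return billed_seconds(duration_seconds) * rate_per_second


def seconds_to_hours(seconds: float) -> float:
    """Convert billed seconds to hours, the unit the billing system displays."""
    return seconds / 3600.0
```

With these definitions, `billed_seconds(4)` gives 6 and `billed_seconds(7)` gives 7, matching the two examples in note 2.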
Deploy Voicegain on your private infrastructure. A free 30-day trial is provided. Port-based or usage-based licensing is offered. A minimum purchase of ports or usage applies. Additional annual support costs may be applicable.
1. Voicegain Edge refers to our platform being deployed on client infrastructure (bare-metal or VPC). Voicegain is deployed on a Kubernetes Cluster. We prefer NVIDIA GPUs for apps that require high concurrency. CPUs are supported for low concurrency apps. Orchestration of the cluster is from Voicegain cloud.
2. The client incurs the infrastructure costs and is responsible for monitoring Kubernetes. For a VPC, we recommend managed Kubernetes from the cloud provider; for a datacenter, you can contact us for support options.
3. "Port" - for STT Offline - is defined as throughput, so 25 Ports would allow you to transcribe 25 hours of offline audio per hour. For Real-time STT, a Port is one concurrent WebSocket session. E.g., 25 Ports means a maximum of 25 concurrent Real-time STT sessions during a month.
4. For usage-based licensing, each request is subject to a minimum billing of 6 seconds, with 1-second increments after that. E.g., a real-time request of 4 seconds is billed for 6 seconds, or $0.0012 ($0.00020*6), and a real-time request of 7 seconds is billed for 7 seconds.
5. Voicegain offers discounts for volume & term commits. Please contact us at sales@voicegain.ai to receive custom pricing.
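For capacity planning, the Port definition in note 3 can be turned into a small calculation. The sketch below is illustrative only: the function names are hypothetical, and it assumes that one offline Port transcribes exactly one hour of audio per wall-clock hour and that throughput scales linearly with the number of Ports.

```python
import math


def offline_ports_needed(audio_hours: float, window_hours: float) -> int:
    """Estimate the Ports needed to transcribe `audio_hours` of offline audio
    within `window_hours` of wall-clock time.

    Assumption: 1 Port = 1 hour of audio transcribed per hour (note 3).
    """
    if audio_hours < 0 or window_hours <= 0:
        raise ValueError("audio_hours must be >= 0 and window_hours must be > 0")
    return math.ceil(audio_hours / window_hours)


def can_open_realtime_session(ports: int, active_sessions: int) -> bool:
    """Return True if one more real-time session fits under the Port limit.

    For real-time STT, a Port is one concurrent session (note 3).
    """
    return active_sessions < ports
```

For example, `offline_ports_needed(25, 1)` gives 25, which matches the 25-Port example in note 3, while a backlog of 100 hours to be cleared in an 8-hour window would need 13 Ports under these assumptions.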
You can find the complete code (minus the RASA logic - you will have to supply your own) in our GitHub repository.
The setup allows you to call a phone number and then interact with a Voicebot that uses RASA as the dialog logic engine.
November 2021 Update: We do not recommend S3 and AWS Lambda for a production setup. A more up-to-date review of various options to build a Voice Bot is described here. You should consider replacing the functionality of S3 and AWS Lambda with a web server that is able to maintain state, such as Node.js or Python Flask.
The sequence diagram is provided below. Basically, the sequence of operations is as follows:
Interested in customizing the ASR or deploying Voicegain on your infrastructure?