• Jacek Jarmulak

Word Clouds


Voicegain Web Console has been providing a word cloud view of the transcript already for well over a year. It was a very simple word cloud generated directly from the text. We were not very happy with that word cloud, in particular because too many stop words in it made it not very useful for its primary purpose, which is to quickly tell what the text is about.


In Release 1.25.0 we have made two improvements to the word cloud:

  • Word cloud is now generated on the server back-end using smarter algorithms which remove stop words that do not affect the meaning and which now recognize frequently occurring word bigrams (2-word phrases)

  • Word cloud is now available in the /asr/transcribe API and in the /sa (speech analytics APIs.

Let us know what you think about the new word clouds. Are they an improvement? Is there anything else that you would like us to modify with respect to word clouds? BTW, release 1.26.0 will include a preview of the top words from the word cloud in the transcript list to help you quickly identify what the transcripts are about.

12 views0 comments