Microsoft Speaker Recognition API is a cloud-based APIs that provide the most advanced algorithms for speaker verification and speaker identification that can be divided into two categories: speaker verification and speaker identification.
Hidden Markov Model Toolkit (HTK) is a portable toolkit for building and manipulating hidden Markov models that is primarily used for speech recognition research although it has been used for numerous other applications including research into speech synthesis, character recognition and DNA sequencing.
Google Cloud Speech-to-Text is a service that enables developers to convert audio to text by applying neural network models in an easy to use API, it recognizes over 80 languages and variants, to support global user base and can transcribe the text of users dictating to an application's microphone, enable command-and-control through voice, or transcribe audio files, among many other use cases.
IBM Watson Speech to Text is a tool that can be used anywhere if there is a need to bridge the gap between the spoken word and its written form, it uses machine intelligence to combine information about grammar and language structure with knowledge of the composition of an audio signal to generate an accurate transcription.
Amazon Transcribe is an automatic speech recognition (ASR) service that makes it easy for developers to add speech to text capability to their applications. Using the Amazon Transcribe API, you can analyze audio files stored in Amazon S3 and have the service return a text file of the transcribed speech.
Sayint, an AI based conversation analytics solution, helps you uncover valuable insights to improve agent performance, enhance customer satisfaction and drive operational efficiencies. Sayint can analyse both real-time and historical communications (voice, chat, email and social feeds)
BigHand Speech Recognition transcribes voice to text with 97%+ accuracy. With BigHand Speech Recognition, busy people can use their voice to get more done, whether in the office and on the move. Get in touch to find out more.
Microsoft Custom Recognition Intelligent Service (CRIS) is a tool that overcome speech recognition barriers like speaking style, background noise, and vocabulary and enables user to customize Microsoft's speech-to-text engine for application
ResourceMate provides comprehensive cataloguing, searching and circulating software as well as unmatched technical support to not only libraries, schools, churches, museums, government, medical/nursing - but any organization that needs to be organized.
PromptSmart Pro is the market leader in mobile teleprompter software. With our patented VoiceTrack speech recognition technology, PromptSmart follows your every word during your speech, automatically scrolling the text at your natural pace in real time without the need for an internet connection. If you ad-lib or go off script, PromptSmart stops and waits for you to go back on script