List of speech recognition software

This is a list of speech recognition software.

Open source acoustic models and speech corpus (compilation)

The following list presents notable speech recognition software engines with a brief synopsis of characteristics.

| Application name | Description | Open Source | License | Operating System | Programming Language | Supported Language/Note | Offline vs Online |
|---|---|---|---|---|---|---|---|
| CMU Sphinx | HMM | Yes | BSD style | Multi-platform | C/Java/Python | English | Offline |
| HTK | HMM neural net. | No | HTK Specific License | Multi-platform | C | English. Version 3.5 released December 2015. | |
| Julius | HMM trigrams | Yes | BSD-like | Multi-platform | C | Japanese, English (non-commercial)[1] | Offline |
| Kaldi | Neural net | Yes | Apache | Multi-platform | C++ | English | |
| RWTH ASR | RWTH Aachen University | Yes | RWTH ASR License | Linux, macOS | C++ | English. Non-commercial use only | |

The following lists open-source applications that provide convenient user interfaces for the above.

| Application name | Description | Open Source | License | Operating System | Programming Language | Supported Language/Note |
|---|---|---|---|---|---|---|
| Simon | Supports Sphinx, HTK, Julius | Yes | GPLv2 | Multi-platform | C++ | English |
| Jasper project | Raspberry Pi front-end for CMU Sphinx or Julius | Yes | MIT License | Linux | Python | English |

Macintosh

| Application name | Description | Open Source | License | Price | Note |
|---|---|---|---|---|---|
| Dragon Dictate | Mac OS (by Nuance) | No | Proprietary | | |
| MacSpeech Scribe | Transcription from recorded text | | | | |
| iListen | PowerPC Macintosh | | | | |
| Speakable items | Included with Mac OS | | | | |
| ViaVoice | IBM Product. Purchased by Nuance. | | | | |
| Voice Navigator | Original GUI voice control (1989) | | | | |

Cross-platform web apps based on Chrome

The following list presents notable speech recognition applications that run in the Chrome browser as web apps. They use the HTML5 Web Speech API.[1]

| Application name | Description | Open Source | License | Price | Note |
|---|---|---|---|---|---|
| Sonix | Independently reviewed as most accurate automated transcription service | No | Commercial | 10¢/minute w/ subscription | |
| Happy Scribe[2] | Cloud based transcription service | No | Commercial | From 10¢/minute | |
| Voice Notebook | Free dictation, voice typing to clipboard and to web text fields. Windows and Linux integration | No | Commercial | Free | |
| SpeechTexter[3] | Online speech recognition and editor | No | Commercial | Free | |
| Speechnotes | Dictation notepad - professional speech recognizing text editor web app | No | Commercial | Free | |
| Go-Transcribe.com | Cloud based transcription service | No | Commercial | Subscription based | |
| Speechmatics[4] | Cloud based and on-premise automatic speech recognition | No | Commercial | From £0.06 per minute of audio | |
| Trint | Convert audio/video to text, search and verify in online editor that glues audio to text. | No | Commercial | From 17¢/minute | |

Mobile devices and smartphones

Many cell-phone handsets have basic dial-by-voice features built in. Smartphones such as iPhones and BlackBerrys also support this. A number of third-party apps have implemented natural-language speech-recognition support, including: