Jump to content

List of speech recognition software

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by 216.50.37.231 (talk) at 18:13, 3 December 2016 (Minor correction). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Open source acoustic models and speech corpus (compilation)

The following list presents notable speech recognition software engines with a brief synopsis of characteristics.

Application name Description Website Open Source License Operating System Programming Language Supported Language/Note
CMU Sphinx HMM CMU: Sourceforge Yes BSD style Multi-platform Java English
HTK HMM HTK Web Site No HTK Specific License Multi-platform C English. Version 3.5 released December 2015.
Julius HMM trigrams Julius Home page Yes BSD-like Multi-platform C English
Kaldi Deep neural net. Kaldi Web Site Yes Apache Multi-platform C++ English
iATROS LDA (Latent Dirichlet) iATROS Yes miss Linux C English. Currently inactive (last update 2009)
RWTH ASR RWTH Aachen University RWTH ASR No RWTH ASR License Linux, Mac OS X ? English. Non-commercial use only

The following lists open-source applications that provide convenient user interfaces for the above.

Application name Description Website Open Source License Operating System Programming Language Supported Language/Note
Simon Supports Sphinx, HTK, Julius Simon Yes GPLv2 Multi-platform C++ English
Jasper project Raspberry Pi front-end for CMU Sphinx or Julius Jasper Project Yes MIT License Linux Python English

Macintosh

Application name Description Website Open Source License Price Note
Dragon Dictate Mac OS (by Nuance) No Proprietary
MacSpeech Dictate Medical Medical dictation product
Macspeech Dictate Legal Legal-focused dictation
MacSpeech Scribe Transcription from recorded text
iListen PowerPC Macintosh
Speakable items Included with Mac OS
ViaVoice IBM Product. Purchased by Nuance.
Voice Navigator Original GUI voice control (1989)
Power Secretary[1]
Vestec Inc. ASR, NLU, TTS, VSLIC [2]

Cross-platform web apps based on Chrome

The following list presents notable speech recognition software that operate in a Chrome browser as web apps. They make use of HTML5 Web-Speech-API.[2]

Application name Description Website Open Source License Price Note
Speech Pad Free dictation, voice typing to clipboard and to web text fields. Windows and Linux integration Speech Pad Web App No Commercial Free
SpeechTexter Online Speech Recognition SpeechTexter Web App No Commercial Free
Speechnotes Dictation notepad - professional speech recognizing text editor web app Speechnotes Web App No Commercial Free
Trint Convert audio/video to text, search and verify in online editor that glues audio to text. Trint No Commercial From 17¢/minute

Mobile devices and smartphones

Many cell-phone handsets have basic dial-by-voice features built in. Smartphones such as iPhones and BlackBerrys also support this. A number of third-party apps have implemented natural-language speech-recognition support, including: