List of speech recognition software
This is a list of speech recognition software.
Contents
- 1 Open source acoustic models and speech corpus (compilation)
- 2 Macintosh
- 3 Cross-platform web apps based on Chrome
- 4 Mobile devices and smartphones
- 5 Windows
- 6 Built-in software
- 7 Interactive voice response
- 8 Unix-like x86 and x86_64 speech transcription software
- 9 Discontinued software
- 10 See also
- 11 References
Open source acoustic models and speech corpus (compilation)[edit]
The following list presents notable speech recognition software engines with a brief synopsis of characteristics.
| Application name | Description | Open Source | License | Operating System | Programming Language | Supported Language/Note | Offline vs Online |
|---|---|---|---|---|---|---|---|
| CMU Sphinx | HMM | Yes | BSD style | Multi-platform | C/Java/Python | English | Offline |
| HTK | HMM neural net. | No | HTK Specific License | Multi-platform | C | English. Version 3.5 released December 2015. | |
| Julius | HMM trigrams | Yes | BSD-like | Multi-platform | C | Japanese, English(non-commercial)[1] | Offline |
| Kaldi | Neural net | Yes | Apache | Multi-platform | C++ | English | |
| RWTH ASR | RWTH Aachen University | No | RWTH ASR License | Linux, macOS | C++ | English. Non-commercial use only |
The following lists open-source applications that provide convenient user interfaces for the above.
| Application name | Description | Open Source | License | Operating System | Programming Language | Supported Language/Note |
|---|---|---|---|---|---|---|
| Simon | Supports Sphinx, HTK, Julius | Yes | GPLv2 | Multi-platform | C++ | English |
| Jasper project | Raspberry Pi front-end for CMU Sphinx or Julius | Yes | MIT License | Linux | Python | English |
Macintosh[edit]
| Application name | Description | Open Source | License | Price | Note |
|---|---|---|---|---|---|
| Dragon Dictate | Mac OS (by Nuance) | No | Proprietary | ||
| MacSpeech Scribe | Transcription from recorded text | ||||
| iListen | PowerPC Macintosh | ||||
| Speakable items | Included with Mac OS | ||||
| ViaVoice | IBM Product. Purchased by Nuance. | ||||
| Voice Navigator | Original GUI voice control (1989) |
Cross-platform web apps based on Chrome[edit]
The following list presents notable speech recognition software that operate in a Chrome browser as web apps. They make use of HTML5 Web-Speech-API.[1]
| Application name | Description | Open Source | License | Price | Note |
|---|---|---|---|---|---|
| Happy Scribe[2] | Cloud based transcription service | No | Commercial | From 10¢/minute | |
| Voice Notebook | Free dictation, voice typing to clipboard and to web text fields. Windows and Linux integration | No | Commercial | Free | |
| SpeechTexter[3] | Online speech recognition and editor | No | Commercial | Free | |
| Speechnotes | Dictation notepad - professional speech recognizing text editor web app | No | Commercial | Free | |
| Go-Transcribe.com | Cloud based transcription service | No | Commercial | Subscription based | |
| Speechmatics[4] | Cloud based and on-premise automatic speech recognition | No | Commercial | From £0.06 per minute of audio | |
| Trint | Convert audio/video to text, search and verify in online editor that glues audio to text. | No | Commercial | From 17¢/minute |
Mobile devices and smartphones[edit]
Many cell-phone handsets have basic dial-by-voice features built in. Smartphones such as iPhones and BlackBerrys also support this. A number of third-party apps have implemented natural-language speech-recognition support, including:
Windows[edit]Windows built-in speech recognition[edit]The Windows Speech Recognition version 8.0 by Microsoft comes built into Windows Vista, Windows 7 and Windows 8. Speech Recognition is available only in English, French, Spanish, German, Japanese, Simplified Chinese, and Traditional Chinese and only in the corresponding version of Windows; meaning you cannot use the speech recognition engine in one language if you use a version of Windows in another language. Windows 7 Ultimate and Windows 8 Pro allow you to change the system language, and therefore change which speech engine is available. Windows Speech Recognition evolved into Cortana (software), a personal assistant found in Windows 10. Add-ons for Windows 7 speech recognition[edit]
Windows 7/8/10 third-party speech recognition[edit]
Windows XP or 2000 only[edit]
Built-in software[edit]
Interactive voice response[edit]The following are IVR/Interactive Voice response systems:
Unix-like x86 and x86_64 speech transcription software[edit]
Discontinued software[edit]
See also[edit]References[edit]
|