List of speech recognition software
|This section needs additional citations for verification. (January 2013)|
The following list presents notable speech recognition software with a brief synopsis of characteristics.
- 1 Open Source
- 2 Macintosh
- 3 Mobile Devices / Smartphones
- 4 Windows
- 5 Programs for controlling a computer's screens and desktop applications with claps
- 6 Built-in Software
- 7 Interactive voice response
- 8 Unix-like x86 and x86_64 Speech Transcription Software
- 9 Discontinued software
- 10 See also
- 11 References
- CMU Sphinx — open source under a BSD-style license 
- Julius — BSD-style license with citation requirement , distributes models for Japanese.
- Kaldi — Apache license 
- simon — GPL; Uses Julius and the HTK, an alternative backend using PocketSphinx was developed during Google Summer of Code 2012 
- iATROS-speech — Released under a GPL license. 
- RWTH ASR — QPL-style but non-free (due to its ban on commercial use, which is not in the QPL) license.
- Zanzibar OpenIVR - MRCPv2 Server with the Sphinx4 ASR engine and FreeTTS speech synthesizer.
Open source acoustic models and speech corpus
- Dragon Dictate for Mac – From Nuance Communications, renamed from MacSpeech Dictate ca. 2010.
- MacSpeech Dictate Medical – Dictation product for Intel-based Macintosh with included vocabularies for 54 medical and dental specialties. Developed by MacSpeech; acquired by Nuance Communications in 2010.
- MacSpeech Dictate Legal – Dictation product for Intel-based Macintosh with a vocabulary of legal terms. Developed by MacSpeech; acquired by Nuance Communications in 2010.
- MacSpeech Scribe – By Nuance Communications. Transcription product for automatically transcribing recorded dictation into text.
- iListen – Product from MacSpeech, developed and supported for PowerPC-based Macintosh until ca. 2009.
- Speakable items – Included with Mac OS ca. 1993 onward. Apple's speech synthesis and recognition technology is collectively called PlainTalk.
- ViaVoice – Product from IBM, developed and supported until ca. 2007.
- Voice Navigator - First voice control system for a graphical user interface, by Articulate Systems in 1989.
Mobile Devices / Smartphones
Many cell phone handsets have basic dial-by-voice features built in. Smartphones such as iPhone or Blackberry also support this. A number of 3rd party Apps have implemented natural language speech recognition support, including:
Windows 7 built-in speech recognition
The Windows Speech Recognition by Microsoft is the speech recognition system that comes built into Windows Vista and Windows 7. Windows Vista and Windows 7 include version 8.0 of the Microsoft speech recognition engine. Speech Recognition is available only in English, French, Spanish, German, Japanese, Simplified Chinese, and Traditional Chinese and only in the correspondingly version of Windows. That means that you can not use the French speech recognition engine if you use an English version of Windows.
Add-ons for Windows 7 speech recognition
- VoiceAttack - is used primarily by the gaming community to allow hands-free keyboard and mouse input in Windows 8, Windows 7, Windows Vista and Windows XP.
- VAC-Voice activated commands is a feature rich speech recognition solution for games. It works with Windows 7, Windows Vista and Windows XP.
- Voice Finger – software for Windows Vista and Windows 7 that improves the Windows speech recognition system by adding several extensions to accelerate and improve the mouse and keyboard control.
- WSRToolkit – adds dictionaries, macros and other features similar to Dragon
- Trigamtech – adds features for medical users similar to Dragon
- Vocola – a macro language
Windows 7 third-party speech recognition
- Auditory Sciences -- transcription software for captioning whatever someone says.
- Dragon NaturallySpeaking from Nuance Communications – Successor to the older DragonDictate product. Focus on dictation. 64-bit Windows support since version 10.1.
- Freesr Speech Recognition Software – Create voice interfaces for any application, window in an application, or website/webpage. Works with Windows Speech Recognition or as add-on to NaturallySpeaking.
- SpeechGear's Interact - combines speech recognition with language translation.
- Sonic Extractor from Digital Syphon – Supports 22 languages. Focus on broadcasting and telephony.
- SpeechMagic – Nuance Communications acquired Philips owned. Medical industry focus according to Frost & Sullivan. Standalone or embedded.
- Tazti - Create speech command interfaces to play PC based video games and control applications. Create speech commands to open files, folders, webpages, applications. Windows 7, Vista and XP versions.
- VoxCommando – Voice command utility for Windows Vista or later. It interfaces with various programs and devices to allow control of multimedia, communication, and home automation.
Windows XP or 2000 only
- e-Speaking – software for Windows XP that facilitates use of the Microsoft Speech API by adding ability to create commands to perform custom actions.
- Microsoft Speech API – Speech recognition functionality included as part of Microsoft Office and on Tablet PCs running Microsoft Windows XP Tablet PC Edition. It can also be downloaded as part of the Speech SDK 5.1 for Windows applications, but since that is aimed at developers building speech applications, the pure SDK form lacks any user interface, and thus is unsuitable for end users.
Programs for controlling a computer's screens and desktop applications with claps
- Clap Commander – Novel Human-Computer Interface, based on hand clap recognition idea. For remotely controlling your computer from another part of the room by clapping your hands. Windows XP, Windows Vista and Windows 7.
- Microsoft Kinect does include a built-in software which allow speech-recognition of commands.
- Older generation of Nokia phones like Nokia N97 series ( before using Windows 7 mobile technology) used speech-recognition with family names from contact list and other few commands.
Interactive voice response
- AT&T Watson
- CSLU Toolkit
- Genesys Telecommunications Laboratories
- HTK — copyrighted by Microsoft, but altering the software for the Licensee's internal use is allowed.
- iSpeech API
- Freesr Speech Recognition Software
- Loquendo ASR
- Verbio VoiceXML Voice XML
- LumenVox ASR
- MIRSK ASR
- Telisma ASR
- Nuance Recognizer ASR
- Rubidium Ltd. ASR
- Proteus Conversational Interface
- Simmortel Voice
- Tellme Networks (acquired by Microsoft)
- Parlance nameConnector
- Verbyx Inc
Unix-like x86 and x86_64 Speech Transcription Software
- SpeechWorks from Nuance Communications.
- Quack.com (acquired by AOL) The name has now been reused for an iPad search app.
- IBM ViaVoice – Embedded version still maintained by IBM. No longer supported for versions above Windows Vista. Untested above Mac OS X 10.4 or on Macintoshes with an Intel chipset.
- Game Commander 2 by Mindmaker. Gaming oriented voice recognition. Voice commands can be assigned to issue keystrokes and key combinations.
- "PowerSecretary Announcement".
- Philips SpeechMagic named European Technology Leader by Frost & Sullivan
- Dube, Ryan (2009-02-09). "Tazti – Search Web, Play Music & Control Programs using Voice". MakeUseOf. Retrieved 2013-06-03.