Speech recognition in Linux
Native Linux speech recognition 
Current development status 
Recently, there has been a push to develop a high-quality native Linux speech recognition engine, and numerous projects dedicated to creating Linux speech recognition solutions have been established as a result. One major hurdle is compiling a speech corpus large enough to produce acoustic models. In response, VoxForge was set up to collect transcribed speech, released under the GPL, for use with free and open-source speech recognition engines.
Speech recognition concept 
First, record an audio stream on the Linux machine. There are then two options:
- process the speech recognition on the local machine, or
- submit the audio file to a remote server, which converts it into a text string.
The second option, used for example by OpenMoko Speech Recognition, is found mainly on smartphones, which often lack the processing power and disk space to run speech recognition on the device.
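The two options can be illustrated with a minimal Python sketch. It is not taken from any of the projects listed in this article: `record_silence`, `transcribe`, and the `Recognizer` type are hypothetical names, the "recording" is a stand-in that writes silence instead of capturing a microphone, and the local and remote recognizers are passed in as plain functions because the real engine or server API depends on the software chosen.

```python
import wave
from typing import Callable, Optional

# Assumption: a recognizer is any function that takes the path of a WAV file
# and returns the transcribed text.
Recognizer = Callable[[str], str]


def record_silence(path: str, seconds: float = 1.0, rate: int = 16000) -> None:
    """Stand-in for microphone capture: write mono 16-bit silence to a WAV file."""
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)       # mono
        wav.setsampwidth(2)       # 2 bytes per sample = 16-bit audio
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * int(seconds * rate))


def transcribe(path: str,
               local: Optional[Recognizer] = None,
               remote: Optional[Recognizer] = None) -> str:
    """Use the local engine when one is available, otherwise the remote server."""
    if local is not None:
        return local(path)        # option 1: recognition on this machine
    if remote is not None:
        return remote(path)       # option 2: upload the file, get text back
    raise RuntimeError("no speech recognizer configured")
```

A device without a local engine would call `transcribe(path, remote=upload_and_decode)`, where `upload_and_decode` is whatever function wraps the chosen server.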
Free speech recognition engines 
The following is a list of current projects dedicated to implementing speech recognition in Linux, as well as major native solutions:
- CMU Sphinx is a general term to describe a group of speech recognition systems developed at Carnegie Mellon University.
- Julius is a high-performance, two-pass large vocabulary continuous speech recognition (LVCSR) decoder software for speech-related researchers and developers.
- Speech uses Google's speech recognition engine to support dictation in many different languages.
- Speech Control is a Qt-based application that uses CMU Sphinx tools such as SphinxTrain and PocketSphinx to provide speech recognition utilities such as desktop control, dictation, and transcription on the Linux desktop.
- Platypus is an open-source shim that allows Dragon NaturallySpeaking running under Wine to work with any Linux X11 application.
- Vedics is a speech assistant for the GNOME environment.
- Xvoice (requires ViaVoice to function)
- GnomeVoiceControl is a dialogue system to control the GNOME Desktop that was developed in the Google Summer of Code in 2007.
- CVoiceControl is a KDE- and X Window-independent version of its predecessor, KVoiceControl.
- SphinxKeys lets you essentially type keyboard keys and mouse clicks by speaking into your microphone. It's simple but works pretty much out of the box.
- Open Mind Speech, a part of the Open Mind Initiative, aims to develop free (GPL) speech recognition tools and applications, as well as collect speech data.
- PerlBox is a Perl-based tool for voice control and speech output.
- VoxForge is a free speech corpus and acoustic model repository for open source speech recognition engines.
- Simon aims to be flexible enough to compensate for dialects or even speech impairments. It requires HTK and Julius.
It is possible, though complicated, for advanced developers to create Linux speech recognition software by using existing packages derived from open-source projects.
Proprietary speech recognition engines 
- Wizzscribe SI is a commercial speech recognition server for Linux, launched by Wizzard Software in 2006.
- Verbio ASR is a commercial speech recognition server for Linux and Windows platforms.
- DynaSpeak, from SRI International, is a speaker-independent speech recognition software development kit that scales from small- to large-scale systems, for use in commercial, consumer, and military applications.
- Janus Recognition Toolkit (JRTk) is a closed-source speech recognition toolkit, mainly targeted at Linux, developed by the Interactive Systems Laboratories at Carnegie Mellon University and Karlsruhe Institute of Technology. Commercial and research licenses are available.
- LumenVox Speech Engine is a commercial library for Linux and Windows for inclusion in other software. It has been integrated into the Asterisk private branch exchange system.
Voice control and keyboard shortcuts 
Speech recognition usually refers to software that attempts to distinguish thousands of words in a human language. Voice control may refer to software used for sending operational commands to a computer or appliance. Voice control typically requires a much smaller vocabulary and thus is much easier to implement.
Simple software combined with keyboard shortcuts has the earliest potential for practically accurate voice control in Linux.
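Because a voice control vocabulary is small, mapping a recognized phrase to a keyboard shortcut can be very simple. The following Python sketch shows one possible approach and is not the method of any project named in this article. The `COMMANDS` table and both function names are hypothetical. Fuzzy matching uses the standard library's `difflib`. The last function only builds an `xdotool key` command line and does not run it, on the assumption that the `xdotool` utility is installed on an X11 desktop.

```python
import difflib
from typing import List, Optional

# Hypothetical command vocabulary: spoken phrase -> keyboard shortcut.
COMMANDS = {
    "copy": "ctrl+c",
    "paste": "ctrl+v",
    "close window": "alt+F4",
    "switch window": "alt+Tab",
}


def match_command(transcript: str, cutoff: float = 0.7) -> Optional[str]:
    """Return the shortcut for the closest known phrase, or None if nothing is close."""
    phrase = transcript.strip().lower()
    # Tolerate small recognition errors by accepting near matches above the cutoff.
    matches = difflib.get_close_matches(phrase, list(COMMANDS), n=1, cutoff=cutoff)
    return COMMANDS[matches[0]] if matches else None


def xdotool_argv(shortcut: str) -> List[str]:
    """Build the command line that would send the shortcut (assumes xdotool is installed)."""
    return ["xdotool", "key", shortcut]
```

With only a handful of phrases to tell apart, a slightly misrecognized transcript such as "close windows" still resolves to the intended command, while unrelated speech falls below the cutoff and is ignored.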
Running Windows speech recognition software with Linux 
Using a compatibility layer 
Using virtualized Windows 
It is also possible to use Windows speech recognition software under Linux. No-cost virtualization software can run Windows and NaturallySpeaking on a Linux host. VMware Server and VirtualBox support copy and paste to and from a virtual machine, so dictated text can easily be transferred between the virtual machine and the host.
See also 
- Speech Control
- Open Mind Speech
- Open Mind Initiative
- Wizzscribe SI
- Verbio ASR
- "Speech Recognition Software - LumenVox". Retrieved 2013-02-28.
- Speech-to-text software by Vocapia
- Dragon NaturallySpeaking - Wine Application Database