Kaldi (software)

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search
Kaldi
Developer(s)Daniel Povey and others
Stable release
Revision 3122 / October 2013; 6 years ago (2013-10)
Repositoryhttps://github.com/kaldi-asr/kaldi
Written inC++
Operating systemUnix systems (GNU/Linux, BSD, OSX 10.{8,9} etc.), Windows (via Cygwin)
TypeSpeech recognition
LicenseApache License v.2.0[1]
Websitekaldi-asr.org

Kaldi is an open-source speech recognition toolkit written in C++ for speech recognition and signal processing, freely available under the Apache License v2.0.

Kaldi aims to provide software that is flexible and extensible,[2] and is intended for use by automatic speech recognition (ASR) researchers for building a recognition system.

It supports linear transforms, MMI, boosted MMI and MCE discriminative training, feature-space discriminative training, and deep neural networks.[3]

Kaldi is capable of generating features like mfcc, fbank, fMLLR, etc. Hence in recent deep neural network research, a popular usage of Kaldi is to pre-process raw waveform into acoustic feature for end-to-end neural models.

Kaldi has been incorporated as part of the CHiME Speech Separation and Recognition Challenge over several successive events.[4][5][6] The software was initially developed as part of a 2009 workshop at Johns Hopkins University.[7]

Kaldi was the name of the Ethiopian goat herder who discovered the coffee plant, according to the Kaldi website.

See also[edit]

References[edit]

  1. ^ "Kaldi: Legal stuff". kaldi-asr.org.
  2. ^ "Kaldi: About the Kaldi project". kaldi-asr.org.
  3. ^ "Kaldi: Deep Neural Networks in Kaldi". kaldi-asr.org.
  4. ^ "The 4th CHiME Speech Separation and Recognition Challenge". Retrieved 15 February 2017.
  5. ^ "The 3rd CHiME Speech Separation and Recognition Challenge". Retrieved 15 February 2017.
  6. ^ Emmanuel Vincent, Jon Barker, Shinji Watanabe, Jonathan Le Roux, Francesco Nesta, et al.. The second 'CHiME' Speech Separation and Recognition Challenge: Datasets, tasks and baselines. ICASSP - 38th International Conference on Acoustics, Speech, and Signal Processing - 2013, May 2013, Vancouver, Canada. pp.126-130, 2013.
  7. ^ "History of the Kaldi project". Retrieved 26 July 2017.

External links[edit]