Babel program

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Dithridge at 18:08, 27 July 2017 (adding darpa category). It may differ significantly from the current revision.

The IARPA Babel program developed speech recognition technology for noisy telephone conversations. Its main goal was to improve keyword search performance on low-resource languages, i.e. languages with very little transcribed data. Data was collected from 26 languages, with some held out as "surprise" languages to test how rapidly the teams could build a system for a new language.[1]

Two industry-led teams (IBM and BBN) and two university-led teams (ICSI, led by Nelson Morgan, and CMU) participated.[2]

Some of the funding from Babel was used to further develop the Kaldi toolkit.[3] The speech data was later made available through the Linguistic Data Consortium.

References

  1. ^ Harper, Mary. "Data Resources to Support the Babel Program Intelligence Advanced Research Projects Activity" (PDF). Retrieved 26 July 2017.
  2. ^ "Babel". IARPA. Retrieved 26 July 2017.
  3. ^ "History of the Kaldi project". Retrieved 26 July 2017.