TIMIT

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Kri (talk | contribs) at 16:50, 7 February 2016 (New section: See also + added Comparison of datasets in machine learning + renamed other sections). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

TIMIT is a corpus of phonemically and lexically transcribed speech of American English speakers of different sexes and dialects. Each transcribed element has been delineated in time.

TIMIT was designed to further acoustic-phonetic knowledge and automatic speech recognition systems. It was commissioned by DARPA and worked on by many sites, including Texas Instruments (TI) and Massachusetts Institute of Technology (MIT), hence the corpus' name.[1] There is also a telephone bandwidth version called NTIMIT (Network TIMIT). TIMIT and NTIMIT are not freely available - either membership of the Linguistic Data Consortium, or a monetary payment, is required for access to the dataset.

References

  1. ^ Fisher, William M. (1986). The DARPA Speech Recognition Research Database: Specifications and Status. pp. 93–99. {{cite book}}: Unknown parameter |booktitle= ignored (help); Unknown parameter |coauthors= ignored (|author= suggested) (help)

See also

External links