TIMIT

TIMIT is a corpus of phonemically and lexically transcribed speech of American English speakers of different sexes and dialects. Each transcribed element has been delineated in time.

TIMIT was designed to further acoustic-phonetic knowledge and automatic speech recognition systems. It was commissioned by DARPA and worked on by many sites, including Texas Instruments (TI) and Massachusetts Institute of Technology (MIT), hence the corpus' name.^[1] There is also a telephone bandwidth version called NTIMIT (Network TIMIT). TIMIT and NTIMIT are not freely available - either membership of the Linguistic Data Consortium, or a monetary payment, is required for access to the dataset.

References

^ Fisher, William M. (1986). The DARPA Speech Recognition Research Database: Specifications and Status. pp. 93–99. {{cite book}}: Unknown parameter |booktitle= ignored (help); Unknown parameter |coauthors= ignored (|author= suggested) (help)

External links

TIMIT Acoustic-Phonetic Continuous Speech Corpus

[1] Fisher, William M. (1986). The DARPA Speech Recognition Research Database: Specifications and Status. pp. 93–99. {{cite book}}: Unknown parameter |booktitle= ignored (help); Unknown parameter |coauthors= ignored (|author= suggested) (help)

[1]

v t e Corpus linguistics
Text corpora, English	American National Corpus Bank of English Bergen Corpus of London Teenage Language British National Corpus Brown Corpus Buckeye Corpus Cambridge English Corpus Corpus of Contemporary American English Enron Corpus EnTenTen International Corpus of English Lancaster-Oslo-Bergen Corpus Oxford English Corpus PropBank Spoken English Corpus Switchboard Telephone Speech Corpus TIMIT VerbNet Wellington Corpus of Spoken New Zealand English
Text corpora, non-English	Bijankhan Corpus CHILDES CorCenCC National Corpus of Contemporary Welsh Croatian Language Corpus Croatian National Corpus Czech National Corpus Europarl Corpus German Reference Corpus Hamshahri Corpus National Corpus of Polish Neo-Assyrian Text Corpus Project Persian Speech Corpus Quranic Arabic Corpus Russian National Corpus Scottish Corpus of Texts and Speech Slovenian National Corpus TalkBank Tatoeba Tehran Monolingual Corpus Tekstaro de Esperanto TenTen Corpus Family Thesaurus Linguae Graecae
Organizations	BNC consortium COBUILD Sketch Engine

References

See also

External links