
Semantic audio

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Kvng (talk | contribs) at 14:20, 21 May 2015 (simplify). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Semantic audio is the extraction of symbols or meaning from an audio stream. Speech recognition is an important semantic audio application. For speech, other semantic operations include identifying the language, the speaker, or the speaker's gender. For more general audio or music, semantic audio includes identifying a piece of music (e.g. Shazam) or a movie soundtrack.

Areas of research in semantic audio include labeling an audio waveform with where the harmonies change and what they are, where material is repeated, and which instruments are playing.
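
As a rough illustration of what labeling a waveform can mean, the following Python sketch assigns a note name to each short frame of a synthetic signal and reports the times at which the label changes. It is a toy under stated assumptions, not a method described by the article: the 8 kHz sample rate, the 0.1 second frame length, the candidate note range, and all function names are hypothetical choices, and the input is a pure sine tone rather than real music. Real harmony or instrument labeling is considerably harder.

```python
import math

SAMPLE_RATE = 8000  # Hz; assumed for this sketch
NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def sine(freq, seconds):
    """Synthesize a pure tone as a list of samples (stand-in for real audio)."""
    n = int(SAMPLE_RATE * seconds)
    return [math.sin(2 * math.pi * freq * i / SAMPLE_RATE) for i in range(n)]


def goertzel_power(frame, freq):
    """Signal power at a single frequency, computed with the Goertzel algorithm."""
    coeff = 2 * math.cos(2 * math.pi * freq / SAMPLE_RATE)
    s1 = s2 = 0.0
    for x in frame:
        s0 = x + coeff * s1 - s2
        s2, s1 = s1, s0
    return s1 * s1 + s2 * s2 - coeff * s1 * s2


def midi_to_freq(midi):
    """Equal-tempered frequency of a MIDI note number (A4 = 69 = 440 Hz)."""
    return 440.0 * 2 ** ((midi - 69) / 12)


def label_frame(frame, midi_range=range(48, 84)):
    """Pick the candidate note with the most power in this frame."""
    best = max(midi_range, key=lambda m: goertzel_power(frame, midi_to_freq(m)))
    return NOTE_NAMES[best % 12] + str(best // 12 - 1)


def label_changes(samples, frame_len=800):
    """Return (time_in_seconds, label) for each frame where the label changes."""
    changes = []
    previous = None
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        label = label_frame(samples[start:start + frame_len])
        if label != previous:
            changes.append((start / SAMPLE_RATE, label))
            previous = label
    return changes


# Half a second of A4 followed by half a second of C5.
signal = sine(440.0, 0.5) + sine(523.25, 0.5)
print(label_changes(signal))
```

For this synthetic input the sketch should report a change to A4 at 0.0 seconds and a change to C5 at 0.5 seconds. The per-frame labels play the role of the extracted symbols, and the change points correspond loosely to "where the harmonies change" in the description above.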