Spectral modeling synthesis

From Wikipedia, the free encyclopedia
Jump to: navigation, search
Spectral modeling synthesis (based on Curtis Roads 1996, p. 153)

Spectral modeling synthesis or simply SMS is an acoustic modeling approach for speech and other signals.

SMS considers sounds as a combination of harmonic content and noise content. Harmonic components are identified based on peaks in the frequency spectrum of the signal, normally as found by the short-time Fourier transform. The signal that remains following removal of the spectral components, sometimes referred to as the residual, is then modeled as white noise passed through a time-varying filter. The output of the model, then, are the frequencies and levels of the detected harmonic components and the coefficients of the time-varying filter.

Intuitively, the model can be applied to many types of audio signals. Speech signals, for example, include slowly changing harmonic sounds caused by vibration of the vocal cords plus wideband, noise-like sounds caused by the lips and mouth. Musical instruments also produce sounds containing both harmonic components and percussive, noise-like sounds when the notes are struck or changed.

See also[edit]


Xavier Serra (2003). "Spectral Modeling Synthesis: Past and Present" (PDF). p. 20. Retrieved May 11, 2010. 

Xavier Serra. "Spectral Modeling Synthesis Tools". Retrieved May 11, 2010. 

Julius O. Smith III (28 December 2005). "Spectral Modeling". Retrieved April 19, 2008.