Speech analysis and synthesis using an AM-FM modulation model

Potamianos, A; Maragos, P

dc.contributor.author	Potamianos, A	en
dc.contributor.author	Maragos, P	en
dc.date.accessioned	2014-03-01T01:15:12Z
dc.date.available	2014-03-01T01:15:12Z
dc.date.issued	1999	en
dc.identifier.issn	0167-6393	en
dc.identifier.uri	https://dspace.lib.ntua.gr/xmlui/handle/123456789/13380
dc.subject	multiband demodulation	en
dc.subject	energy separation algorithm	en
dc.subject	AM-FM modulation model	en
dc.subject	pitch tracking	en
dc.subject	AM-FM vocoder	en
dc.subject	speech synthesis	en
dc.subject.classification	Acoustics	en
dc.subject.classification	Communication	en
dc.subject.classification	Computer Science, Interdisciplinary Applications	en
dc.subject.classification	Language & Linguistics	en
dc.subject.other	Algorithms	en
dc.subject.other	Amplitude modulation	en
dc.subject.other	Frequency modulation	en
dc.subject.other	Mathematical models	en
dc.subject.other	Resonance	en
dc.subject.other	Signal filtering and prediction	en
dc.subject.other	Speech	en
dc.subject.other	Speech coding	en
dc.subject.other	Speech synthesis	en
dc.subject.other	AM-FM modulation model	en
dc.subject.other	Multiband demodulation analysis	en
dc.subject.other	Multiband filtering	en
dc.subject.other	Pitch harmonics	en
dc.subject.other	Speech analysis	en
dc.title	Speech analysis and synthesis using an AM-FM modulation model	en
heal.type	journalArticle	en
heal.identifier.primary	10.1016/S0167-6393(99)00012-6	en
heal.identifier.secondary	http://dx.doi.org/10.1016/S0167-6393(99)00012-6	en
heal.language	English	en
heal.publicationDate	1999	en
heal.abstract	In this paper, the AM-FM modulation model is applied to speech analysis, synthesis and coding. The AM-FM model represents the speech signal as the sum of formant resonance signals each of which contains amplitude and frequency modulation. Multiband filtering and demodulation using the energy separation algorithm are the basic tools used for speech analysis, First, multiband demodulation analysis OC IDA) is applied to the problem of fundamental frequency estimation using the average instantaneous frequency as estimates of pitch harmonics. The MDA pitch tracking algorithm is shown to produce smooth and accurate fundamental frequency contours. Next, the AM-FM modulation vocoder is introduced, which represents speech as the sum of resonance signals. A time-varying filterbank is used to extract the formant bands and then the energy separation algorithm is used to demodulate the resonance signals into the amplitude envelope and instantaneous frequency signals. Efficient modeling and coding (at 4.8-9.6 kbits/sec) algorithms are proposed for the amplitude envelope and instantaneous frequency of speech resonances. Finally, the perceptual importance of modulations in speech resonances is investigated and it is shown that amplitude modulation patterns are both speaker and phone dependent. (C) 1999 Elsevier Science B.V. All rights reserved.	en
heal.publisher	Elsevier Science Publishers B.V., Amsterdam, Netherlands	en
heal.journalName	Speech Communication	en
dc.identifier.doi	10.1016/S0167-6393(99)00012-6	en
dc.identifier.isi	ISI:000081568100002	en
dc.identifier.volume	28	en
dc.identifier.issue	3	en
dc.identifier.spage	195	en
dc.identifier.epage	209	en