dc.contributor.author |
Potamianos, A |
en |
dc.contributor.author |
Maragos, P |
en |
dc.date.accessioned |
2014-03-01T01:15:12Z |
|
dc.date.available |
2014-03-01T01:15:12Z |
|
dc.date.issued |
1999 |
en |
dc.identifier.issn |
0167-6393 |
en |
dc.identifier.uri |
https://dspace.lib.ntua.gr/xmlui/handle/123456789/13380 |
|
dc.subject |
multiband demodulation |
en |
dc.subject |
energy separation algorithm |
en |
dc.subject |
AM-FM modulation model |
en |
dc.subject |
pitch tracking |
en |
dc.subject |
AM-FM vocoder |
en |
dc.subject |
speech synthesis |
en |
dc.subject.classification |
Acoustics |
en |
dc.subject.classification |
Communication |
en |
dc.subject.classification |
Computer Science, Interdisciplinary Applications |
en |
dc.subject.classification |
Language & Linguistics |
en |
dc.subject.other |
Algorithms |
en |
dc.subject.other |
Amplitude modulation |
en |
dc.subject.other |
Frequency modulation |
en |
dc.subject.other |
Mathematical models |
en |
dc.subject.other |
Resonance |
en |
dc.subject.other |
Signal filtering and prediction |
en |
dc.subject.other |
Speech |
en |
dc.subject.other |
Speech coding |
en |
dc.subject.other |
Speech synthesis |
en |
dc.subject.other |
AM-FM modulation model |
en |
dc.subject.other |
Multiband demodulation analysis |
en |
dc.subject.other |
Multiband filtering |
en |
dc.subject.other |
Pitch harmonics |
en |
dc.subject.other |
Speech analysis |
en |
dc.title |
Speech analysis and synthesis using an AM-FM modulation model |
en |
heal.type |
journalArticle |
en |
heal.identifier.primary |
10.1016/S0167-6393(99)00012-6 |
en |
heal.identifier.secondary |
http://dx.doi.org/10.1016/S0167-6393(99)00012-6 |
en |
heal.language |
English |
en |
heal.publicationDate |
1999 |
en |
heal.abstract |
In this paper, the AM-FM modulation model is applied to speech analysis, synthesis and coding. The AM-FM model represents the speech signal as the sum of formant resonance signals, each of which contains amplitude and frequency modulation. Multiband filtering and demodulation using the energy separation algorithm are the basic tools used for speech analysis. First, multiband demodulation analysis (MDA) is applied to the problem of fundamental frequency estimation using the average instantaneous frequencies as estimates of pitch harmonics. The MDA pitch tracking algorithm is shown to produce smooth and accurate fundamental frequency contours. Next, the AM-FM modulation vocoder is introduced, which represents speech as the sum of resonance signals. A time-varying filterbank is used to extract the formant bands and then the energy separation algorithm is used to demodulate the resonance signals into the amplitude envelope and instantaneous frequency signals. Efficient modeling and coding (at 4.8-9.6 kbits/sec) algorithms are proposed for the amplitude envelope and instantaneous frequency of speech resonances. Finally, the perceptual importance of modulations in speech resonances is investigated and it is shown that amplitude modulation patterns are both speaker and phone dependent. (C) 1999 Elsevier Science B.V. All rights reserved. |
en |
heal.publisher |
Elsevier Science Publishers B.V., Amsterdam, Netherlands |
en |
heal.journalName |
Speech Communication |
en |
dc.identifier.doi |
10.1016/S0167-6393(99)00012-6 |
en |
dc.identifier.isi |
ISI:000081568100002 |
en |
dc.identifier.volume |
28 |
en |
dc.identifier.issue |
3 |
en |
dc.identifier.spage |
195 |
en |
dc.identifier.epage |
209 |
en |