Please use this identifier to cite or link to this item:
http://repo.lib.jfn.ac.lk/ujrr/handle/123456789/2142
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Thiruvaran, T. | |
dc.contributor.author | Ambikairajah, E. | |
dc.contributor.author | Epps, J. | |
dc.date.accessioned | 2021-03-26T03:20:07Z | |
dc.date.accessioned | 2022-06-27T10:01:59Z | - |
dc.date.available | 2021-03-26T03:20:07Z | |
dc.date.available | 2022-06-27T10:01:59Z | - |
dc.date.issued | 2006 | |
dc.identifier.citation | Thiruvaran, T., Ambikairajah, E., & Epps, J. (2006, December). Speaker identification using FM features. In 11th Australian International Conference on Speech Science and Technology (pp. 148-152). | en_US |
dc.identifier.uri | http://repo.lib.jfn.ac.lk/ujrr/handle/123456789/2142 | - |
dc.description.abstract | The AM-FM modulation model of speech is a nonlinear model that has been used successfully in several branches of speech-related research. However, the significance of the AM-FM features extracted from this model has not been fully explored in applications such as speaker identification. This paper shows that frequency modulation (FM) features can improve speaker identification accuracy. Because the amplitude modulation (AM) feature is similar to the conventional Mel-frequency cepstral coefficients (MFCC), this paper focuses mainly on the FM feature. The correlation between FM feature components is shown to be very small compared with that of Mel filterbank log energies, which reduces the need for decorrelation. FM feature components are also shown to be very nearly Gaussian distributed. Further, speech synthesis using AM-FM features is performed to compare four existing AM-FM demodulation methods based on the perceptual quality of the synthesized speech. Of these, the Digital Energy Separation Algorithm (DESA) gives the best synthesized speech and is thus used as the front-end of our speaker identification system. Evaluation of speaker identification using FM features on the NIST 2001 database shows a relative improvement in speaker identification accuracy of 2% for male speakers and 9% for female speakers over the conventional MFCC-based front-end. | en_US |
dc.language.iso | en | en_US |
dc.title | Speaker Identification using FM Features | en_US |
dc.type | Article | en_US |
Appears in Collections: | Electrical & Electronic Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Speaker Identification using FM Features.pdf | | 42.56 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.