Compact Speech Features Based on Wavelet Transform and PCA with Application to Speaker Identification | |
---|---|
學年 | 91 |
學期 | 1 |
發表日期 | 2002-08-22 |
作品名稱 | Compact Speech Features Based on Wavelet Transform and PCA with Application to Speaker Identification |
作品名稱(其他語言) | |
著者 | Hsieh, Ching-Tang; Lai, Eugene; Chen, Wan-Chen; Wang, You-Chuang |
作品所屬單位 | 淡江大學電機工程學系 |
出版者 | |
會議名稱 | 第三屆國際中文口述語言處理研討會暨海峽兩岸口語語音處理論壇 |
會議地點 | 臺北縣, 臺灣 |
摘要 | The main goal of this paper is to find some effective methods to improve the performance of speaker identification system. In speaker identification, we use wavelet transform to decompose the speech signals into several frequency bands and then use cepstral coefficients to capture the individualities of vocal track within the interested bands based on the acoustic characteristic of human ear. In addition, an adaptive wavelet-based filtering mechanism is applied to eliminate the small variation of wavelet coefficients caused by noise. In order to effectively utilize all these multi-band speech features, we propose a modified vector quantization method called multi-layer eigen-codebook vector quantization (MLECVQ) as the identifier. This model uses the multi-layer concept to eliminate the interference between the multi-band coefficients and then uses the principal component analysis (PCA) method to evaluate the codebooks for capturing more details of phoneme character. Experimental results show that the proposed method is better than the GMM+MFCC model on computational cost and recognition performance under clean and noisy speech data evaluations. |
關鍵字 | 主元件分析;特徵辨識;特徵萃取;多頻帶特徵;小波轉換;Principal Component Analysis;Feature Recognition;Fecture Extraction;Multiband Feature;Wavelet Transform |
語言 | en |
收錄於 | |
會議性質 | 國際 |
校內研討會地點 | |
研討會時間 | 20020822~20020824 |
通訊作者 | |
國別 | TWN |
公開徵稿 | Y |
出版型式 | 紙本 |
出處 | 第三屆國際中文口述語言處理研討會暨海峽兩岸口語語音處理論壇論文集,頁165-168 |
相關連結 |
機構典藏連結 ( http://tkuir.lib.tku.edu.tw:8080/dspace/handle/987654321/96043 ) |