Compact Speech Features Based on Wavelet Transform and PCA with Application to Speaker Identification
學年 91
學期 1
發表日期 2002-08-22
作品名稱 Compact Speech Features Based on Wavelet Transform and PCA with Application to Speaker Identification
作品名稱(其他語言)
著者 Hsieh, Ching-Tang; Lai, Eugene; Chen, Wan-Chen; Wang, You-Chuang
作品所屬單位 淡江大學電機工程學系
出版者
會議名稱 第三屆國際中文口述語言處理研討會暨海峽兩岸口語語音處理論壇
會議地點 臺北縣, 臺灣
摘要 The main goal of this paper is to find some effective methods to improve the performance of speaker identification system. In speaker identification, we use wavelet transform to decompose the speech signals into several frequency bands and then use cepstral coefficients to capture the individualities of vocal track within the interested bands based on the acoustic characteristic of human ear. In addition, an adaptive wavelet-based filtering mechanism is applied to eliminate the small variation of wavelet coefficients caused by noise. In order to effectively utilize all these multi-band speech features, we propose a modified vector quantization method called multi-layer eigen-codebook vector quantization (MLECVQ) as the identifier. This model uses the multi-layer concept to eliminate the interference between the multi-band coefficients and then uses the principal component analysis (PCA) method to evaluate the codebooks for capturing more details of phoneme character. Experimental results show that the proposed method is better than the GMM+MFCC model on computational cost and recognition performance under clean and noisy speech data evaluations.
關鍵字 主元件分析;特徵辨識;特徵萃取;多頻帶特徵;小波轉換;Principal Component Analysis;Feature Recognition;Fecture Extraction;Multiband Feature;Wavelet Transform
語言 en
收錄於
會議性質 國際
校內研討會地點
研討會時間 20020822~20020824
通訊作者
國別 TWN
公開徵稿 Y
出版型式 紙本
出處 第三屆國際中文口述語言處理研討會暨海峽兩岸口語語音處理論壇論文集,頁165-168
相關連結

機構典藏連結 ( http://tkuir.lib.tku.edu.tw:8080/dspace/handle/987654321/96043 )

機構典藏連結