教師資料查詢 | 類別: 會議論文 | 教師: 謝景棠 Hsieh Ching-tang (瀏覽個人網頁)

標題:Compact Speech Features Based on Wavelet Transform and PCA with Application to Speaker Identification
學年91
學期1
發表日期2002/08/22
作品名稱Compact Speech Features Based on Wavelet Transform and PCA with Application to Speaker Identification
作品名稱(其他語言)
著者Hsieh, Ching-Tang; Lai, Eugene; Chen, Wan-Chen; Wang, You-Chuang
作品所屬單位淡江大學電機工程學系
出版者
會議名稱第三屆國際中文口述語言處理研討會暨海峽兩岸口語語音處理論壇
會議地點臺北縣, 臺灣
摘要The main goal of this paper is to find some effective methods to improve the performance of speaker identification system. In speaker identification, we use wavelet transform to decompose the speech signals into several frequency bands and then use cepstral coefficients to capture the individualities of vocal track within the interested bands based on the acoustic characteristic of human ear. In addition, an adaptive wavelet-based filtering mechanism is applied to eliminate the small variation of wavelet coefficients caused by noise. In order to effectively utilize all these multi-band speech features, we propose a modified vector quantization method called multi-layer eigen-codebook vector quantization (MLECVQ) as the identifier. This model uses the multi-layer concept to eliminate the interference between the multi-band coefficients and then uses the principal component analysis (PCA) method to evaluate the codebooks for capturing more details of phoneme character. Experimental results show that the proposed method is better than the GMM+MFCC model on computational cost and recognition performance under clean and noisy speech data evaluations.
關鍵字主元件分析;特徵辨識;特徵萃取;多頻帶特徵;小波轉換;Principal Component Analysis;Feature Recognition;Fecture Extraction;Multiband Feature;Wavelet Transform
語言英文
收錄於
會議性質國際
校內研討會地點
研討會時間20020822~20020824
通訊作者
國別中華民國
公開徵稿Y
出版型式紙本
出處第三屆國際中文口述語言處理研討會暨海峽兩岸口語語音處理論壇論文集頁165-168
相關連結
Google+ 推薦功能,讓全世界都能看到您的推薦!