Journal Article

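Academic year 102
Semester 1
Publication date 2013-09-01
Title 應用切片逆迴歸法於區間型符號資料之維度縮減
Title (other language) The Application of Sliced Inverse Regression for Dimension Reduction of the Interval-Valued Symbolic Data
Authors 陳業勛; 吳漢銘
Affiliation Department of Mathematics, Tamkang University (淡江大學數學學系)
Publisher Taipei: 中國統計學社 (Chinese Statistical Association)
Journal, volume (issue), pages 中國統計學報=Journal of the Chinese Statistical Association 51(3), pp. 327-351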
Abstract Sliced inverse regression (SIR), introduced by Li (1991), finds effective dimension reduction directions for exploring the intrinsic structure of high-dimensional data. For regression with a univariate response, SIR has been extended and applied to various data types, including survival data, time series data, functional data, and longitudinal data. This study extends SIR to interval-valued symbolic data. First, the interval-valued data are transformed into a conventional data matrix using the vertices method or the centers method; the classical SIR algorithm is then applied directly to the transformed data. Simulation results show that different slicing schemes produce different projection directions and different low-dimensional visualizations. A suitable slicing scheme is therefore needed to correctly investigate the structure and information embedded in high-dimensional interval-valued symbolic data through low-dimensional plots. These results motivated us to adopt cluster-based SIR to improve the implementation of symbolic SIR. We compare the results with those obtained from several existing symbolic dimension reduction techniques (such as symbolic principal component analysis) and evaluate their discriminative and visualization performance in the low-dimensional space.
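The two steps named in the abstract (a centers-method transformation followed by classical SIR) can be sketched roughly as below. This is an illustrative sketch only, not the authors' implementation: the function names `centers` and `sir_directions`, the equal-count slicing of a sorted numeric response, and the default slice count are all assumptions made for the example. The vertices method and the cluster-based slicing studied in the paper are not shown.

```python
import numpy as np


def centers(lower, upper):
    """Centers method: replace each interval [lower, upper] by its midpoint."""
    lower = np.asarray(lower, dtype=float)
    upper = np.asarray(upper, dtype=float)
    return (lower + upper) / 2.0


def sir_directions(X, y, n_slices=5, n_dirs=2):
    """Classical SIR on an (n, p) data matrix X with a numeric response y.

    Assumption for this sketch: slices are equal-count groups of the
    observations sorted by y. Returns a (p, n_dirs) array of directions.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n, p = X.shape

    # Standardize X so that its sample covariance becomes the identity.
    mean = X.mean(axis=0)
    cov = np.atleast_2d(np.cov(X, rowvar=False))
    evals, evecs = np.linalg.eigh(cov)
    inv_sqrt = evecs @ np.diag(evals ** -0.5) @ evecs.T
    Z = (X - mean) @ inv_sqrt

    # Weighted covariance matrix of the slice means of Z.
    order = np.argsort(y, kind="stable")
    M = np.zeros((p, p))
    for idx in np.array_split(order, n_slices):
        slice_mean = Z[idx].mean(axis=0)
        M += (len(idx) / n) * np.outer(slice_mean, slice_mean)

    # Leading eigenvectors of M, mapped back to the original X scale.
    _, vecs = np.linalg.eigh(M)          # eigenvalues in ascending order
    top = vecs[:, ::-1][:, :n_dirs]      # reorder to descending, keep top
    return inv_sqrt @ top
```

Given arrays `lower` and `upper` of interval endpoints with shape (n, p), one would call `sir_directions(centers(lower, upper), y)` and project the centers onto the returned directions for plotting. As the abstract points out, the resulting directions depend on the slicing scheme, so the equal-count choice here should be read as one option among several.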
Keywords data visualization; inverse regression; sufficient dimension reduction; symbolic data analysis; symbolic principal component analysis
Language zh_TW
ISSN 0529-6528
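Journal type Domestic
Indexed in
Industry-academia collaboration
Corresponding author 吳漢銘
Review system
Country TWN
Open call for papers
Publication format Print
Related links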

Institutional repository link ( http://tkuir.lib.tku.edu.tw:8080/dspace/handle/987654321/92415 )
