期刊論文

學年 113
學期 2
出版(發表)日期 2025-04-25
作品名稱 MMDL: A Multi-Modal Deep Learning for Video Highlight Detection in Sports
作品名稱(其他語言)
著者 Q. Zhang; C. Y. Chang; S. J. Wu; H. C. Chang; D. S. Roy
單位
出版者
著錄名稱、卷期、頁數 International Journal of Multimedia Information Retrieval 14(18)
摘要 With the growing interest in sports events, the ability to capture highlights has become increasingly important. Traditionally, the process of editing these highlights required significant time and manpower. To address this challenge, this paper introduces an innovative multi-modal deep learning method for highlight detection (MMDL). The proposed MMDL integrates information from multiple modalities, including subtitles, static skeletal features, and video content, to gain a deep understanding of specific behaviors and identify sub-videos containing those highlights. Additionally, the proposed MMDL employed Siamese networks to accurately capture different aspects of behavior by comparing the similarity between input and training videos across different modalities. Experiments conducted on two datasets, MLB-YouTube and ELTA, demonstrate that the proposed MMDL significantly outperforms existing models, achieving at least a 5% improvement in F1-Score compared to the baseline models, such as I3D and NPL.
關鍵字
語言 en
ISSN 2192-662X
期刊性質 國外
收錄於 SCI
產學合作
通訊作者
審稿制度
國別 USA
公開徵稿
出版型式 ,電子版
相關連結

機構典藏連結 ( http://tkuir.lib.tku.edu.tw:8080/dspace/handle/987654321/128632 )