期刊論文
| 學年 | 114 |
|---|---|
| 學期 | 1 |
| 出版(發表)日期 | 2025-08-20 |
| 作品名稱 | Toward Interpretable Multimodal Violence Detection with Knowledge Distillation and Modality-Aligned Preprocessing |
| 作品名稱(其他語言) | |
| 著者 | W. D. Jiang; C. Y. Chang; M. Y. Su; Y. S. Lee; D. S. Roy |
| 單位 | |
| 出版者 | |
| 著錄名稱、卷期、頁數 | IEEE Transactions on Systems, Man and Cybernetics: Systems 55(9), p. 6215-6228 |
| 摘要 | Social violence presents a compelling challenge to public safety, yet existing multimodal detection systems exhibit excessive reliance on RGB image semantics and opaque decision-making processes. Despite leveraging visual and auditory data, current models demonstrate RGB bias in feature prioritization, as evidenced by explainability analyzes, thereby limiting their generalization for behavioral understanding. Additionally, modality inconsistency and inefficient fusion mechanisms impair model transparency and training stability. To bridge these gaps, this study proposes modality-aligned preprocessing (VAJ) that structurally unifies visual-auditory features through conflict resolution and input optimization, explicitly suppressing color dominance while enhancing interpretable feature representations. Complementing this, we design DTVDS, an interpretable detection framework integrating knowledge distillation to transfer distilled behavioral insights from a cumbersome teacher network to an efficient student model. This dual strategy not only addresses computational overhead but also clarifies decision logic through simplified inference pathways. Evaluations on XD-Violence and UCF-Crime benchmarks demonstrate superior performance, with AP (89.64%) and AUC (88.35%) outperforming existing methods. Qualitative evaluations further validate interpretability, revealing modality-coherent attention maps and human-aligned rationale visualization. The proposed method advances violence detection by addressing persistent shortcomings in multimodal alignment and model explainability. |
| 關鍵字 | |
| 語言 | en |
| ISSN | 2168-2232 |
| 期刊性質 | 國外 |
| 收錄於 | SCI |
| 產學合作 | |
| 通訊作者 | |
| 審稿制度 | 否 |
| 國別 | USA |
| 公開徵稿 | |
| 出版型式 | ,電子版 |
| 相關連結 |
機構典藏連結 ( http://tkuir.lib.tku.edu.tw:8080/dspace/handle/987654321/128634 ) |