AI-Based Multimodal Anomaly Detection for Industrial Machine Operations
學年 113
學期 2
出版(發表)日期 2025-03-15
作品名稱 AI-Based Multimodal Anomaly Detection for Industrial Machine Operations
作品名稱(其他語言)
著者 Qiaoyun Zhang,Hsiang-Chuan Chang,Chia-Ling Ho,Huan-Chao Keh,Diptendu Sinha Roy,
單位
出版者
著錄名稱、卷期、頁數 vol. 26, no 2, pp. 255-264
摘要 In the manufacturing process involving grinding wheels, challenges in fine-tuning grinding machines are typically addressed by craftsmen through subjective observations of sparks and sounds. However, most current anomaly detection methods mainly aim at a single modality, whereas existing multimodal methods cannot effectively cope with a common issue. To address this, this paper introduces an innovative mechanism, AI-Based Multimodal Anomaly Detection (AMAD), designed to optimize the efficiency and accuracy of grinding wheel production lines. The proposed AMAD includes data preprocessing and multimodal anomaly detection, accurately identifying anomalies in grinding wheel operation videos. In the data preprocessing phase, the proposed AMAD utilizes Mel Frequency Cepstral Coefficients (MFCC) and AutoEncoder for audio processing and segmentation for video processing. In the multimodal anomaly detection phase, the proposed AMAD employs Convolutional Neural Networks (CNN) for audio analysis and Convolutional Long Short-Term Memory (ConvLSTM) for video analysis. By combining both audio and video modalities, the proposed AMAD effectively predicts whether the input video represents normal or abnormal grinding wheel operations. This multimodal approach not only improves the accuracy of anomaly detection but also enhances the robustness of the system. Simulation results demonstrate that the proposed AMAD significantly improves performance in anomaly detection in terms of precision, recall, and F1-Score.
關鍵字 MFCC, ConvLSTM, CNN, Anomaly Detection
語言 en
ISSN 1607-9264
期刊性質 國內
收錄於
產學合作
通訊作者
審稿制度
國別 TWN
公開徵稿
出版型式 ,電子版