期刊論文

翁慶昌 WONG CHING-CHANG

個人網頁

Visual object recognition and pose estimation based on a deep semantic segmentation network
學年	107
學期	1
出版（發表）日期	2018-11-15
作品名稱	Visual object recognition and pose estimation based on a deep semantic segmentation network
作品名稱（其他語言）
著者	Chien-Ming Lin; Chi-Yi Tsai; Yu-Cheng Lai; Shin-An Li; Ching-Chang Wong
單位
出版者
著錄名稱、卷期、頁數	IEEE Sensors Journal 18(22), p.1-11
摘要	In recent years, deep learning-based object recognition algorithms become emerging in robotic vision applications. This paper addresses the design of a novel deep learning-based visual object recognition and pose estimation system for a robot manipulator to handle random object picking tasks. The proposed visual control system consists of a visual perception module, an object pose estimation module, a data argumentation module, and a robot manipulator controller. The visual perception module combines deep convolution neural networks (CNNs) and a fully connected conditional random field layer to realize an image semantic segmentation function, which can provide stable and accurate object classification results in cluttered environments. The object pose estimation module implements a model-based pose estimation method to estimate the 3D pose of the target for picking control. In addition, the proposed data argumentation module automatically generates training data for training the deep CNN. Experimental results show that the proposed scene segmentation method used in the data argumentation module reaches a high accuracy rate of 97.10% on average, which is higher than other state-of-the-art segment methods. Moreover, with the proposed data argumentation module, the visual perception module reaches an accuracy rate over than 80% and 72% in the case of detecting and recognizing one object and three objects, respectively. In addition, the proposed model-based pose estimation method provides accurate 3D pose estimation results. The average translation and rotation errors in the three axes are all smaller than 0.52 cm and 3.95 degrees, respectively. These advantages make the proposed visual control system suitable for applications of random object picking and manipulation.
關鍵字	Pose estimation;Three-dimensional displays;Robots;Visual perception;Image segmentation;Object recognition;Semantics
語言	en_US
ISSN	1530-437X
期刊性質	國內
收錄於
產學合作
通訊作者
審稿制度	否
國別	TWN
公開徵稿
出版型式	,電子版
相關連結	機構典藏連結 ( http://tkuir.lib.tku.edu.tw:8080/dspace/handle/987654321/115395 )

Tamkang University Teacher ePortfolio System - All Rights Reserved © by OIS, TKU

系統維護: 資訊處曾江安
聯絡電話: (02)26215656 分機 3484

GoogleAnalytics

教師歷程問與答:
Q: 開放給何種身份使用?
A: 目前開放給淡江大學教師(含專、兼任)使用。
Q: 資料不完整(正確)?
A: 教師歷程系統介接自七大系統，並包含某些「CSV匯入」；分別有不同的資料交換方式與頻率。。
Q: 我無法登入?
A: 請確認您的單一登入帳號密碼無誤。若需進一步協助，請洽詢資訊處數位設計組分機:2897

適用以下瀏覽器: Microsoft IE8 / IE9 Mozilla Firefox Google Chrome Apple Safari

期刊論文 翁慶昌 WONG CHING-CHANG 個人網頁

Tamkang University Teacher ePortfolio System - All Rights Reserved © by OIS, TKU

期刊論文

翁慶昌 WONG CHING-CHANG

個人網頁