Fusing RGB and Depth with Self-attention for Unseen Object Segmentation
- Abstract
- We present Synthetic RGB-D Fusion Mask R-CNN (SF Mask R-CNN) for unseen object instance segmentation. Our key idea is to fuse RGB and depth with a learnable spatial attention estimator, named the Self-Attention-based Confidence map Estimator (SACE), at four scales on top of a category-agnostic instance segmentation model. We pre-trained SF Mask R-CNN on a large synthetic dataset and evaluated it on the public WISDOM dataset after fine-tuning on only a small amount of real-world data. Our experiments show that SACE achieves state-of-the-art performance in unseen object segmentation. We also compared feature maps while varying the input modality and fusion method, showing that SACE helps the model learn distinctive object-related features. The code, dataset, and models are available at https://github.com/gist-ailab/SF-Mask-RCNN
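The abstract's core idea, fusing RGB and depth features with a learned spatial confidence map, can be sketched at a single scale as follows. This is a minimal illustration, not the authors' implementation: the 1x1-convolution weights `w`, `b`, the sigmoid confidence map, and the convex blend are assumptions standing in for the full SACE module.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def confidence_fusion(f_rgb, f_depth, w, b):
    """Single-scale sketch of confidence-map fusion.

    A learned map (here a 1x1 'convolution' with weights w and bias b
    over the channel-concatenated features) yields a per-pixel
    confidence c in (0, 1); the fused feature is a confidence-weighted
    blend of the RGB and depth features.
    f_rgb, f_depth: (C, H, W); w: (2C,); b: scalar.
    """
    stacked = np.concatenate([f_rgb, f_depth], axis=0)      # (2C, H, W)
    logits = np.tensordot(w, stacked, axes=([0], [0])) + b  # (H, W)
    c = sigmoid(logits)                                     # confidence map
    return c[None] * f_rgb + (1.0 - c[None]) * f_depth      # (C, H, W)

# Toy example with random features and weights (illustrative only).
rng = np.random.default_rng(0)
C, H, W = 4, 8, 8
f_rgb = rng.normal(size=(C, H, W))
f_depth = rng.normal(size=(C, H, W))
w, b = rng.normal(size=2 * C), 0.0
fused = confidence_fusion(f_rgb, f_depth, w, b)
print(fused.shape)  # (4, 8, 8)
```

Because the blend is convex, each fused value lies between the corresponding RGB and depth feature values; in SF Mask R-CNN this fusion is applied at four feature scales rather than one.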
- Author(s)
- Lee, Joosoon; Back, Seunghyeok; Kim, Taewon; Shin, Sungho; Noh, Sangjun; Kang, Raeyoung; Kim, Jongwon; Lee, Kyoobin
- Issued Date
- 2021-10-12
- Type
- Conference Paper
- DOI
- 10.23919/iccas52745.2021.9649991
- URI
- https://scholar.gist.ac.kr/handle/local/22028
- Publisher
- IEEE
- Citation
- 2021 21st International Conference on Control, Automation and Systems (ICCAS), pp.1599 - 1605
- ISSN
- 1598-7833
- Conference Place
- Jeju, Korea, Republic of (KR)
Appears in Collections:
- Department of AI Convergence > 2. Conference Papers
- Open Access & License
- File List
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.