OAK

Improving small objects detection using transformer

Metadata Downloads
Abstract
General artificial intelligence counteracts the inductive bias of an algorithm and tunes the algorithm for out-of-distribution generalization. A conspicuous impact of the inductive bias is an unceasing trend in improving deep learning performance. Although a quintessential attention-based object detection technique, DETR, shows better accuracy than its predecessors, its accuracy deteriorates for detecting small-sized (in-perspective) objects. This study examines the inductive bias of DETR and proposes a normalized inductive bias for object detection using data fusion, SOF-DETR. A technique of lazy-fusion of features is introduced in SOF-DETR, which sustains deep contextual information of objects present in an image. The features from multiple subsequent deep layers are fused for object queries that learn long and short-distance spatial association in an image using the attention mechanism. Experimental results on the MS COCO and Udacity Self Driving Car datasets assert the effectiveness of the added normalized inductive bias and feature fusion techniques, showing increased COCO mAP scores on small-sized objects.
Author(s)
Dubey, ShikhaOlimov, FarrukhRafique, Muhammad AasimJeon, Moongu
Issued Date
2022-11
Type
Article
DOI
10.1016/j.jvcir.2022.103620
URI
https://scholar.gist.ac.kr/handle/local/10549
Publisher
ACADEMIC PRESS INC ELSEVIER SCIENCE
Citation
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, v.89
ISSN
1047-3203
Appears in Collections:
Department of Electrical Engineering and Computer Science > 1. Journal Articles
공개 및 라이선스
  • 공개 구분공개
파일 목록

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.