A Study on Self-Supervised Object Detection Pretraining

Dang, Trung; Kornblith, Simon; Nguyen, Huy Thong; Chin, Peter; Khademi, Maryam

arXiv:2207.04186 (cs)

[Submitted on 9 Jul 2022 (v1), last revised 10 Aug 2022 (this version, v2)]

Abstract: In this work, we study different approaches to self-supervised pretraining of object detection models. We first design a general framework to learn a spatially consistent dense representation from an image, by randomly sampling boxes, projecting them to each augmented view, and maximizing the similarity between corresponding box features. We study existing design choices in the literature, such as box generation, feature extraction strategies, and the use of multiple views, which is inspired by its success in instance-level image representation learning. Our results suggest that the method is robust to different choices of hyperparameters, and that using multiple views is not as effective as it has been shown to be for instance-level image representation learning. We also design two auxiliary tasks to predict boxes in one view from their features in the other view: (1) predicting boxes from the sampled set using a contrastive loss, and (2) predicting box coordinates using a transformer, which could potentially benefit downstream object detection tasks. We find that these tasks do not lead to better object detection performance when finetuning the pretrained model on labeled data.
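
The framework outlined in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the crop-and-flip view model, the negative cosine similarity objective, and the InfoNCE-style form used for auxiliary task (1) are all hypothetical choices made here for clarity. The box feature extractor is omitted, so box features are plain arrays supplied by the caller.

```python
import numpy as np

def sample_boxes(rng, n, img_w, img_h, min_size=16):
    """Sample n random boxes (x1, y1, x2, y2) in image coordinates."""
    x1 = rng.uniform(0, img_w - min_size, n)
    y1 = rng.uniform(0, img_h - min_size, n)
    x2 = rng.uniform(x1 + min_size, img_w)
    y2 = rng.uniform(y1 + min_size, img_h)
    return np.stack([x1, y1, x2, y2], axis=1)

def project_boxes(boxes, crop, out_size, flip=False):
    """Map boxes into a view: crop (cx, cy, cw, ch) resized to a square
    of side out_size, optionally flipped horizontally (assumed view model)."""
    cx, cy, cw, ch = crop
    sx, sy = out_size / cw, out_size / ch
    x1 = (boxes[:, 0] - cx) * sx
    y1 = (boxes[:, 1] - cy) * sy
    x2 = (boxes[:, 2] - cx) * sx
    y2 = (boxes[:, 3] - cy) * sy
    if flip:
        # Mirroring swaps the roles of the left and right edges.
        x1, x2 = out_size - x2, out_size - x1
    return np.stack([x1, y1, x2, y2], axis=1)

def inside(boxes, out_size):
    """Boolean mask of boxes lying fully inside the view."""
    return ((boxes[:, 0] >= 0) & (boxes[:, 1] >= 0)
            & (boxes[:, 2] <= out_size) & (boxes[:, 3] <= out_size))

def _normalize(f):
    return f / np.linalg.norm(f, axis=1, keepdims=True)

def similarity_loss(f1, f2):
    """Mean negative cosine similarity between matched box features
    (assumed objective; row i of f1 and f2 describe the same box)."""
    return -np.mean(np.sum(_normalize(f1) * _normalize(f2), axis=1))

def box_contrastive_loss(f1, f2, temperature=0.1):
    """InfoNCE-style loss (assumption): each box feature in view 1 must
    pick out its own box among all sampled boxes in view 2."""
    logits = _normalize(f1) @ _normalize(f2).T / temperature
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_prob))
```

In use, one would sample boxes once per image, call `project_boxes` with each view's crop and flip parameters, keep only boxes for which `inside` is true in every view, extract a feature per surviving box from each view's feature map, and minimize `similarity_loss` over the matched features.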

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2207.04186 [cs.CV]
	(or arXiv:2207.04186v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2207.04186

Submission history

From: Trung Dang
[v1] Sat, 9 Jul 2022 03:30:44 UTC (1,172 KB)
[v2] Wed, 10 Aug 2022 20:11:15 UTC (1,172 KB)
