OAK

MAE-based Hybrid Convolutional ViT for Self-Supervised Learning

Metadata Downloads
Abstract
In this study, we aim to achieve lightweight models by adopting the Convolutional Vision Transformer (CvT) as the backbone and incorporating key techniques of Inpainting, namely update mask strategy and skip connections, to enhance the model's performance. Experiments were conducted on the Tiny-ImageNet-200 and ImageNet-1k datasets. The results of our approach demonstrate the effectiveness of model lightweightization and novel training strategies in improving performance. This provides a new direction for achieving efficient model training even in the context of limited resources.
Author(s)
Nami Seo
Issued Date
2023
Type
Thesis
URI
https://scholar.gist.ac.kr/handle/local/19472
Alternative Author(s)
서나미
Department
대학원 AI대학원
Advisor
Ahn, Chang Wook
Degree
Master
Appears in Collections:
Department of AI Convergence > 3. Theses(Master)
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.