OAK

MAE-based Hybrid Convolutional ViT for Self-Supervised Learning

Metadata Downloads
Author(s)
Nami Seo
Type
Thesis
Degree
Master
Department
대학원 AI대학원
Advisor
Ahn, Chang Wook
Abstract
In this study, we aim to achieve lightweight models by adopting the Convolutional Vision Transformer (CvT) as the backbone and incorporating key techniques of Inpainting, namely update mask strategy and skip connections, to enhance the model's performance. Experiments were conducted on the Tiny-ImageNet-200 and ImageNet-1k datasets. The results of our approach demonstrate the effectiveness of model lightweightization and novel training strategies in improving performance. This provides a new direction for achieving efficient model training even in the context of limited resources.
URI
https://scholar.gist.ac.kr/handle/local/19472
Fulltext
http://gist.dcollection.net/common/orgView/200000883875
Alternative Author(s)
서나미
Appears in Collections:
Department of AI Convergence > 3. Theses(Master)
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.