OAK

Feed-Forward Gaussian Splatting from Asymmetric Dual-lens Stereo

Metadata Downloads
Author(s)
Hojoon Lee
Type
Thesis
Degree
Master
Department
정보컴퓨팅대학 AI융합학과
Advisor
Jeon, Hae-Gon
Abstract
Modern devices commonly deploy multiple cameras, such as dual-lens camera systems on smartphones. Because the field-of-views differ between the cameras, image resolutions in overlapping regions can vary. Unfortunately, feed-forward 3D Gaussian Splatting (3DGS) methods are designed only for images with the same resolutions, which limits their performance with dual-lens images. We observe that existing works struggle to preserve high-frequency details and to account for 3D geometry from images with different resolutions and narrow baselines. In this paper, we propose a novel feed-forward 3DGS method that handles dual-lens inputs. To do this, we first consider a dual-lens camera system as a cross-scale input. We then adopt a coarse-to-fine feature encoding strategy to enhance low-resolution image features by leveraging geometric information from a depth foundation model. We improve low-resolution images to provide high-frequency details using a cross-scale enhancement module. Finally, we predict Gaussian parameters through a decoding unit to yield 3D Gaussian primitives. Our model achieves state-of-the-art performance on various datasets.
URI
https://scholar.gist.ac.kr/handle/local/31895
Fulltext
http://gist.dcollection.net/common/orgView/200000901901
Alternative Author(s)
이호준
Appears in Collections:
Department of AI Convergence > 3. Theses(Master)
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.