OAK

AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification

Metadata Downloads
Author(s)
Cho, GeonwooLee, JaemoonIm, JaegyunLee, SubiLee, JihwanKim, Sundong
Type
Conference Paper
Citation
ICLR 2026 (The Fourteenth International Conference on Learning Representations), pp.1 - 37
Issued Date
2026-04-25
Abstract
Skill-based reinforcement learning (SBRL) enables rapid adaptation in environments with sparse rewards by pretraining a skill-conditioned policy. Effective skill learning requires jointly maximizing both exploration and skill diversity. However, existing methods often face challenges in simultaneously optimizing for these two conflicting objectives. In this work, we propose a new method, Adaptive Multi-objective Projection for balancing Exploration and skill Diversification (AMPED), which explicitly addresses both: during pre-training, a gradient-surgery projection balances the exploration and diversity gradients, and during fine-tuning, a skill selector exploits the learned diversity by choosing skills suited to downstream tasks. Our approach achieves performance that surpasses SBRL baselines across various benchmarks. Through an extensive ablation study, we identify the role of each component and demonstrate that each element in AMPED is contributing to performance. We further provide theoretical and empirical evidence that, with a greedy skill selector, greater skill diversity reduces fine-tuning sample complexity. These results highlight the importance of explicitly harmonizing exploration and diversity and demonstrate the effectiveness of AMPED in enabling robust and generalizable skill learning.
Publisher
ICLR
Conference Place
BL
브라질
URI
https://scholar.gist.ac.kr/handle/local/33593
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.