OAK

D2S: 강화학습을 이용한 매니퓰레이터 제어를 위한 보상 빈도 기반의 커리큘럼 학습 개발

Metadata Downloads
Author(s)
이상범김태원백승혁Lee, Kyoobin
Type
Conference Paper
Citation
제 36회 제어로봇시스템학회(ICROS2021), pp.550 - 551
Issued Date
2021-06-24
Abstract
In this paper, we present D2S(Dense-to-Sparse) curriculum learning to improve learning efficiency when using reinforcement learning to perform robotic manipulation tasks. We use D2S curriculum learning to perform robotic manipulation tasks on PyRep based RLBench. Firstly, we compared the learning results with using sparse reward alone and with using both sparse reward and distance-based dense rewards too. Next, we set up five phases with different frequencies of the reward and designed three experiments by combining the phases so that the robot can learn the task with gradually increasing difficulty. D2S curriculum learning allowed us to succeed the Reaching task, which was not well-trained when using a sparse reward. However, dividing the phases into three or more does not improve learning efficiency. In the future, we plan to optimize the condition where phases of the D2S curriculum learning ends and use it to perform more complex robotic manipulation tasks.
Publisher
한국제어로봇시스템학회
Conference Place
KO
여수 소노캄
URI
https://scholar.gist.ac.kr/handle/local/22064
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.