Exploring Planning Capability of Large Language Model Using Abstraction and Reasoning Corpus Benchmark

Woochang Sim
Gwangju Institute of Science and Technology

Author(s): 심우창

Type: Thesis

Degree: Master

Department: Graduate School, AI Graduate School

Advisor: Kim, Sungdong

Abstract: Large language models, trained with enormous parameter counts on large amounts of data and GPU resources, have recently shown strong performance on a variety of benchmarks. For these models to advance further, they must be able to identify the pattern in a problem they encounter for the first time, set an objective for solving it, and establish a strategy to reach that objective. In short, large language models need planning ability to take the next step. In this thesis, planning ability is defined as a set of two sub-capabilities: objective setting and strategy formulation. Two main kinds of experiments were conducted. The first examines the planning ability of large language models by evaluating their objective setting and strategy formulation. The second searches for the prompt settings that allow planning ability to be evaluated under the best conditions. The results show that the planning ability of current large language models falls short of their performance on other benchmarks. Methods such as feedback loops and hierarchical organization of meta-concepts are expected to improve planning ability further.

URI: https://scholar.gist.ac.kr/handle/local/19277

Fulltext: http://gist.dcollection.net/common/orgView/200000853120

Alternative Author(s): Woochang Sim

Appears in Collections: Department of AI Convergence > 3. Theses(Master)

Access and License

Access type: Open
