OAK

A Deep Reinforcement Learning-Based Scheme for Solving Multiple Knapsack Problems

Metadata Downloads
Abstract
A knapsack problem is to select a set of items that maximizes the total profit of selected items while keeping the total weight of the selected items no less than the capacity of the knapsack. As a generalized form with multiple knapsacks, the multi-knapsack problem (MKP) is to select a disjointed set of items for each knapsack. To solve MKP, we propose a deep reinforcement learning (DRL) based approach, which takes as input the available capacities of knapsacks, total profits and weights of selected items, and normalized profits and weights of unselected items and determines the next item to be mapped to the knapsack with the largest available capacity. To expedite the learning process, we adopt the Asynchronous Advantage Actor-Critic (A3C) for the policy model. The experimental results indicate that the proposed method outperforms the random and greedy methods and achieves comparable performance to an optimal policy in terms of the profit ratio of the selected items to the total profit sum, particularly when the profits and weights of items have a non-linear relationship such as quadratic forms.
Author(s)
Sur, GiwonRyu, Shun YuelKim, Jong WonLim, Hyuk
Issued Date
2022-03
Type
Article
DOI
10.3390/app12063068
URI
https://scholar.gist.ac.kr/handle/local/10918
Publisher
MDPI
Citation
APPLIED SCIENCES-BASEL, v.12, no.6
Appears in Collections:
Department of AI Convergence > 1. Journal Articles
공개 및 라이선스
  • 공개 구분공개
파일 목록

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.