dCollection 디지털 학술정보 유통시스템

Adaptive inventory replenishment using structured reinforcement learning by exploiting a policy structure

주제(키워드) Inventory replenishment policy , Reinforcement learning , Stochastic approximation , Structural properties
등재 SCOPUS
발행기관 Elsevier B.V.
발행년도 2023
총서유형 Journal
URI http://www.dcollection.net/handler/ewha/000000211845
본문언어 영어
Published As https://doi.org/10.1016/j.ijpe.2023.109029

초록/요약

We consider an inventory replenishment problem with unknown and non-stationary demand. We design a structured reinforcement learning algorithm that efficiently adapts the replenishment policy to changing demand without any prior knowledge. Our proposed method integrates the known structural properties of a well-performing inventory replenishment policy with reinforcement learning. By exploiting the policy structure, we tune reinforcement learning to characterize the inventory replenishment policy and approximate the value function. In particular, we propose two methods for stochastic approximation on the gradient of the objective function. These novel reinforcement learning algorithms ensure an efficient convergence rate and lower algorithmic complexity for solving practical problems. The numerical results demonstrate that the proposed algorithms adaptively update the policy to changing demand and lower inventory costs compared to various benchmarks. We also conduct a numerical validation for a South Korean retail shop to validate the practical feasibility of the proposed method. Understanding the policy structure is beneficial for designing reinforcement learning algorithms that can address the inventory replenishment problem. These well-designed reinforcement learning algorithms are particularly promising when we require policy updates based on observations without precise knowledge of non-stationary demand. These research findings could be extended to address the various inventory decisions in which policy structures are available. © 2023

반출 Meta View 목록

검색 상세

Adaptive inventory replenishment using structured reinforcement learning by exploiting a policy structure

초록/요약