このエントリーをはてなブックマークに追加


ID 68433
フルテキストURL
著者
Nomura, Yusuke Graduate School of Environmental, Life, Natural Science and Technology, Okayama University
Liu, Ziang Graduate School of Environmental, Life, Natural Science and Technology, Okayama University
Nishi, Tatsushi Graduate School of Environmental, Life, Natural Science and Technology, Okayama University ORCID Kaken ID researchmap
抄録
Perishable goods have a limited shelf life, and inventory should be discarded once it exceeds its shelf life. Finding optimal inventory management policies is essential since inefficient policies can lead to increased waste and higher costs. While many previous studies assume the perishable inventory is processed following the First In, First Out rule, it does not reflect customer purchasing behavior. In practice, customers' preferences are influenced by the shelf life and price of products. This study optimizes inventory and pricing policies for a perishable inventory management problem considering age-dependent probabilistic demand. However, introducing dynamic pricing significantly increases the complexity of the problem. To tackle this challenge, we propose eliminating irrational actions in dynamic programming without sacrificing optimality. To solve this problem more efficiently, we also implement a deep reinforcement learning algorithm, proximal policy optimization, to solve this problem. The results show that dynamic programming with action reduction achieved an average of 63.1% reduction in computation time compared to vanilla dynamic programming. In most cases, proximal policy optimization achieved an optimality gap of less than 10%. Sensitivity analysis of the demand model revealed a negative correlation between customer sensitivity to shelf lives or prices and total profits.
キーワード
reinforcement learning
supply chain
inventory management
perishable inventory
dynamic pricing
発行日
2025-02-24
出版物タイトル
Applied Sciences
15巻
5号
出版者
MDPI
開始ページ
2421
ISSN
2076-3417
資料タイプ
学術雑誌論文
言語
英語
OAI-PMH Set
岡山大学
著作権者
© 2025 by the authors.
論文のバージョン
publisher
DOI
Web of Science KeyUT
関連URL
isVersionOf https://doi.org/10.3390/app15052421
ライセンス
https://creativecommons.org/licenses/by/4.0/
Citation
Nomura, Y.; Liu, Z.; Nishi, T. Deep Reinforcement Learning for Dynamic Pricing and Ordering Policies in Perishable Inventory Management. Appl. Sci. 2025, 15, 2421. https://doi.org/10.3390/app15052421
助成機関名
Japan Society for the Promotion of Science
助成番号
JP22H01714
JP23K13514