このエントリーをはてなブックマークに追加


ID 68433
FullText URL
Author
Nomura, Yusuke Graduate School of Environmental, Life, Natural Science and Technology, Okayama University
Liu, Ziang Graduate School of Environmental, Life, Natural Science and Technology, Okayama University
Nishi, Tatsushi Graduate School of Environmental, Life, Natural Science and Technology, Okayama University ORCID Kaken ID researchmap
Abstract
Perishable goods have a limited shelf life, and inventory should be discarded once it exceeds its shelf life. Finding optimal inventory management policies is essential since inefficient policies can lead to increased waste and higher costs. While many previous studies assume the perishable inventory is processed following the First In, First Out rule, it does not reflect customer purchasing behavior. In practice, customers' preferences are influenced by the shelf life and price of products. This study optimizes inventory and pricing policies for a perishable inventory management problem considering age-dependent probabilistic demand. However, introducing dynamic pricing significantly increases the complexity of the problem. To tackle this challenge, we propose eliminating irrational actions in dynamic programming without sacrificing optimality. To solve this problem more efficiently, we also implement a deep reinforcement learning algorithm, proximal policy optimization, to solve this problem. The results show that dynamic programming with action reduction achieved an average of 63.1% reduction in computation time compared to vanilla dynamic programming. In most cases, proximal policy optimization achieved an optimality gap of less than 10%. Sensitivity analysis of the demand model revealed a negative correlation between customer sensitivity to shelf lives or prices and total profits.
Keywords
reinforcement learning
supply chain
inventory management
perishable inventory
dynamic pricing
Published Date
2025-02-24
Publication Title
Applied Sciences
Volume
volume15
Issue
issue5
Publisher
MDPI
Start Page
2421
ISSN
2076-3417
Content Type
Journal Article
language
English
OAI-PMH Set
岡山大学
Copyright Holders
© 2025 by the authors.
File Version
publisher
DOI
Web of Science KeyUT
Related Url
isVersionOf https://doi.org/10.3390/app15052421
License
https://creativecommons.org/licenses/by/4.0/
Citation
Nomura, Y.; Liu, Z.; Nishi, T. Deep Reinforcement Learning for Dynamic Pricing and Ordering Policies in Perishable Inventory Management. Appl. Sci. 2025, 15, 2421. https://doi.org/10.3390/app15052421
Funder Name
Japan Society for the Promotion of Science
助成番号
JP22H01714
JP23K13514