Deep Reinforcement Learning for Dynamic Pricing and Ordering Policies in Perishable Inventory Management

Nomura, Yusuke; Liu, Ziang; Nishi, Tatsushi

doi:10.3390/app15052421

Permalink : https://ousar.lib.okayama-u.ac.jp/68433

ID	68433
FullText URL	fulltext.pdf 786 KB
Author	Nomura, Yusuke Graduate School of Environmental, Life, Natural Science and Technology, Okayama University Liu, Ziang Graduate School of Environmental, Life, Natural Science and Technology, Okayama University Nishi, Tatsushi Graduate School of Environmental, Life, Natural Science and Technology, Okayama University ORCID Kaken ID researchmap
Abstract	Perishable goods have a limited shelf life, and inventory should be discarded once it exceeds its shelf life. Finding optimal inventory management policies is essential since inefficient policies can lead to increased waste and higher costs. While many previous studies assume the perishable inventory is processed following the First In, First Out rule, it does not reflect customer purchasing behavior. In practice, customers' preferences are influenced by the shelf life and price of products. This study optimizes inventory and pricing policies for a perishable inventory management problem considering age-dependent probabilistic demand. However, introducing dynamic pricing significantly increases the complexity of the problem. To tackle this challenge, we propose eliminating irrational actions in dynamic programming without sacrificing optimality. To solve this problem more efficiently, we also implement a deep reinforcement learning algorithm, proximal policy optimization, to solve this problem. The results show that dynamic programming with action reduction achieved an average of 63.1% reduction in computation time compared to vanilla dynamic programming. In most cases, proximal policy optimization achieved an optimality gap of less than 10%. Sensitivity analysis of the demand model revealed a negative correlation between customer sensitivity to shelf lives or prices and total profits.
Keywords	reinforcement learning supply chain inventory management perishable inventory dynamic pricing
Published Date	2025-02-24
Publication Title	Applied Sciences
Volume	volume15
Issue	issue5
Publisher	MDPI
Start Page	2421
ISSN	2076-3417
Content Type	Journal Article
language	English
OAI-PMH Set	岡山大学
Copyright Holders	© 2025 by the authors.
File Version	publisher
DOI	10.3390/app15052421
Web of Science KeyUT	001442368900001
Related Url	isVersionOf https://doi.org/10.3390/app15052421
License	https://creativecommons.org/licenses/by/4.0/
Citation	Nomura, Y.; Liu, Z.; Nishi, T. Deep Reinforcement Learning for Dynamic Pricing and Ordering Policies in Perishable Inventory Management. Appl. Sci. 2025, 15, 2421. https://doi.org/10.3390/app15052421
Funder Name	Japan Society for the Promotion of Science
助成番号	JP22H01714 JP23K13514