Ajou University repository

A hybrid deep reinforcement learning approach for a proactive transshipment of fresh food in the online–offline channel system
Citations

SCOPUS

4

Citation Export

Publication Year
2024-07-01
Publisher
Elsevier Ltd
Citation
Transportation Research Part E: Logistics and Transportation Review, Vol.187
Keyword
Deep reinforcement learningOnline–offline channel systemPerishable inventory managementProactive transshipmentSoft actor–critic
Mesh Keyword
Actor criticChannel systemsDeep reinforcement learningOffline channelOnline–offline channel systemPerishable inventory managementProactive transshipmentReinforcement learning approachReinforcement learningsSoft actor–critic
All Science Classification Codes (ASJC)
Business and International ManagementCivil and Structural EngineeringTransportation
Abstract
To reduce the waste of fresh foods, one of the e-commerce companies in South Korea utilizes lateral transshipment in the network of online platforms and offline shops, which is called the online–offline channel system (OOCS). Even though the OOCS has achieved success in real practice, there is room for further study on this system with regard to deriving a transshipment policy. For this reason, this study aims to develop a solution approach that could derive a promising policy and analyze the impacts of transshipment in the OOCS. The main contributions are summarized as follows. First, we propose a model to deal with the proactive transshipment of perishable products in the OOCS. In particular, this is the first study that introduces the concept of the heterogeneous shelf life considering different properties of online and offline channels. Second, we develop the hybrid deep reinforcement learning (DRL) approach by combining the soft actor–critic algorithm with two novel acceleration methods. The developed method could obtain a promising policy without assumptions about demand distribution and mitigate computational burdens by reducing action spaces. On a set of experiments carried out on real-world demand data, the transshipment policy derived from the hybrid DRL approach could obtain the best profit compared to existing algorithms. Third, we examine the impacts of transshipment by differing types of demand and varying the unit transshipment cost parameter. We find that transshipment substantially reduces the outdating cost by allowing the offline channel to make good use of the old products that will be discarded in the online channel, which is new to the literature.
ISSN
1366-5545
Language
eng
URI
https://dspace.ajou.ac.kr/dev/handle/2018.oak/34212
DOI
https://doi.org/10.1016/j.tre.2024.103576
Fulltext

Type
Article
Funding
The authors are grateful for the valuable comments from the guest editor and three anonymous reviewers. This work was supported by the National Research Foundation of Korea (NRF) grants funded by the Korea government (MSIT) (No. RS-2023-00218913 and No. RS-2024-00337285).The authors are grateful for the valuable comments from the guest editor and three anonymous reviewers. This work was supported by the National Research Foundation of Korea (NRF) grants funded by the Korea government (MSIT) (No. RS-2023-00218913 and No. NRF-2019R1A2C2084616 ).
Show full item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Shin, Youngchul  Image
Shin, Youngchul 신영철
Department of Industrial Engineering
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.