A hybrid deep reinforcement learning approach for a proactive transshipment of fresh food in the online–offline channel system

Lee, Junhyeok; Shin, Youngchul; Moon, Ilkyeong

Publication Year: 2024-07-01

Publisher: Elsevier Ltd

Citation: Transportation Research Part E: Logistics and Transportation Review, Vol. 187

Keyword: Deep reinforcement learning; Online–offline channel system; Perishable inventory management; Proactive transshipment; Soft actor–critic

Mesh Keyword: Actor critic; Channel systems; Deep reinforcement learning; Offline channel; Online–offline channel system; Perishable inventory management; Proactive transshipment; Reinforcement learning approach; Reinforcement learnings; Soft actor–critic

All Science Classification Codes (ASJC): Business and International Management; Civil and Structural Engineering; Transportation

Abstract: To reduce the waste of fresh foods, one of the e-commerce companies in South Korea utilizes lateral transshipment in the network of online platforms and offline shops, which is called the online–offline channel system (OOCS). Even though the OOCS has achieved success in real practice, there is room for further study on this system with regard to deriving a transshipment policy. For this reason, this study aims to develop a solution approach that could derive a promising policy and analyze the impacts of transshipment in the OOCS. The main contributions are summarized as follows. First, we propose a model to deal with the proactive transshipment of perishable products in the OOCS. In particular, this is the first study that introduces the concept of the heterogeneous shelf life considering different properties of online and offline channels. Second, we develop the hybrid deep reinforcement learning (DRL) approach by combining the soft actor–critic algorithm with two novel acceleration methods. The developed method could obtain a promising policy without assumptions about demand distribution and mitigate computational burdens by reducing action spaces. On a set of experiments carried out on real-world demand data, the transshipment policy derived from the hybrid DRL approach could obtain the best profit compared to existing algorithms. Third, we examine the impacts of transshipment by differing types of demand and varying the unit transshipment cost parameter. We find that transshipment substantially reduces the outdating cost by allowing the offline channel to make good use of the old products that will be discarded in the online channel, which is new to the literature.

ISSN: 1366-5545

Language: eng

URI: https://dspace.ajou.ac.kr/dev/handle/2018.oak/34212

DOI: https://doi.org/10.1016/j.tre.2024.103576

Type: Article

Funding: The authors are grateful for the valuable comments from the guest editor and three anonymous reviewers. This work was supported by the National Research Foundation of Korea (NRF) grants funded by the Korea government (MSIT) (No. RS-2023-00218913, No. RS-2024-00337285, and No. NRF-2019R1A2C2084616).

Related Researcher

Shin, Youngchul 신영철: Department of Industrial Engineering
