Multi-Objective Hierarchical Reinforcement Learning with Online Meta-Learning for Dynamic Pricing Strategy Optimization

Abstract

In this study, we propose a Multi-Objective Hierarchical Reinforcement Learning (MOHRL) approach with online meta-learning for dynamic pricing strategy optimization. Our method utilizes hierarchical RL layers to decompose the pricing decision-making process and a meta-learning adapter to accelerate the cold start. We compare MOHRL with baseline methods like DQN and NSGA-II. Experimental results show that MOHRL outperforms DQN by 25% in profit and 18% in retention rate, and NSGA-II by 30% in market share over a 30-day simulation. The simulation system built based on 100,000+ SKU data of an e-commerce platform demonstrates MOHRL's superiority in real-time dynamic pricing, especially in cold start scenarios. Ablation experiments confirm that the meta-learning.

Author Biography

Guangzeng Zhang, Anyang Vocational And technical College

Guangzeng Zhang was born in Anyang, HeNan,P.R. China, in 1978. He received the Master degree from JiangSu University, P.R. China. Now, he works in Anyang Vocational And technical College, His research interests include marketing and rural economy.

Authors

  • Guangzeng Zhang Anyang Vocational And technical College

DOI:

https://doi.org/10.31449/inf.v49i26.8595

Downloads

Published

07/03/2025

How to Cite

Zhang, G. (2025). Multi-Objective Hierarchical Reinforcement Learning with Online Meta-Learning for Dynamic Pricing Strategy Optimization. Informatica, 49(26). https://doi.org/10.31449/inf.v49i26.8595