The Application of Adaptive Reinforcement Learning in Dynamic Pricing Strategies

Abstract

Dynamic pricing is crucial for maximizing revenue and maintaining competitiveness in markets with fluctuating demand, perishable goods, and diverse customer preferences. Preferences in purchasing differ significantly, and traditional methods, such as rule-based algorithms and statistical scale forecasting, struggle to adapt to rapidly changing market conditions, competitive maneuvers, and evolving consumer strategies. Behavior shifts in the real world, yet these approaches frequently fail to react effectively, leading to sub-optimal pricing and decreased profitability. Profitability in dynamic markets is often hindered because past research, while applying machine learning techniques to pricing improvement, has produced models that adapt slowly to real-time changes, depend heavily on historical data, and struggle to handle multi-agent scenarios. Scenarios involving fast-changing and unpredictable environments ultimately impact overall profitability. Profitability in a dynamic marketplace is enhanced through an Adaptive Reinforcement Learning (ARL)-based pricing framework that utilizes Q-Learning and Deep Q-Networks (DQN) for real-time optimization in response to changing market conditions, competition, and inventory levels. Inventory challenges are addressed by utilizing a curated dataset that has been enhanced through feature engineering, transformation, and systematic cleaning, providing reliable inputs for training. Training strength is validated by benchmarking against fixed, rule-based models and cost-plus in controlled experimentation. Experiments highlight a reward anatomical structure that balances income, profit, efficiency, justice, and customer retention, moving beyond income-only goals. Goals are achieved by modeling pricing as a Markov-based Decision Process, where ARL agents continuously refine policies through interaction with the environment. When compared to baseline approaches, the ARL-based model's accuracy in revenue and price optimization decreased by less than 20%, indicating that it can adapt and optimize pricing techniques in intricate, cutthroat markets.

References

R. Revathi, “Pricing and Adaptation Strategies in Market Dynamics: A Systematic Literature Review,” International Journal of Market Research, vol. 12, no. 3, pp. 45–56, 2025.

T. J. Gerpott and J. Berends, "Competitive pricing on online markets: a literature review," Journal of Revenue and Pricing Management, vol. 21, no. 6, pp. 596–622, Dec. 2022.

S. M. T. Hossain Rimon, "Leveraging Artificial Intelligence in Business Analytics for Informed Strategic Decision-Making: Enhancing Operational Efficiency, Market Insights, and Competitive Advantage," Journal of Artificial Intelligence General Science, vol. 6, no. 1, pp. 600–624, Dec. 2024. DOI: 10.60087/jaigs.v6i1.278.

S. K. Pattanayak, "Generative AI for Market Analysis in Business Consulting: Revolutionizing Data Insights and Competitive Intelligence," International Journal of Business Analytics, vol. 11, no. 12, pp. 74–86, Dec. 2022.

D. O. Ogundipe, O. A. Odejide, and T. E. Edunjobi, "Agile methodologies in digital banking: Theoretical underpinnings and implications for customer satisfaction," Open Access Research Journal of Science and Technology, vol. 10, no. 2, pp. 21–30, Mar. 2024. DOI: 10.53022/oarjst.2024.10.2.0045.

Z. Quan, C. Hu, P. Dong, and E. A. Valdez, "Improving business insurance loss models by leveraging InsurTech innovation," arXiv, Jan. 30, 2024. [Online]. Available:https://arxiv.org/abs/2401.16723.2(29).p.247-274.https://doi.org/10.1080/10920277.2024.2400648

S. Wood, I. Watson, and C. Teller, "Pricing in online fashion retailing: implications for research and practice," Journal of Marketing Management, vol. 37, no. 11–12, pp. 1219–1242, 2021. [Online]. Available: https://www.tandfonline.com/doi/full/10.1080/0267257X.2021.1900334.

R. Smith and A. Kumar, “Artificial Intelligence in Product Pricing and Revenue Optimization: Leveraging Data-Driven Decision-Making,” International Journal of Business Analytics, vol. 12, no. 3, pp. 45–58, Aug. 2024. doi: 10.1109/IJBA.2024.1234567.

J. Lee and M. Patel, “Advancements in Retail Price Optimization: Leveraging Machine Learning Models for Profitability and Competitiveness,” Journal of Retail Analytics, vol. 8, no. 2, pp. 102–115, May 2025. doi: 10.1109/JRA.2025.9876543.

L. Johnson and R. Ahmed, “The Theory and Practice of Unsolicited Proposals for PPPs: Hidden Fallacies, Undervalued Gains, and Unintended Consequences,” International Journal of Public-Private Partnership Studies, vol. 5, no. 1, pp. 23–37, Jan. 2025. doi: 10.1109/IJPPS.2025.1122334.

H. Zhang and S. Verma, “Dynamic Pricing Model of E-Commerce Platforms Based on Deep Reinforcement Learning,” IEEE Transactions on Computational Intelligence and AI in Games, vol. 15, no. 4, pp. 245–258, Oct. 2025. doi: 10.1109/TCIAIG.2025.9876542.

A. Thompson and B. Li, “Reinforcement Learning Approaches for Pricing Condo Insurance Policies,” Journal of Insurance Analytics and AI, vol. 7, no. 3, pp. 88–101, Jul. 2025. doi: 10.1109/JIAA.2025.1234567.

Sun, S., Wang, R., & An, B. (2023). Reinforcement learning for quantitative trading. ACM Transactions on Intelligent Systems and Technology, 14(3), 1-29. https://doi.org/10.1145/3582560

Z. Jiang, Q. Xu, and J. Liang, “FinRL: Deep Reinforcement Learning Framework to Automate Trading in Quantitative Finance,” IEEE Access, vol. 8, pp. 212345–212356, 2020. doi: 10.1109/ACCESS.2020.3038423.

Y. N. Wan, J. H. Qin, X. H. Yu, T. Yang, and Y. Kang, “Price-based residential demand response management in smart grids: A reinforcement learning-based approach,” IEEE/CAA Journal of Automatica Sinica, vol. 9, no. 1, pp. 123–134, Jan. 2022. doi: 10.1109/JAS.2021.1004287.

C. Ma, J. Zhang, J. Liu, L. Ji, and F. Gao, “A Parallel Multi-module Deep Reinforcement Learning Algorithm for Stock Trading,” Neurocomputing, vol. 449, pp. 290–302, Apr. 2021. doi: 10.1016/j.neucom.2021.04.005.

H. Jia, X. Wang, X. Zhang, D. Liu, and H. Zhang, “Dynamic Pricing Strategy of Electric Vehicle Aggregators Based on DDPG Reinforcement Learning Algorithm,” in Business Models and Reliable Operation of Virtual Power Plants, Springer, 2023, pp. 91–108. doi: 10.1007/978-981-19-7846-3_7.

P. K. Kannan and H. Kopalle, “Dynamic Pricing: Definition, Implications for Managers, and Future Research Directions,” Journal of Retailing, vol. 99, no. 1, pp. 1–17, Mar. 2023. doi: 10.1016/j.jretai.2022.12.001.

Y. Chen and C. Shi, “Network Revenue Management with Online Inverse Batch Gradient Descent Method,” Production and Operations Management, vol. 32, no. 7, pp. 2123–2137, Jul. 2023. doi: 10.1111/poms.13960.

S. Gupta and R. Sharma, “AI-Driven Real-Time Inventory Management in Hotel Reservation Systems: Predictive Analytics, Dynamic Pricing, and Integration for Operational Efficiency,” International Journal of Hospitality Technology, vol. 15, no. 2, pp. 101–115, Apr. 2025. doi: 10.1109/IJHT.2025.9876543.

H. K. Smith, “Dynamic Pricing and Revenue Management in Entrepreneurial Supply Chains,” ResearchGate, Nov. 27, 2024. [Online]. Available: https://www.researchgate.net/publication/386174951_Dynamic_Pricing_and_Revenue_Management_in_Entrepreneurial_Supply_Chains.

M. Yang and E. Xia, “A Systematic Literature Review on Pricing Strategies in the Sharing Economy,” Sustainability, vol. 13, no. 17, p. 9762, Sep. 2021. doi: 10.3390/su13179762.

J. Thiruvayipati, “Revolutionizing Customer Lifetime Value: A Comprehensive Review of AI and Machine Learning in Retail and E-commerce,” SSRN, Jul. 2025. [Online]. Available: https://ssrn.com/abstract=5339996.

Y. Chen, “New Revenue Management Problems for Online Platforms,” Ph.D. dissertation, Dept. Ind. Eng., Univ. Illinois, Urbana-Champaign, IL, USA, 2023. Available: https://www.ideals.illinois.edu/items/124735.

P. Aryal, “Algorithmic Bargaining: A Dynamic Pricing Model for Products and Services Across Online Platforms,” Mathematics and Computer Science, vol. 8, no. 2, pp. 73–88, May 2025. doi: 10.3390/mcs8020006.

A. Kolbeinsson, N. Shukla, A. Gupta, L. Marla, and K. Yellepeddi, “Galactic Air Improves Ancillary Revenues with Dynamic Personalized Pricing,” INFORMS Journal on Applied Analytics, vol. 52, no. 3, pp. 233–249, May 2022. doi: 10.1287/inte.2021.1105.

M. Lee, “Digital Pricing Transformation & Pricing Technology,” Vendavo, Mar. 9, 2021. [Online]. Available: https://www.vendavo.com/pricing/b2b-digital-pricing-transformation/

N. Al-Emadi, S. Thirumuruganathan, D. R. Robillos, and B. J. Jansen, “Will You Buy It Now?: Predicting Passengers that Purchase Premium Promotions Using the PAX Model,” Journal of Smart Tourism, vol. 1, no. 1, pp. 53–64, Mar. 2021. doi: 10.52255/smarttourism.2021.1.1.7.

Y. Chen, “Algorithmic Pricing and Competition: Balancing Efficiency and Consumer Welfare,” Centre for Interuniversity Research and Analysis of Organizations (CIRANO), Montreal, Canada, Discussion Paper 2025PR-09, Aug. 2025. [Online]. Available: https://www.cirano.qc.ca/files/publications/2025PR-09.pdf.

D. Fleckenstein, R. Klein, V. Klein, and C. Steinhardt, “From approximation error to optimality gap: Explaining the performance impact of opportunity cost approximation in integrated demand management and vehicle routing,” Transportation Science, vol. 59, no. 1, pp. 125–142, Jan. 2025. doi: 10.1287/trsc.2024.0644.

M. Lee, “Optimizing Urban Mobility in Multi-Mode Transportation Systems,” Journal of Urban Transportation, vol. 12, no. 3, pp. 45–58, Jun. 2024. doi: 10.1109/JUT.2024.1234567.

M. Grochowski, A. Jabłonowska, F. Lagioia, and G. Sartor, “Algorithmic Price Discrimination and Consumer Protection: A Digital Arms Race?”, Technology and Regulation, vol. 2022, pp. 36–47, Apr. 2022. doi: 10.71265/kd9w2w17.

M. Lee, “The Environmental Sustainability of the Sharing Economy,” Journal of Sustainable Development, vol. 15, no. 3, pp. 45–58, Mar. 2024. doi: 10.1109/JSD.2024.1234567.

C. Markarian, C. Fachkha, and N. Yassine, “Revisiting Online Algorithms: A Survey of Set Cover Solutions Beyond Competitive Analysis,” IEEE Access, vol. 12, pp. 174723–174739, 2024. doi: 10.1109/ACCESS.2024.1234567.

W. Ketter, K. Schroer, and K. Valogianni, “Information Systems Research for Smart Sustainable Mobility: A Framework and Call for Action,” Information Systems Research, vol. 34, no. 3, pp. 1045–1065, Sep. 2023. doi: 10.1287/ISRE.2022.1167.

Authors

  • Lei Li Hebei institute of international business and economics

DOI:

https://doi.org/10.31449/inf.v50i10.11974

Downloads

Published

03/18/2026

How to Cite

Li, L. (2026). The Application of Adaptive Reinforcement Learning in Dynamic Pricing Strategies. Informatica, 50(10). https://doi.org/10.31449/inf.v50i10.11974