Real Time Sales Forecasting in Omnichannel Retail Using a Hadoop Based Hybrid CNN–LSTM Deep Learning Framework

Authors

  • Huanyu Liu Johns Hopkins University Carey Business School
  • Tian Qi University of San Francisco

DOI:

https://doi.org/10.70393/616a736d.323932

ARK:

https://n2t.net/ark:/40704/AJSM.v3n3a03

Disciplines:

Business

Subjects:

Marketing

References:

22

Keywords:

Hadoop, Hybrid CNN–LSTM Model, Omnichannel Retail, Real Time Sales Forecasting, Distributed Deep Learning, Big Data

Abstract

In an omnichannel retail environment, accurate real time sales forecasts are critical for inventory optimisation and dynamic pricing. This study proposes a Hadoop based hybrid CNN–LSTM deep learning framework that leverages Hadoop’s distributed computing capabilities to process almost 20 million multi source sales records collected over two years. The convolutional layers and recurrent layers cooperate to capture local pulses and long range dependencies, respectively. Systematic experiments show that, compared with classical ARIMA and various machine learning baselines, the proposed model reduces the mean squared error (MSE) by approximately 45 % and increases the coefficient of determination (R²) by about 15 %. Within randomly selected 30 day windows, the model stably tracks high frequency intra week fluctuations while effectively suppressing noise spikes. Moreover, the Hadoop cluster shortens the total training time from 14 h to 3.5 h and compresses single inference latency to 48 ms, satisfying second level business decision requirements. Ablation studies further verify the complementary benefits of the convolutional and recurrent components; removing either leads to significant performance degradation. After deployment at a partner retailer, the stock out rate and dead inventory were reduced by 7.8 % and 6.1 %, respectively, demonstrating the commercial value of the approach. Limitations include cold start bias for new items, underestimation of extreme promotion peaks and insufficient model interpretability. Future work will explore graph convolution to incorporate spatial correlations, self supervised pre training to alleviate cold starts and attention mechanisms to enhance interpretability—thus driving retail sales forecasting towards greater accuracy, trustworthiness and inclusiveness.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

Author Biographies

Huanyu Liu, Johns Hopkins University Carey Business School

Johns Hopkins University Carey Business School,Master of Science in Marketing,100 International Drive, Baltimore, MD 21202, USA.

Tian Qi, University of San Francisco

University of San Francisco, College of Arts and Sciences, 2130 Fulton Street, San Francisco, CA, USA.

References

[1] Sun, J., Zhang, S., Lian, J., Fu, L., Zhou, Z., Fan, Y., & Xu, K. (2024, December). Research on Deep Learning of Convolutional Neural Network for Action Recognition of Intelligent Terminals in the Big Data Environment and its Intelligent Software Application. In 2024 IEEE 7th International Conference on Automation, Electronics and Electrical Engineering (AUTEEE) (pp. 996–1004). IEEE.

[2] Xu, Q. (2022). Omnichannel retail strategies considering stock outs. China Collective Economy(26), 123–125.

[3] Lu, J., Liu, J., Xia, Y., & Cai, S. (2022). Real-time visual SLAM in dynamic environments based on deep learning. Journal of Computer Applications, 42(Suppl. 2), 86–91.

[4] Fang, L. (2025). The Impact of AI Tools on ESL Learners’ Engagement and Language Learning Motivation. Journal of Education and Educational Research, 12(3), 111–114. https://doi.org/10.54097/hvm6w044

[5] Zhou, Y., Shen, J., & Cheng, Y. (2025). Weak to strong generalization for large language models with multi-capabilities. In The Thirteenth International Conference on Learning Representations.

[6] Zhang, T. (2025). Combining Blockchain and AI to Optimize the Intelligent Risk Control Mechanism in Decentralized Finance. Journal of Industrial Engineering and Applied Science, 3(2), 26–32.

[7] He, Y., Li, S., Li, K., Wang, J., Li, B., Shi, T., ... & Wang, X. (2025). Enhancing Low-Cost Video Editing with Lightweight Adaptors and Temporal-Aware Inversion. arXiv preprint arXiv:2501.04606.

[8] Yu, D., Liu, L., Wu, S., Li, K., Wang, C., Xie, J., ... & Ji, R. (2024). Machine learning optimizes the efficiency of picking and packing in automated warehouse robot systems. In 2024 International Conference on Computer Engineering, Network and Digital Communication (CENDC 2024).

[9] Xie, R., & Zhang, Y. (2023). Inventory decisions under omnichannel retail considering returns. Chinese Journal of Management Science, 31(12), 128–137.

[10] Li, K., Wang, J., Wu, X., Peng, X., Chang, R., Deng, X., ... & Hong, B. (2024). Optimizing automated picking systems in warehouse robots using machine learning. arXiv preprint arXiv:2408.16633.

[11] Liu, Y., Tian, J., & Lu, X. (2021). Constructing an omnichannel business model for agricultural products in the new retail environment. Northern Horticulture(1), 168–173.

[12] Zuo, Q., Tao, D., Qi, T., Xie, J., Zhou, Z., Tian, Z., & Mingyu, Y. (2025). Industrial Internet Robot Collaboration System and Edge Computing Optimization. arXiv preprint arXiv:2504.02492.

[13] Wang, B. (2025). Big Data-Driven ESG Quantitative Investment Strategy. Journal of Economic Theory and Business Management, 2(2), 8–13.

[14] Wu, S., Fu, L., Chang, R., Wei, Y., Zhang, Y., Wang, Z., ... & Li, K. (2025). Warehouse Robot Task Scheduling Based on Reinforcement Learning to Maximize Operational Efficiency. Authorea Preprints.

[15] Li, K., Liu, L., Chen, J., Yu, D., Zhou, X., Li, M., ... & Li, Z. (2024, November). Research on reinforcement learning based warehouse robot navigation algorithm in complex warehouse layout. In 2024 6th International Conference on Artificial Intelligence and Computer Applications (ICAICA) (pp. 296–301). IEEE.

[16] Dong, S. (2022). Composite model stock index forecasting based on deep learning (Master’s thesis, Hebei University of Economics and Business).

[17] Mao, Y., Tao, D., Zhang, S., Qi, T., & Li, K. (2025). Research and Design on Intelligent Recognition of Unordered Targets for Robots Based on Reinforcement Learning. arXiv preprint arXiv:2503.07340.

[18] Zhou, Y., Zhang, J., Chen, G., Shen, J., & Cheng, Y. (2024). Less is more: Vision representation compression for efficient video generation with large language models.

[19] He, Y., Wang, J., Li, K., Wang, Y., Sun, L., Yin, J., ... & Wang, X. (2025). Enhancing Intent Understanding for Ambiguous Prompts through Human-Machine Co-Adaptation. arXiv preprint arXiv:2501.15167.

[20] Joseph, R. V., Mohanty, A., Tyagi, S., Mishra, S., Satapathy, S. K., & Mohanty, S. N. (2022). A hybrid deep learning framework with CNN and Bi-directional LSTM for store item demand forecasting. Computers and Electrical Engineering, 103, 108358.

[21] Ahaggach, H., Abrouk, L., & Lebon, E. (2024). Systematic Mapping Study of Sales Forecasting: Methods, Trends, and Future Directions. Forecasting, 6(3), 502-532.

[22] Deepika, M. (2019). AI & ML-Powering the Agents of Automation. BPB Publications.

Downloads

Published

2025-05-16

How to Cite

Liu, H., & Qi, T. (2025). Real Time Sales Forecasting in Omnichannel Retail Using a Hadoop Based Hybrid CNN–LSTM Deep Learning Framework. Academic Journal of Sociology and Management, 3(3), 18–23. https://doi.org/10.70393/616a736d.323932

Issue

Section

Articles

ARK