Hybrid Edge-AI Framework for Intelligent Mobile Applications: Leveraging Large Language Models for On-device Contextual Assistance and Code-Aware Automation
DOI:
https://doi.org/10.70393/6a69656173.323935ARK:
https://n2t.net/ark:/40704/JIEAS.v3n3a02Disciplines:
Computer ScienceSubjects:
Software EngineeringReferences:
27Keywords:
Edge-Cloud Hybrid AI, Mobile Development Automation, Dynamic LLM Orchestration, Context-Aware Programming Assistance, On-Device Language Models, Energy-Efficient AIAbstract
The integration of large language models (LLMs) into mobile development workflows has been fundamentally constrained by three competing requirements: computational efficiency, contextual awareness, and real-time responsiveness. While cloud-based LLMs offer unparalleled reasoning capabilities, their reliance on remote infrastructure introduces prohibitive latency, privacy risks, and energy inefficiencies for mobile environments. Conversely, on-device models, though responsive and privacy-preserving, often lack the contextual depth required for complex code understanding and automation tasks. To address these challenges, we present SolidGPT, a hybrid edge-cloud framework that achieves an optimal balance between these competing demands through three key architectural innovations.
First, we introduce a Markov Decision Process (MDP)-based dynamic routing system that intelligently allocates tasks between on-device lightweight models (DistilGPT, TinyLLaMA) and cloud-based LLMs (GPT-4). This system evaluates real-time parameters—including contextual complexity, hardware constraints, and network conditions—to minimize energy consumption (28.6% reduction) while maintaining high accuracy (91% diagnostic accuracy). Second, our deep integration with Android’s Model-View-ViewModel (MVVM) architecture enables semantic-aware analysis across UI layouts, business logic, and runtime logs, bridging the gap between static code analysis and dynamic mobile runtime environments. Third, a novel prompt engineering pipeline preserves codebase-specific context across execution boundaries, ensuring continuity between local and cloud processing.
To validate our framework, we conducted a 12-week deployment with United Airlines’ Android application (128,500 LOC), involving 43 developers across six feature teams. The results demonstrate significant improvements: bug resolution time decreased by 64.1% (*p*<0.001), cloud API calls were reduced by 56.3%, and 87% of developer queries were resolved with sub-second latency. Notably, the system maintained a median energy consumption of 0.81mJ/token for on-device operations, outperforming cloud-only alternatives. These advancements highlight the framework’s ability to harmonize the strengths of edge and cloud computing while addressing critical challenges in energy efficiency, privacy preservation, and toolchain integration.
Beyond mobile development, SolidGPT establishes a template for deploying LLM-powered assistants in resource-constrained edge environments, such as IoT devices and embedded systems. By combining adaptive task allocation, platform-aware semantic analysis, and context-preserving prompt design, our work paves the way for next-generation AI tools that are both powerful and pragmatic—capable of scaling across domains without compromising responsiveness or user trust.
Downloads
Metrics
References
[1] Mathai, A. (2024). Enhancing Education for Underprivileged Children Through AI-Powered Native Language Learning Inclusive Education Through AI-Powered Native Language Learning. Available at SSRN 4899553.
[2] Lin, W., Xiao, J., & Cen, Z. (2024). Exploring Bias in NLP Models: Analyzing the Impact of Training Data on Fairness and Equity. Journal of Industrial Engineering and Applied Science, 2(5), 24-28.
[3] Lyu, S. (2024). The Application of Generative AI in Virtual Reality and Augmented Reality. Journal of Industrial Engineering and Applied Science, 2(6), 1-9.
[4] Li, K., Chen, X., Song, T., Zhang, H., Zhang, W., & Shan, Q. (2024). GPTDrawer: Enhancing Visual Synthesis through ChatGPT. arXiv preprint arXiv:2412.10429.
[5] Lin, W. (2024). A Systematic Review of Computer Vision-Based Virtual Conference Assistants and Gesture Recognition. Journal of Computer Technology and Applied Mathematics, 1(4), 28-35.
[6] Luo, M., Zhang, W., Song, T., Li, K., Zhu, H., Du, B., & Wen, H. (2021, January). Rebalancing expanding EV sharing systems with deep reinforcement learning. In Proceedings of the Twenty-Ninth International Conference on International Joint Conferences on Artificial Intelligence (pp. 1338-1344).
[7] Alam, A., & Mohanty, A. (2023). Educational technology: Exploring the convergence of technology and pedagogy through mobility, interactivity, AI, and learning tools. Cogent Engineering, 10(2), 2283282.
[8] Dezhi Yu, Lipeng Liu, Siye Wu, et al. Machine learning optimizes the efficiency of picking and packing in automated warehouse robot systems. TechRxiv. January 21, 2025. DOI: 10.36227/techrxiv.173750249.93643684/v1.
[9] Wang J Y, Tse K T, Li S W. Integrating the effects of climate change using representative concentration pathways into typhoon wind field in Hong Kong[C]//Proceedings of the 8th European African Conference on Wind Engineering. 2022: 20-23.
[10] Lyu, S. (2024). Machine Vision-Based Automatic Detection for Electromechanical Equipment. Journal of Computer Technology and Applied Mathematics, 1(4), 12-20.
[11] Xia, T., Xu, Y., Shan, X. (2025). KOA-Monitor: A Digital Intervention and Functional Assessment System for Knee Osteoarthritis Patients. In: Gao, Q., Zhou, J. (eds) Human Aspects of IT for the Aged Population. HCII 2025. Lecture Notes in Computer Science, vol 15810. Springer, Cham.
[12] Zhu, H., Luo, Y., Liu, Q., Fan, H., Song, T., Yu, C. W., & Du, B. (2019). Multistep flow prediction on car-sharing systems: A multi-graph convolutional neural network with attention mechanism. International Journal of Software Engineering and Knowledge Engineering, 29(11n12), 1727–1740.
[13] Wu, S., Fu, L., Chang, R., Wei, Y., Zhang, Y., Wang, Z., ... & Li, K. (2025). Warehouse Robot Task Scheduling Based on Reinforcement Learning to Maximize Operational Efficiency. Authorea Preprints.
[14] He, Y., Wang, J., Li, K., Wang, Y., Sun, L., Yin, J., ... & Wang, X. (2025). Enhancing Intent Understanding for Ambiguous Prompts through Human-Machine Co-Adaptation. arXiv preprint arXiv:2501.15167.
[15] Li, X., Wang, X., Qi, Z., Cao, H., Zhang, Z., & Xiang, A. DTSGAN: Learning Dynamic Textures via Spatiotemporal Generative Adversarial Network. Academic Journal of Computing & Information Science, 7(10), 31-40.
[16] Turnbull, D., Chugh, R., & Luck, J. A. (2023). Learning management systems and social media: a case for their integration in higher education institutions.
[17] Li, K., Liu, L., Chen, J., Yu, D., Zhou, X., Li, M., ... & Li, Z. (2024, November). Research on reinforcement learning based warehouse robot navigation algorithm in complex warehouse layout. In 2024 6th International Conference on Artificial Intelligence and Computer Applications (ICAICA) (pp. 296-301). IEEE.
[18] Wang J, Cao S, Tim K T, et al. A novel life-cycle analysis framework to assess the performances of tall buildings considering the climate change[J]. Engineering Structures, 2025, 323: 119258.
[19] Sun, Y., & Ortiz, J. (2024). An ai-based system utilizing iot-enabled ambient sensors and llms for complex activity tracking. arXiv preprint arXiv:2407.02606.
[20] Lin, W. (2025). Enhancing Video Conferencing Experience through Speech Activity Detection and Lip Synchronization with Deep Learning Models. Journal of Computer Technology and Applied Mathematics, 2(2), 16-23.
[21] He, Y., Li, S., Li, K., Wang, J., Li, B., Shi, T., ... & Wang, X. (2025). Enhancing Low-Cost Video Editing with Lightweight Adaptors and Temporal-Aware Inversion. arXiv preprint arXiv:2501.04606.
[22] Luo, M., Du, B., Zhang, W., Song, T., Li, K., Zhu, H., ... & Wen, H. (2023). Fleet rebalancing for expanding shared e-Mobility systems: A multi-agent deep reinforcement learning approach. IEEE Transactions on Intelligent Transportation Systems, 24(4), 3868-3881.
[23] Nasta, L. (2025). Navigating the Paradoxes of Digital Transformation in the Creative and Cultural Industries: Embracing Innovation. Springer Nature.
[24] Li, K., Chen, X., Song, T., Zhou, C., Liu, Z., Zhang, Z., Guo, J., & Shan, Q. (2025a, March 24). Solving situation puzzles with large language model and external reformulation.
[25] Rrucaj, A. (2023). Creating and sustaining competitive advantage in the software as a service (SaaS) Industry: best practices for strategic management.
[26] Li, X., Cao, H., Zhang, Z., Hu, J., Jin, Y., & Zhao, Z. (2024). Artistic Neural Style Transfer Algorithms with Activation Smoothing. arXiv preprint arXiv:2411.08014.
[27] Xu, J., Wang, H., & Trimbach, H. (2016). An OWL ontology representation for machine-learned functions using linked data. 2016 IEEE International Congress on Big Data (BigData Congress), 319–322.

Downloads
Published
How to Cite
Issue
Section
ARK
License
Copyright (c) 2025 The author retains copyright and grants the journal the right of first publication.

This work is licensed under a Creative Commons Attribution 4.0 International License.