Liu, W. (2026) “KV Cache and Inference Scheduling: Energy Modeling for High-QPS Services”, Journal of Industrial Engineering and Applied Science. London, U.K., 4(1), pp. 34–41. doi: 10.70393/6a69656173.333930.