Liu, W. (2026). KV Cache and Inference Scheduling: Energy Modeling for High-QPS Services. Journal of Industrial Engineering and Applied Science, 4(1), 34–41. https://doi.org/10.70393/6a69656173.333930