[1]
W. Liu, “KV Cache and Inference Scheduling: Energy Modeling for High-QPS Services”, Journal of Industrial Engineering & Applied Science, vol. 4, no. 1, pp. 34–41, Feb. 2026.