[1]
W. Liu, “KV Cache and Inference Scheduling: Energy Modeling for High-QPS Services”, J. Ind. Eng. Appl. Sci., vol. 4, no. 1, pp. 34–41, Feb. 2026.