Liu, Wenwen. “KV Cache and Inference Scheduling: Energy Modeling for High-QPS Services.” Journal of Industrial Engineering and Applied Science, vol. 4, no. 1, Feb. 2026, pp. 34-41, doi:10.70393/6a69656173.333930.