Liu, Wenwen. 2026. “KV Cache and Inference Scheduling: Energy Modeling for High-QPS Services”. Journal of Industrial Engineering and Applied Science 4 (1). London, U.K.:34-41. https://doi.org/10.70393/6a69656173.333930.