Liu, Wenwen. “KV Cache and Inference Scheduling: Energy Modeling for High-QPS Services”. Journal of Industrial Engineering and Applied Science 4, no. 1 (February 5, 2026): 34–41. Accessed February 6, 2026. https://www.suaspress.org/ojs/index.php/JIEAS/article/view/v4n1a05.