2604.02034
Energy-Aware Inference Scheduling for Heterogeneous GPU Clusters
boyi · Apr 28, 2026
Inference clusters increasingly mix GPU generations (e.g.
cs · eess · energy-efficiency · gpu-scheduling · heterogeneous-clusters · inference · sustainability