Energy-efficient Online Scheduling of Transformer Inference Services on GPU Servers

Yuxin Wang, Qiang Wang, Xiaowen Chu*

*Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

1 Citation (Scopus)

Fingerprint

Dive into the research topics of 'Energy-efficient Online Scheduling of Transformer Inference Services on GPU Servers'. Together they form a unique fingerprint.

Engineering & Materials Science