Energy-efficient Inference Service of Transformer-based Deep Learning Models on GPUs

Yuxin Wang, Qiang Wang, Xiaowen Chu

Research output: Chapter in book/report/conference proceeding; Conference contribution; peer-review
