TY - JOUR
T1 - HARBOR: Harnessing Bandwidth, Computation, and Batch for Fair QoE Having Collaborative Edge-AI Services in Industrial CPS
AU - Chen, Long
AU - Zheng, Shaojie
AU - Wu, Jigang
AU - Dai, Hong-Ning
AU - Niyato, Dusit
AU - Wan, Jiafu
AU - Huang, Jiale
N1 - This work was supported by the Youth Enhancement Project of Natural Science Fund of Guangdong Province, granted by the Guangdong Basic and Applied Basic Research Foundation, under the Grant No.2023A1515030183.
The work of Prof. Niyato was supported by the National Research Foundation, Singapore and Infocomm Media Development Authority under its Future Communications Research & Development Programme (FCP-NTU-RG-2022-010 and FCP-ASTAR-TG2022-003), Singapore Ministry of Education (MOE) Tier 1 (RG87/22 and RG24/24), the NTU Centre for Computational Technologies in Finance (NTUCCTF), and the RIE2025 Industry Alignment Fund - Industry Collaboration Projects (IAF-ICP) (Award I2301E0026), administered by A*STAR. E-mail: ([email protected])
The work of Prof. Wan was supported by the Special Project on Cooperation and Exchange of Shanxi Province Science and Technology, China (No. 202204041101036).
Publisher Copyright:
© 2025 IEEE.
PY - 2025/5/30
Y1 - 2025/5/30
N2 - Inadequate resource coordination and control can result in poor quality of experience (QoE) for user devices in heterogeneous edge-enabled cyber-physical systems. Unfortunately, in a cooperative edge network, existing studies have rarely jointly optimized communication, computing resources, and batch size for QoE guarantee when controlling task offloading. To this end, we investigate the problem of harnessing bandwidth, computation, and batch size for fair quality of experience (HARBOR) in a practical collaborative edge-AI environment, where UEs have different accuracy requirements of inference services and edge devices possess different batch processing capabilities. Specifically, we introduce the task completion efficiency as the task-completion-time-to-deadline ratio to quantify individual QoE. Then, we formulate the problem HARBOR as a mixed integer nonlinear programming with constraints of accuracy, bandwidth, computation, task hard deadlines and so on. The objective is to minimize the maximum task completion efficiency among all tasks to achieve task-level fairness. After providing the NP-hardness proof for HARBOR, we then devise an efficient scheme named e-HARBOR with a competitive ratio guarantee, to solve the decoupled sub-problems of HARBOR with calibrated long short-term memory network for resource prediction. Both testbed and simulation experiments evidently demonstrate that the proposed scheme works efficiently and scales well compared to baselines.
AB - Inadequate resource coordination and control can result in poor quality of experience (QoE) for user devices in heterogeneous edge-enabled cyber-physical systems. Unfortunately, in a cooperative edge network, existing studies have rarely jointly optimized communication, computing resources, and batch size for QoE guarantee when controlling task offloading. To this end, we investigate the problem of harnessing bandwidth, computation, and batch size for fair quality of experience (HARBOR) in a practical collaborative edge-AI environment, where UEs have different accuracy requirements of inference services and edge devices possess different batch processing capabilities. Specifically, we introduce the task completion efficiency as the task-completion-time-to-deadline ratio to quantify individual QoE. Then, we formulate the problem HARBOR as a mixed integer nonlinear programming with constraints of accuracy, bandwidth, computation, task hard deadlines and so on. The objective is to minimize the maximum task completion efficiency among all tasks to achieve task-level fairness. After providing the NP-hardness proof for HARBOR, we then devise an efficient scheme named e-HARBOR with a competitive ratio guarantee, to solve the decoupled sub-problems of HARBOR with calibrated long short-term memory network for resource prediction. Both testbed and simulation experiments evidently demonstrate that the proposed scheme works efficiently and scales well compared to baselines.
KW - Edge computing
KW - collaborative edge-AI
KW - inference time
KW - quality of experience
KW - task offloading
UR - http://www.scopus.com/inward/record.url?scp=105007905386&partnerID=8YFLogxK
U2 - 10.1109/JSAC.2025.3574583
DO - 10.1109/JSAC.2025.3574583
M3 - Journal article
SN - 0733-8716
JO - IEEE Journal on Selected Areas in Communications
JF - IEEE Journal on Selected Areas in Communications
ER -