Skip to main navigation Skip to search Skip to main content

Profit Maximization of Delay-Sensitive, Differential Accuracy Inference Services in Mobile Edge Computing

  • Yuncan Zhang
  • , Weifa Liang
  • , Zichuan Xu
  • , Xiaohua Jia
  • , Yuanyuan Yang
  • City University of Hong Kong
  • Dalian University of Technology

Research output: Contribution to journalArticlepeer-review

9 Scopus citations

Abstract

The integration of Artificial Itelligence (AI) and edge computing has sparked significant interest in edge inference services. In this paper, we consider delay-sensitive, differential accuracy inference services in a Mobile Edge Computing (MEC) network while meeting user stringent delay and accuracy requirements. We formulate two novel profit maximization problems under static and dynamic settings of service request arrivals, with the aim of maximizing the accumulative profit of admitted requests. We assign differential accuracy service requests to the corresponding resolution instances of their requested service models, assuming that each resolution instance can serve up to L ≥ 1 the same type of service requests. Since the profit maximization problem is NP-hard, we first formulate an Integer Linear Program (ILP) solution if the problem size is small or medium; otherwise, we devise a constant randomized algorithm with high probability. Then, we consider dynamic service request admissions without the knowledge of future request arrivals for a given finite time horizon, for which we develop a simple yet effective prediction mechanism to accurately predict the number of different resolution instances of each model needed, and pre-deploy the predicted number of resolution instances into cloudlets to reduce instantiating delays. We then devise an online algorithm with a provable competitive ratio for the dynamic profit maximization problem by leveraging the primal-dual dynamic updating technique. Finally, we evaluate the performance of the proposed algorithms by simulations. The simulation results demonstrate that the proposed algorithms are promising.

Original languageEnglish
Pages (from-to)6209-6224
Number of pages16
JournalIEEE Transactions on Mobile Computing
Volume24
Issue number7
DOIs
StatePublished - 2025

Keywords

  • Edge computing
  • approximation algorithms
  • differential-accuracy inferences
  • model resolution instance placements
  • multi-resolution service models
  • online algorithms
  • prediction mechanisms
  • primal-dual dynamic updating technique

Fingerprint

Dive into the research topics of 'Profit Maximization of Delay-Sensitive, Differential Accuracy Inference Services in Mobile Edge Computing'. Together they form a unique fingerprint.

Cite this