• KSII Transactions on Internet and Information Systems
    Monthly Online Journal (eISSN: 1976-7277)

Collaborative Inference for Deep Neural Networks in Edge Environments


Abstract

Recent advances in deep neural networks (DNNs) have greatly improved the accuracy and universality of various intelligent applications, at the expense of increasing model size and computational demand. Since the resources of end devices are often too limited to deploy a complete DNN model, offloading DNN inference tasks to cloud servers is a common approach to meet this gap. However, due to the limited bandwidth of WAN and the long distance between end devices and cloud servers, this approach may lead to significant data transmission latency. Therefore, device-edge collaborative inference has emerged as a promising paradigm to accelerate the execution of DNN inference tasks where DNN models are partitioned to be sequentially executed in both end devices and edge servers. Nevertheless, collaborative inference in heterogeneous edge environments with multiple edge servers, end devices and DNN tasks has been overlooked in previous research. To fill this gap, we investigate the optimization problem of collaborative inference in a heterogeneous system and propose a scheme CIS, i.e., collaborative inference scheme, which jointly combines DNN partition, task offloading and scheduling to reduce the average weighted inference latency. CIS decomposes the problem into three parts to achieve the optimal average weighted inference latency. In addition, we build a prototype that implements CIS and conducts extensive experiments to demonstrate the scheme’s effectiveness and efficiency. Experiments show that CIS reduces 29% to 71% on the average weighted inference latency compared to the other four existing schemes.


Statistics

Show / Hide Statistics

Statistics (Cumulative Counts from December 1st, 2015)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article

[IEEE Style]
M. Liu, Y. Gu, S. Dong, L. Wei, K. Liu, Y. Yan, Y. Song, H. Cheng, L. Tang, S. Zhang, "Collaborative Inference for Deep Neural Networks in Edge Environments," KSII Transactions on Internet and Information Systems, vol. 18, no. 7, pp. 1749-1773, 2024. DOI: 10.3837/tiis.2024.07.003.

[ACM Style]
Meizhao Liu, Yingcheng Gu, Sen Dong, Liu Wei, Kai Liu, Yuting Yan, Yu Song, Huanyu Cheng, Lei Tang, and Sheng Zhang. 2024. Collaborative Inference for Deep Neural Networks in Edge Environments. KSII Transactions on Internet and Information Systems, 18, 7, (2024), 1749-1773. DOI: 10.3837/tiis.2024.07.003.

[BibTeX Style]
@article{tiis:100952, title="Collaborative Inference for Deep Neural Networks in Edge Environments", author="Meizhao Liu and Yingcheng Gu and Sen Dong and Liu Wei and Kai Liu and Yuting Yan and Yu Song and Huanyu Cheng and Lei Tang and Sheng Zhang and ", journal="KSII Transactions on Internet and Information Systems", DOI={10.3837/tiis.2024.07.003}, volume={18}, number={7}, year="2024", month={July}, pages={1749-1773}}