Skip to main navigation Skip to search Skip to main content

Representation Similarity: A Better Guidance of DNN Layer Sharing for Edge Computing without Training

  • Stony Brook University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Edge computing has emerged as an alternative to reduce transmission and processing delay and preserve privacy of the video streams. However, the ever-increasing complexity of Deep Neural Networks (DNNs) used in video-based applications (e.g. object detection) exerts pressure on memory-constrained edge devices. Model merging is proposed to reduce the DNNs' memory footprint by keeping only one copy of merged layers' weights in memory. In existing model merging techniques, (i) only architecturally identical layers can be shared; (ii) requires computationally expensive retraining in the cloud; (iii) assumes the availability of ground truth for retraining. The re-evaluation of a merged model's performance, however, requires a validation dataset with ground truth, typically runs at the cloud. Common metrics to guide the selection of shared layers include the size or computational cost of shared layers or representation size. We propose a new model merging scheme by sharing representations (i.e., outputs of layers) at the edge, guided by representation similarity S. We show that S is extremely highly correlated with merged model's accuracy with Pearson Correlation Coefficient |r| > 0.94 than other metrics, demonstrating that representation similarity can serve as a strong validation accuracy indicator without ground truth. We present our preliminary results of the newly proposed model merging scheme with identified challenges, demonstrating a promising research future direction.

Original languageEnglish
Title of host publicationACM MobiCom 2024 - Proceedings of the 30th International Conference on Mobile Computing and Networking
PublisherAssociation for Computing Machinery, Inc
Pages2242-2244
Number of pages3
ISBN (Electronic)9798400704895
DOIs
StatePublished - Dec 4 2024
Event30th International Conference on Mobile Computing and Networking, ACM MobiCom 2024 - Washington, United States
Duration: Nov 18 2024Nov 22 2024

Publication series

NameACM MobiCom 2024 - Proceedings of the 30th International Conference on Mobile Computing and Networking

Conference

Conference30th International Conference on Mobile Computing and Networking, ACM MobiCom 2024
Country/TerritoryUnited States
CityWashington
Period11/18/2411/22/24

Fingerprint

Dive into the research topics of 'Representation Similarity: A Better Guidance of DNN Layer Sharing for Edge Computing without Training'. Together they form a unique fingerprint.

Cite this