TY - GEN
T1 - Representation Similarity
T2 - 30th International Conference on Mobile Computing and Networking, ACM MobiCom 2024
AU - Cao, Bryan Bo
AU - Sharma, Abhinav
AU - Singh, Manavjeet
AU - Gandhi, Anshul
AU - Das, Samir
AU - Jain, Shubham
N1 - Publisher Copyright:
© 2024 Copyright is held by the owner/author(s).
PY - 2024/12/4
Y1 - 2024/12/4
N2 - Edge computing has emerged as an alternative to reduce transmission and processing delay and preserve privacy of the video streams. However, the ever-increasing complexity of Deep Neural Networks (DNNs) used in video-based applications (e.g. object detection) exerts pressure on memory-constrained edge devices. Model merging is proposed to reduce the DNNs' memory footprint by keeping only one copy of merged layers' weights in memory. In existing model merging techniques, (i) only architecturally identical layers can be shared; (ii) requires computationally expensive retraining in the cloud; (iii) assumes the availability of ground truth for retraining. The re-evaluation of a merged model's performance, however, requires a validation dataset with ground truth, typically runs at the cloud. Common metrics to guide the selection of shared layers include the size or computational cost of shared layers or representation size. We propose a new model merging scheme by sharing representations (i.e., outputs of layers) at the edge, guided by representation similarity S. We show that S is extremely highly correlated with merged model's accuracy with Pearson Correlation Coefficient |r| > 0.94 than other metrics, demonstrating that representation similarity can serve as a strong validation accuracy indicator without ground truth. We present our preliminary results of the newly proposed model merging scheme with identified challenges, demonstrating a promising research future direction.
AB - Edge computing has emerged as an alternative to reduce transmission and processing delay and preserve privacy of the video streams. However, the ever-increasing complexity of Deep Neural Networks (DNNs) used in video-based applications (e.g. object detection) exerts pressure on memory-constrained edge devices. Model merging is proposed to reduce the DNNs' memory footprint by keeping only one copy of merged layers' weights in memory. In existing model merging techniques, (i) only architecturally identical layers can be shared; (ii) requires computationally expensive retraining in the cloud; (iii) assumes the availability of ground truth for retraining. The re-evaluation of a merged model's performance, however, requires a validation dataset with ground truth, typically runs at the cloud. Common metrics to guide the selection of shared layers include the size or computational cost of shared layers or representation size. We propose a new model merging scheme by sharing representations (i.e., outputs of layers) at the edge, guided by representation similarity S. We show that S is extremely highly correlated with merged model's accuracy with Pearson Correlation Coefficient |r| > 0.94 than other metrics, demonstrating that representation similarity can serve as a strong validation accuracy indicator without ground truth. We present our preliminary results of the newly proposed model merging scheme with identified challenges, demonstrating a promising research future direction.
UR - https://www.scopus.com/pages/publications/105002587784
U2 - 10.1145/3636534.3695903
DO - 10.1145/3636534.3695903
M3 - Conference contribution
AN - SCOPUS:105002587784
T3 - ACM MobiCom 2024 - Proceedings of the 30th International Conference on Mobile Computing and Networking
SP - 2242
EP - 2244
BT - ACM MobiCom 2024 - Proceedings of the 30th International Conference on Mobile Computing and Networking
PB - Association for Computing Machinery, Inc
Y2 - 18 November 2024 through 22 November 2024
ER -