Skip to main navigation Skip to search Skip to main content

HashVFL: Defending Against Data Reconstruction Attacks in Vertical Federated Learning

  • Pengyu Qiu
  • , Xuhong Zhang
  • , Shouling Ji
  • , Chong Fu
  • , Xing Yang
  • , Ting Wang
  • Zhejiang University
  • National University of Defense Technology

Research output: Contribution to journalArticlepeer-review

25 Scopus citations

Abstract

Vertical Federated Learning (VFL) is a trending collaborative machine learning model training solution. Existing industrial frameworks employ secure multi-party computation techniques such as homomorphic encryption to ensure data security and privacy. Despite these efforts, studies have revealed that data leakage remains a risk in VFL due to the correlations between intermediate representations and raw data. Neural networks can accurately capture these correlations, allowing an adversary to reconstruct the data. This emphasizes the need for continued research into securing VFL systems. Our work shows that hashing is a promising solution to counter data reconstruction attacks. The one-way nature of hashing makes it difficult for an adversary to recover data from hash codes. However, implementing hashing in VFL presents new challenges, including vanishing gradients and information loss. To address these issues, we propose HashVFL, which integrates hashing and simultaneously achieves learnability, bit balance, and consistency. Experimental results indicate that HashVFL effectively maintains task performance while defending against data reconstruction attacks. It also brings additional benefits in reducing the degree of label leakage, mitigating adversarial attacks, and detecting abnormal inputs. We hope our work will inspire further research into the potential applications of HashVFL.

Original languageEnglish
Pages (from-to)3435-3450
Number of pages16
JournalIEEE Transactions on Information Forensics and Security
Volume19
DOIs
StatePublished - 2024

Keywords

  • Vertical federated learning
  • deep hashing

Fingerprint

Dive into the research topics of 'HashVFL: Defending Against Data Reconstruction Attacks in Vertical Federated Learning'. Together they form a unique fingerprint.

Cite this