Skip to main navigation Skip to search Skip to main content

ViT-DAE: Transformer-Driven Diffusion Autoencoder for Histopathology Image Analysis

  • Stony Brook University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

14 Scopus citations

Abstract

Generative AI has received substantial attention in recent years due to its ability to synthesize data that closely resembles the original data source. While Generative Adversarial Networks (GANs) have provided innovative approaches for histopathological image analysis, they suffer from limitations such as mode collapse and overfitting in discriminator. Recently, Denoising Diffusion models have demonstrated promising results in computer vision. These models exhibit superior stability during training, better distribution coverage, and produce high-quality diverse images. Additionally, they display a high degree of resilience to noise and perturbations, making them well-suited for use in digital pathology, where images commonly contain artifacts and exhibit significant variations in staining. In this paper, we present a novel approach, namely ViT-DAE, which integrates vision transformers (ViT) and diffusion autoencoders for high-quality histopathology image synthesis. This marks the first time that ViT has been introduced to diffusion autoencoders in computational pathology, allowing the model to better capture the complex and intricate details of histopathology images. We demonstrate the effectiveness of ViT-DAE on three publicly available datasets. Our approach outperforms recent GAN-based and vanilla DAE methods in generating realistic images.

Original languageEnglish
Title of host publicationDeep Generative Models - Third MICCAI Workshop, DGM4MICCAI 2023, Held in Conjunction with MICCAI 2023, Proceedings
EditorsAnirban Mukhopadhyay, Ilkay Oksuz, Sandy Engelhardt, Dajiang Zhu, Yixuan Yuan
PublisherSpringer Science and Business Media Deutschland GmbH
Pages66-76
Number of pages11
ISBN (Print)9783031537660
DOIs
StatePublished - 2024
Event3rd Workshop on Deep Generative Models for Medical Image Computing and Computer Assisted Intervention, DGM4MICCAI 2023 Held in Conjunction with 26th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2023 - Vancouver, Canada
Duration: Oct 8 2023Oct 12 2023

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14533 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference3rd Workshop on Deep Generative Models for Medical Image Computing and Computer Assisted Intervention, DGM4MICCAI 2023 Held in Conjunction with 26th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2023
Country/TerritoryCanada
CityVancouver
Period10/8/2310/12/23

Keywords

  • Diffusion Autoencoders
  • Histopathology
  • Vision Transformers

Fingerprint

Dive into the research topics of 'ViT-DAE: Transformer-Driven Diffusion Autoencoder for Histopathology Image Analysis'. Together they form a unique fingerprint.

Cite this