Skip to main navigation Skip to search Skip to main content

Representation Recovering for Self-Supervised Pre-training on Medical Images

  • Xiangyi Yan
  • , Junayed Naushad
  • , Shanlin Sun
  • , Kun Han
  • , Hao Tang
  • , Deying Kong
  • , Haoyu Ma
  • , Chenyu You
  • , Xiaohui Xie
  • University of California at Irvine
  • Yale University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

13 Scopus citations

Abstract

Advances in self-supervised learning have drawn attention to developing techniques to extract effective visual representations from unlabeled images. Contrastive learning (CL) trains a model to extract consistent features by generating different views. Recent success of Masked Autoencoders (MAE) highlights the benefit of generative modeling in self-supervised learning. The generative approaches encode the input into a compact embedding and empower the model's ability of recovering the original input. However, in our experiments, we found vanilla MAE mainly recovers coarse high level semantic information and is inadequate in recovering detailed low level information. We show that in dense downstream prediction tasks like multi-organ segmentation, directly applying MAE is not ideal. Here, we propose RepRec, a hybrid visual representation learning framework for self-supervised pre-training on large-scale unlabelled medical datasets, which takes advantage of both contrastive and generative modeling. To solve the aforementioned dilemma that MAE encounters, a convolutional encoder is pre-trained to provide low-level feature information, in a contrastive way; and a transformer encoder is pre-trained to produce high level semantic dependency, in a generative way - by recovering masked representations from the convolutional encoder. Extensive experiments on three multi-organ segmentation datasets demonstrate that our method outperforms current state-of-the-art methods.

Original languageEnglish
Title of host publicationProceedings - 2023 IEEE Winter Conference on Applications of Computer Vision, WACV 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages2684-2694
Number of pages11
ISBN (Electronic)9781665493468
DOIs
StatePublished - 2023
Event23rd IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2023 - Waikoloa, United States
Duration: Jan 3 2023Jan 7 2023

Publication series

NameProceedings - 2023 IEEE Winter Conference on Applications of Computer Vision, WACV 2023

Conference

Conference23rd IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2023
Country/TerritoryUnited States
CityWaikoloa
Period01/3/2301/7/23

Keywords

  • and algorithms (including transfer, low-shot, semi-, self-, and un-supervised learning)
  • Applications: Biomedical/healthcare/medicine
  • formulations
  • Image recognition and understanding (object detection, categorization, segmentation, scene modeling, visual reasoning)
  • Machine learning architectures

Fingerprint

Dive into the research topics of 'Representation Recovering for Self-Supervised Pre-training on Medical Images'. Together they form a unique fingerprint.

Cite this