Self Pre-Training with Masked Autoencoders for Medical Image Classification and Segmentation

  • Stony Brook University
  • Amazon.com, Inc.
  • Shanghai AI Laboratory

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

97 Scopus citations

Abstract

Masked Autoencoder (MAE) has recently been shown to be effective in pre-training Vision Transformers (ViT) for natural image analysis. By reconstructing full images from partially masked inputs, a ViT encoder aggregates contextual information to infer masked image regions. We believe that this context aggregation ability is particularly essential to the medical image domain where each anatomical structure is functionally and mechanically connected to other structures and regions. Because there is no ImageNet-scale medical image dataset for pre-training, we investigate a self pre-training paradigm with MAE for medical image analysis tasks. Our method pre-trains a ViT on the training set of the target data instead of another dataset. Thus, self pre-training can benefit more scenarios where pre-training data is hard to acquire. Our experimental results show that MAE self pre-training markedly improves diverse medical image tasks including chest X-ray disease classification, abdominal CT multi-organ segmentation, and MRI brain tumor segmentation. Code is available at https://github.com/cvlab-stonybrook/SelfMedMAE
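
The masking-and-reconstruction idea described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation (see the linked repository for that). The 224×224 image size, 16×16 patch size, and 75% mask ratio are assumptions chosen for illustration, the helper names `patchify`, `random_masking`, and `masked_mse` are hypothetical, and the "prediction" is a trivial stand-in for a ViT encoder and decoder. The loss is computed on masked patches only, as is common in MAE-style training.

```python
import numpy as np


def patchify(image, patch_size):
    """Split an (H, W) image into flattened, non-overlapping square patches."""
    h, w = image.shape
    assert h % patch_size == 0 and w % patch_size == 0
    gh, gw = h // patch_size, w // patch_size
    # (gh, p, gw, p) -> (gh, gw, p, p) -> (gh * gw, p * p)
    patches = image.reshape(gh, patch_size, gw, patch_size).transpose(0, 2, 1, 3)
    return patches.reshape(gh * gw, patch_size * patch_size)


def random_masking(patches, mask_ratio, rng):
    """Keep a random subset of patches.

    Returns the visible patches, their indices, and a boolean mask
    where True marks a masked (hidden) patch.
    """
    n = patches.shape[0]
    n_keep = int(round(n * (1 - mask_ratio)))
    keep_idx = np.sort(rng.permutation(n)[:n_keep])
    mask = np.ones(n, dtype=bool)
    mask[keep_idx] = False
    return patches[keep_idx], keep_idx, mask


def masked_mse(pred, target, mask):
    """Mean squared error averaged over masked patches only."""
    per_patch = ((pred - target) ** 2).mean(axis=1)
    return per_patch[mask].mean()


rng = np.random.default_rng(0)
image = rng.random((224, 224))          # synthetic stand-in for a medical image
patches = patchify(image, 16)           # assumed 16x16 patches
visible, keep_idx, mask = random_masking(patches, 0.75, rng)  # assumed ratio

# Stand-in for encoder + decoder: predict every patch as the mean visible patch.
pred = np.tile(visible.mean(axis=0), (patches.shape[0], 1))
loss = masked_mse(pred, patches, mask)
```

In a real setup the encoder would see only `visible` (plus positional information derived from `keep_idx`), and a lightweight decoder would produce `pred` for all patch positions. In the self pre-training setting described above, the images fed to this procedure would come from the target task's own training set.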

Original language: English
Title of host publication: 2023 IEEE International Symposium on Biomedical Imaging, ISBI 2023
Publisher: IEEE Computer Society
ISBN (Electronic): 9781665473583
State: Published - 2023
Event: 20th IEEE International Symposium on Biomedical Imaging, ISBI 2023 - Cartagena, Colombia
Duration: Apr 18 2023 to Apr 21 2023

Publication series

Name: Proceedings - International Symposium on Biomedical Imaging
Volume: 2023-April
ISSN (Print): 1945-7928
ISSN (Electronic): 1945-8452

Conference

Conference: 20th IEEE International Symposium on Biomedical Imaging, ISBI 2023
Country/Territory: Colombia
City: Cartagena
Period: 04/18/23 to 04/21/23
