Skip to main navigation Skip to search Skip to main content

Cloud-based whole slide image analysis using mapreduce

  • Hoang Vo
  • , Jun Kong
  • , Dejun Teng
  • , Yanhui Liang
  • , Ablimit Aji
  • , George Teodoro
  • , Fusheng Wang
  • Stony Brook University
  • Emory University
  • Ohio State University
  • Hewlett-Packard
  • Universidade de Brasília

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Systematic analysis of high resolution whole slide images enables more effective diagnosis, prognosis and prediction of cancer and other important diseases. Due to the enormous sizes and dimensions of whole slide images, the analysis requires extensive computing resources which are not commonly available. Images have to be divided into smaller regions for processing due to computer memory limitations, which will lead to inaccurate results due to the ignorance of boundary crossing objects. In this paper, we propose a highly scalable and cost effective MapReduce based image analysis framework for whole slide image processing, and provide a cloud based implementation. The framework takes a grid-based overlapping partitioning scheme, and provides parallelization of image segmentation based on MapReduce. It provides graceful handling of boundary objects with a highly efficient spatial indexing based matching method, thus avoiding loss of accuracy due to partitioning. We demonstrate that the system achieves high scalability and is cost-effective – our experiments demonstrate that it costs less than fifteen cents to analyze one image on average using Amazon Elastic MapReduce.

Original languageEnglish
Title of host publicationData Management and Analytics for Medicine and Healthcare - 2nd International Workshop, DMAH 2016 Held at VLDB 2016, Revised Selected Papers
EditorsLixia Yao, Fusheng Wang, Gang Luo
PublisherSpringer Verlag
Pages62-67
Number of pages6
ISBN (Print)9783319577401
DOIs
StatePublished - 2017
Event2nd International Workshop on Data Management and Analytics for Medicine and Healthcare, DMAH 2016 held in conjunction with 42nd International Conference on Very Large Data Bases, VLDB 2016 - New Delhi, India
Duration: Sep 5 2016Sep 9 2016

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10186 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference2nd International Workshop on Data Management and Analytics for Medicine and Healthcare, DMAH 2016 held in conjunction with 42nd International Conference on Very Large Data Bases, VLDB 2016
Country/TerritoryIndia
CityNew Delhi
Period09/5/1609/9/16

Keywords

  • Cloud computing
  • MapReduce
  • Pathology image analysis
  • Whole slide images

Fingerprint

Dive into the research topics of 'Cloud-based whole slide image analysis using mapreduce'. Together they form a unique fingerprint.

Cite this