Skip to main navigation Skip to search Skip to main content

K-Means clustering on two-level memory systems

  • Michael A. Bender
  • , Jonathan Berry
  • , Simon D. Hammond
  • , Branden Moore
  • , Benjamin Moseley
  • , Cynthia A. Phillips
  • Sandia National Laboratories, New Mexico
  • Washington University St. Louis

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

11 Scopus citations

Abstract

In recent work we quantified the anticipated performance boost when a sorting algorithm is modified to leverage user- Addressable "near-memory," which we call scratchpad. This architectural feature is expected in the Intel Knight's Land- ing processors that will be used in DOE's next large-scale supercomputer. This paper expands our analytical study of the scratch- pad to consider k-means clustering, a classical data-analysis technique that is ubiquitous in the literature and in prac- Tice. We present new theoretical results using the model introduced in [13], which measures memory transfers and assumes that computations are memory-bound. Our the- oretical results indicate that scratchpad-aware versions of k-means clustering can expect performance boosts for high- dimensional instances with relatively few cluster centers. These constraints may limit the practical impact of scratch- pad for k-means acceleration, so we discuss their origins and practical implications. We corroborate our theory with ex- perimental runs on a system instrumented to mimic one with scratchpad memory. We also contribute a semi-formalization of the computa- Tional properties that are necessary and sufficient to predict a performance boost from scratchpad-aware variants of al- gorithms. We have observed and studied these properties in the context of sorting, and now clustering. We conclude with some thoughts on the application of these properties to new areas. Specifically, we believe that dense linear algebra has similar properties to k-means, while sparse linear algebra and FFT computations are more sim-ilar to sorting. The sparse operations are more common in scientific computing, so we expect scratchpad to have signif- icant impact in that area.

Original languageEnglish
Title of host publicationMEMSYS 2015 - Proceedings of the 1st International Symposium on Memory Systems
PublisherAssociation for Computing Machinery
Pages197-205
Number of pages9
ISBN (Electronic)9781450336048
DOIs
StatePublished - Oct 5 2015
Event1st International Symposium on Memory Systems, MEMSYS 2015 - Washington, United States
Duration: Aug 14 2015Aug 15 2015

Publication series

NameACM International Conference Proceeding Series
Volume05-08-October-2015

Conference

Conference1st International Symposium on Memory Systems, MEMSYS 2015
Country/TerritoryUnited States
CityWashington
Period08/14/1508/15/15

Keywords

  • K- means
  • Scratchpad
  • Two-level-memory
  • Variable bandwidth

Fingerprint

Dive into the research topics of 'K-Means clustering on two-level memory systems'. Together they form a unique fingerprint.

Cite this