Skip to main navigation Skip to search Skip to main content

Lempel-Ziv factorization: Simple, fast, practical

  • University of Helsinki

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

36 Scopus citations

Abstract

For decades the Lempel-Ziv (LZ77) factorization has been a cornerstone of data compression and string processing algorithms, and uses for it are still being uncovered. For example, LZ77 is central to several recent text indexing data structures designed to search highly repetitive collections. However, in many applications computation of the factorization remains a bottleneck in practice. In this paper we describe simple and fast algorithms for computing the LZ77 factorization. These new methods consistently outperform all previous approaches in practice, use less memory, and still offer strong worstcase performance guarantees. A common feature of the new algorithms is their avoidance of the longest-common-prefix array, essential to nearly all prior art.

Original languageEnglish
Title of host publication2013 Proceedings of the 15th Workshop on Algorithm Engineering and Experiments, ALENEX 2013
EditorsNorbert Zeh, Peter Sanders
PublisherSociety for Industrial and Applied Mathematics Publications
Pages103-112
Number of pages10
ISBN (Electronic)9781611972535
DOIs
StatePublished - 2013
Event15th Workshop on Algorithm Engineering and Experiments, ALENEX 2013 - New Orleans, United States
Duration: Jan 7 2013 → …

Publication series

NameProceedings of the Workshop on Algorithm Engineering and Experiments
ISSN (Print)2164-0300

Conference

Conference15th Workshop on Algorithm Engineering and Experiments, ALENEX 2013
Country/TerritoryUnited States
CityNew Orleans
Period01/7/13 → …

Fingerprint

Dive into the research topics of 'Lempel-Ziv factorization: Simple, fast, practical'. Together they form a unique fingerprint.

Cite this