Abstract
The suffix array is frequently augmented with the longest-common-prefix (LCP) array that stores the lengths of the longest common prefixes between lexicographically adjacent suffixes of a text. While the sum of the values in the LCP array can be Ω(n2) for a text of length n, the sum of so-called irreducible LCP values was shown to be O(nlgn) a few years ago. In this paper, we improve the bound to O(nlgr), where r≤n is the number of runs in the Burrows–Wheeler transform of the text. We also show that our bound is tight up to lower order terms (unlike the previous bound). Our results and the techniques used in proving them provide new insights into the combinatorics of text indexing and compression, and have immediate applications to LCP array construction algorithms.
| Original language | English |
|---|---|
| Pages (from-to) | 265-278 |
| Number of pages | 14 |
| Journal | Theoretical Computer Science |
| Volume | 656 |
| DOIs | |
| State | Published - Dec 20 2016 |
Keywords
- Burrows–Wheeler transform
- Irreducible LCP values
- LCP array
- Suffix array
Fingerprint
Dive into the research topics of 'Tighter bounds for the sum of irreducible LCP values'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver