Abstract
The Sort Transform (ST) can significantly speed up the block sorting
phase of the Burrows-Wheeler transform (BWT) by sorting only limited
order contexts. However, the best result obtained so far for the inverse
ST has a time complexity O(Nlogk) and a space complexity O(N), where N and k
are the text size and the context order of the transform, respectively.
In this paper, we present a novel algorithm that can compute the
inverse ST in an O(N) time/space complexity, a linear result independent of k. The main idea behind the design of the linear algorithm is a set of cycle properties of k-order
contexts we explored for this work. These newly discovered cycle
properties allow us to quickly compute the longest common prefix (LCP)
between any pair of adjacent k-order contexts that may belong to two different cycles, leading to the proposed linear inverse ST algorithm.
Original language | English |
---|---|
Title of host publication | Combinatorial Pattern Matching |
Subtitle of host publication | 19th Annual Symposium, CPM 2008 Pisa, Italy, June 18-20, 2008, Proceedings |
Editors | Paolo Ferragina, Gad M. Landau |
Publisher | Springer Berlin |
Pages | 178-190 |
Number of pages | 13 |
Edition | 1st |
ISBN (Electronic) | 9783540690689 |
ISBN (Print) | 3540690662, 9783540690665 |
DOIs | |
Publication status | Published - 3 Jun 2008 |
Externally published | Yes |
Event | 19th Annual Symposium on Combinatorial Pattern Matching, CPM 2008 - Pisa, Italy Duration: 18 Jun 2008 → 20 Jun 2008 https://link.springer.com/book/10.1007/978-3-540-69068-9 (Conference proceedings) |
Publication series
Name | Theoretical Computer Science and General Issues |
---|---|
Publisher | Springer |
Volume | 5029 |
ISSN (Print) | 2512-2010 |
ISSN (Electronic) | 2512-2029 |
Conference
Conference | 19th Annual Symposium on Combinatorial Pattern Matching, CPM 2008 |
---|---|
Country/Territory | Italy |
City | Pisa |
Period | 18/06/08 → 20/06/08 |
Internet address |
|
Scopus Subject Areas
- Theoretical Computer Science
- General Computer Science