TY - GEN
T1 - Computing Inverse ST in Linear Complexity
AU - Nong, Ge
AU - Zhang, Sen
AU - Chan, Wai Hong
N1 - Publisher copyright:
© 2008 Springer-Verlag Berlin Heidelberg
PY - 2008/6/3
Y1 - 2008/6/3
N2 - The Sort Transform (ST) can significantly speed up the block sorting
phase of the Burrows-Wheeler transform (BWT) by sorting only limited
order contexts. However, the best result obtained so far for the inverse
ST has a time complexity O(Nlogk) and a space complexity O(N), where N and k
are the text size and the context order of the transform, respectively.
In this paper, we present a novel algorithm that can compute the
inverse ST in an O(N) time/space complexity, a linear result independent of k. The main idea behind the design of the linear algorithm is a set of cycle properties of k-order
contexts we explored for this work. These newly discovered cycle
properties allow us to quickly compute the longest common prefix (LCP)
between any pair of adjacent k-order contexts that may belong to two different cycles, leading to the proposed linear inverse ST algorithm.
AB - The Sort Transform (ST) can significantly speed up the block sorting
phase of the Burrows-Wheeler transform (BWT) by sorting only limited
order contexts. However, the best result obtained so far for the inverse
ST has a time complexity O(Nlogk) and a space complexity O(N), where N and k
are the text size and the context order of the transform, respectively.
In this paper, we present a novel algorithm that can compute the
inverse ST in an O(N) time/space complexity, a linear result independent of k. The main idea behind the design of the linear algorithm is a set of cycle properties of k-order
contexts we explored for this work. These newly discovered cycle
properties allow us to quickly compute the longest common prefix (LCP)
between any pair of adjacent k-order contexts that may belong to two different cycles, leading to the proposed linear inverse ST algorithm.
UR - http://www.scopus.com/inward/record.url?scp=45849125009&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-69068-9_18
DO - 10.1007/978-3-540-69068-9_18
M3 - Conference contribution
AN - SCOPUS:45849125009
SN - 3540690662
SN - 9783540690665
T3 - Theoretical Computer Science and General Issues
SP - 178
EP - 190
BT - Combinatorial Pattern Matching
A2 - Ferragina, Paolo
A2 - Landau, Gad M.
PB - Springer Berlin
T2 - 19th Annual Symposium on Combinatorial Pattern Matching, CPM 2008
Y2 - 18 June 2008 through 20 June 2008
ER -