Online LZ77 parsing and matching statistics with RLBWTs

Hideo Bannai, Travis Gagie, Tomohiro I

研究成果: Chapter in Book/Report/Conference proceedingConference contribution

3 引用 (Scopus)

抜粋

Lempel-Ziv 1977 (LZ77) parsing, matching statistics and the Burrows-Wheeler Transform (BWT) are all fundamental elements of stringology. In a series of recent papers, Policriti and Prezza (DCC 2016 and Algorithmica, CPM 2017) showed how we can use an augmented run-length compressed BWT (RLBWT) of the reverse TR of a text T, to compute offline the LZ77 parse of T in O(n log r) time and O(r) space, where n is the length of T and r is the number of runs in the BWT of TR. In this paper we first extend a well-known technique for updating an unaugmented RLBWT when a character is prepended to a text, to work with Policriti and Prezza's augmented RLBWT. This immediately implies that we can build online the LZ77 parse of T while still using O(n log r) time and O(r) space; it also seems likely to be of independent interest. Our experiments, using an extension of Ohno, Takabatake, I and Sakamoto's (IWOCA 2017) implementation of updating, show our approach is both time- and space-efficient for repetitive strings. We then show how to augment the RLBWT further -albeit making it static again and increasing its space by a factor proportional to the size of the alphabet -such that later, given another string S and O(log log n)-time random access to T, we can compute the matching statistics of S with respect to T in O(|S| log log n) time.

元の言語英語
ホスト出版物のタイトル29th Annual Symposium on Combinatorial Pattern Matching, CPM 2018
編集者Binhai Zhu, Gonzalo Navarro, David Sankoff
出版者Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
ページ71-712
ページ数642
ISBN(電子版)9783959770743
DOI
出版物ステータス出版済み - 5 1 2018
イベント29th Annual Symposium on Combinatorial Pattern Matching, CPM 2018 - Qingdao, 中国
継続期間: 7 2 20187 4 2018

出版物シリーズ

名前Leibniz International Proceedings in Informatics, LIPIcs
105
ISSN(印刷物)1868-8969

その他

その他29th Annual Symposium on Combinatorial Pattern Matching, CPM 2018
中国
Qingdao
期間7/2/187/4/18

All Science Journal Classification (ASJC) codes

  • Software

フィンガープリント Online LZ77 parsing and matching statistics with RLBWTs' の研究トピックを掘り下げます。これらはともに一意のフィンガープリントを構成します。

  • これを引用

    Bannai, H., Gagie, T., & I, T. (2018). Online LZ77 parsing and matching statistics with RLBWTs. : B. Zhu, G. Navarro, & D. Sankoff (版), 29th Annual Symposium on Combinatorial Pattern Matching, CPM 2018 (pp. 71-712). (Leibniz International Proceedings in Informatics, LIPIcs; 巻数 105). Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing. https://doi.org/10.4230/LIPIcs.CPM.2018.7