Abstract
We consider grammar-based text compression with longest-first substitution, where non-overlapping occurrences of a longest repeating substring of the input text are replaced by a new non-terminal symbol. We present a new text compression algorithm by simplifying the algorithm presented in [4]. We give a new formulation of the correctness proof, introducing the sparse lazy suffix tree data structure. We also present another type of longest-first substitution strategy that allows better compression. We show results of preliminary experiments comparing grammar sizes of the two versions of the longest-first strategy and the most-frequent strategy.
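The substitution rule described in the abstract can be illustrated with a small brute-force sketch. This is not the algorithm of the paper, which relies on a sparse lazy suffix tree; it is a naive illustration that searches only the current start string, and it assumes that only repeats of length at least 2 are replaced. The function names (`longest_repeat`, `replace_all`, `longest_first`, `expand`) and the tuple encoding of non-terminals are hypothetical choices made for this example.

```python
def longest_repeat(seq):
    """Return a longest factor (length >= 2) with two non-overlapping occurrences, or None."""
    n = len(seq)
    for length in range(n // 2, 1, -1):
        first_seen = {}
        for i in range(n - length + 1):
            key = tuple(seq[i:i + length])
            j = first_seen.setdefault(key, i)  # leftmost occurrence of this factor
            if i - j >= length:                # far enough apart: no overlap
                return key
    return None


def replace_all(seq, factor, symbol):
    """Replace non-overlapping occurrences of factor, scanning left to right."""
    out, i, k = [], 0, len(factor)
    while i < len(seq):
        if tuple(seq[i:i + k]) == factor:
            out.append(symbol)
            i += k
        else:
            out.append(seq[i])
            i += 1
    return out


def longest_first(text):
    """Naive longest-first substitution; returns (start string, rules)."""
    seq = list(text)
    rules = {}
    while True:
        factor = longest_repeat(seq)
        if factor is None:
            break
        # Hypothetical naming: tuples cannot collide with single characters.
        symbol = ("N", len(rules) + 1)
        rules[symbol] = list(factor)
        seq = replace_all(seq, factor, symbol)
    return seq, rules


def expand(seq, rules):
    """Expand non-terminals recursively to recover the original symbols."""
    out = []
    for s in seq:
        if s in rules:
            out.extend(expand(rules[s], rules))
        else:
            out.append(s)
    return out
```

Calling `longest_first(text)` and then `"".join(expand(seq, rules))` should reproduce `text`, which is a convenient sanity check. The search is far slower than a suffix-tree approach and is meant only to make the replacement rule concrete.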
Original language | English |
---|---|
Title of host publication | Proceedings - DCC 2007: 2007 Data Compression Conference |
Pages | 123-132 |
Number of pages | 10 |
DOI | |
Publication status | Published - 2007 |
Event | DCC 2007: 2007 Data Compression Conference - Snowbird, UT, United States Duration: Mar 27 2007 → Mar 29 2007 |
Other
Other | DCC 2007: 2007 Data Compression Conference |
---|---|
Country/Territory | United States |
City | Snowbird, UT |
Period | 3/27/07 → 3/29/07 |
All Science Journal Classification (ASJC) codes
- Electrical and Electronic Engineering
- Hardware and Architecture