TY - GEN
T1 - The minimum DAWG for all suffixes of a string and its applications
AU - Inenaga, Shunsuke
AU - Takeda, Masayuki
AU - Shinohara, Ayumi
AU - Hoshino, Hiromasa
AU - Arikawa, Setsuo
PY - 2002
Y1 - 2002
N2 - For a string w over an alphabet Σ, we consider a composite data structure called the all-suffixes directed acyclic word graph (ASDAWG). ASDAWG (w) has |w| + 1 initial nodes, and the dag induced by all reachable nodes from the k-th initial node conforms with DAWG(w[k :]), where w[k :] denotes the k-th suffix of w. We prove that the size of the minimum ASDAWG(w) (MASDAWG(w)) is Θ(|w|2) for |Σ| = 1, and is Θ(|w|) for |Σ| ≥ 2. Moreover, we introduce an on-line algorithm which directly constructs MASDAWG(w) for given w, whose running time is linear with respect to its size. We also demonstrate some application problems, beginning-sensitive pattern matching, region-sensitive pattern matching, and VLDC-pattern matching, for which AS-DAWGs are useful.
AB - For a string w over an alphabet Σ, we consider a composite data structure called the all-suffixes directed acyclic word graph (ASDAWG). ASDAWG (w) has |w| + 1 initial nodes, and the dag induced by all reachable nodes from the k-th initial node conforms with DAWG(w[k :]), where w[k :] denotes the k-th suffix of w. We prove that the size of the minimum ASDAWG(w) (MASDAWG(w)) is Θ(|w|2) for |Σ| = 1, and is Θ(|w|) for |Σ| ≥ 2. Moreover, we introduce an on-line algorithm which directly constructs MASDAWG(w) for given w, whose running time is linear with respect to its size. We also demonstrate some application problems, beginning-sensitive pattern matching, region-sensitive pattern matching, and VLDC-pattern matching, for which AS-DAWGs are useful.
UR - http://www.scopus.com/inward/record.url?scp=23044534111&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=23044534111&partnerID=8YFLogxK
U2 - 10.1007/3-540-45452-7_14
DO - 10.1007/3-540-45452-7_14
M3 - Conference contribution
AN - SCOPUS:23044534111
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 153
EP - 167
BT - Combinatorial Pattern Matching - 13th Annual Symposium, CPM 2002, Proceedings
A2 - Apostolico, Alberto
A2 - Takeda, Masayuki
PB - Springer Verlag
T2 - 13th Annual Symposium on Combinatorial Pattern Matching, CPM 2002
Y2 - 3 July 2002 through 5 July 2002
ER -