Inferring strings from Lyndon factorization

研究成果: ジャーナルへの寄稿記事

1 引用 (Scopus)

抄録

The Lyndon factorization of a string w is a unique factorization ℓ1 p1 ,…,ℓm pm of w such that ℓ1,…,ℓm is a sequence of Lyndon words that is monotonically decreasing in lexicographic order. In this paper, we consider the reverse-engineering problem on Lyndon factorization: Given a sequence S=((s1,p1),…,(sm,pm)) of ordered pairs of positive integers, find a string w whose Lyndon factorization corresponds to the input sequence S, i.e., the Lyndon factorization of w is in a form of ℓ1 p1 ,…,ℓm pm with |ℓi|=si for all 1≤i≤m. Firstly, we show that there exists a simple O(n)-time algorithm if the size of the alphabet is unbounded, where n is the length of the output string. Secondly, we present an O(n)-time algorithm to compute a string over an alphabet of the smallest size. Thirdly, we show how to compute only the size of the smallest alphabet in O(m) time. Fourthly, we give an O(m)-time algorithm to compute an O(m)-size representation of a string over an alphabet of the smallest size. Finally, we propose an efficient algorithm to enumerate all strings whose Lyndon factorizations correspond to S.

元の言語英語
ページ(範囲)147-156
ページ数10
ジャーナルTheoretical Computer Science
689
DOI
出版物ステータス出版済み - 8 15 2017

Fingerprint

Factorization
Strings
p.m.
Lyndon Words
Unique factorisation
Lexicographic Order
Ordered pair
Reverse engineering
Reverse Engineering
Efficient Algorithms
Integer
Output

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

これを引用

Inferring strings from Lyndon factorization. / Nakashima, Yuto; Okabe, Takashi; I, Tomohiro; Inenaga, Shunsuke; Bannai, Hideo; Takeda, Masayuki.

:: Theoretical Computer Science, 巻 689, 15.08.2017, p. 147-156.

研究成果: ジャーナルへの寄稿記事

@article{7451e825c7184dfea69320e76fbcff28,
title = "Inferring strings from Lyndon factorization",
abstract = "The Lyndon factorization of a string w is a unique factorization ℓ1 p1 ,…,ℓm pm of w such that ℓ1,…,ℓm is a sequence of Lyndon words that is monotonically decreasing in lexicographic order. In this paper, we consider the reverse-engineering problem on Lyndon factorization: Given a sequence S=((s1,p1),…,(sm,pm)) of ordered pairs of positive integers, find a string w whose Lyndon factorization corresponds to the input sequence S, i.e., the Lyndon factorization of w is in a form of ℓ1 p1 ,…,ℓm pm with |ℓi|=si for all 1≤i≤m. Firstly, we show that there exists a simple O(n)-time algorithm if the size of the alphabet is unbounded, where n is the length of the output string. Secondly, we present an O(n)-time algorithm to compute a string over an alphabet of the smallest size. Thirdly, we show how to compute only the size of the smallest alphabet in O(m) time. Fourthly, we give an O(m)-time algorithm to compute an O(m)-size representation of a string over an alphabet of the smallest size. Finally, we propose an efficient algorithm to enumerate all strings whose Lyndon factorizations correspond to S.",
author = "Yuto Nakashima and Takashi Okabe and Tomohiro I and Shunsuke Inenaga and Hideo Bannai and Masayuki Takeda",
year = "2017",
month = "8",
day = "15",
doi = "10.1016/j.tcs.2017.05.038",
language = "English",
volume = "689",
pages = "147--156",
journal = "Theoretical Computer Science",
issn = "0304-3975",
publisher = "Elsevier",

}

TY - JOUR

T1 - Inferring strings from Lyndon factorization

AU - Nakashima, Yuto

AU - Okabe, Takashi

AU - I, Tomohiro

AU - Inenaga, Shunsuke

AU - Bannai, Hideo

AU - Takeda, Masayuki

PY - 2017/8/15

Y1 - 2017/8/15

N2 - The Lyndon factorization of a string w is a unique factorization ℓ1 p1 ,…,ℓm pm of w such that ℓ1,…,ℓm is a sequence of Lyndon words that is monotonically decreasing in lexicographic order. In this paper, we consider the reverse-engineering problem on Lyndon factorization: Given a sequence S=((s1,p1),…,(sm,pm)) of ordered pairs of positive integers, find a string w whose Lyndon factorization corresponds to the input sequence S, i.e., the Lyndon factorization of w is in a form of ℓ1 p1 ,…,ℓm pm with |ℓi|=si for all 1≤i≤m. Firstly, we show that there exists a simple O(n)-time algorithm if the size of the alphabet is unbounded, where n is the length of the output string. Secondly, we present an O(n)-time algorithm to compute a string over an alphabet of the smallest size. Thirdly, we show how to compute only the size of the smallest alphabet in O(m) time. Fourthly, we give an O(m)-time algorithm to compute an O(m)-size representation of a string over an alphabet of the smallest size. Finally, we propose an efficient algorithm to enumerate all strings whose Lyndon factorizations correspond to S.

AB - The Lyndon factorization of a string w is a unique factorization ℓ1 p1 ,…,ℓm pm of w such that ℓ1,…,ℓm is a sequence of Lyndon words that is monotonically decreasing in lexicographic order. In this paper, we consider the reverse-engineering problem on Lyndon factorization: Given a sequence S=((s1,p1),…,(sm,pm)) of ordered pairs of positive integers, find a string w whose Lyndon factorization corresponds to the input sequence S, i.e., the Lyndon factorization of w is in a form of ℓ1 p1 ,…,ℓm pm with |ℓi|=si for all 1≤i≤m. Firstly, we show that there exists a simple O(n)-time algorithm if the size of the alphabet is unbounded, where n is the length of the output string. Secondly, we present an O(n)-time algorithm to compute a string over an alphabet of the smallest size. Thirdly, we show how to compute only the size of the smallest alphabet in O(m) time. Fourthly, we give an O(m)-time algorithm to compute an O(m)-size representation of a string over an alphabet of the smallest size. Finally, we propose an efficient algorithm to enumerate all strings whose Lyndon factorizations correspond to S.

UR - http://www.scopus.com/inward/record.url?scp=85020825934&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85020825934&partnerID=8YFLogxK

U2 - 10.1016/j.tcs.2017.05.038

DO - 10.1016/j.tcs.2017.05.038

M3 - Article

VL - 689

SP - 147

EP - 156

JO - Theoretical Computer Science

JF - Theoretical Computer Science

SN - 0304-3975

ER -