A fast string-searching algorithm for multiple patterns

Noriyoshi Uratani, Masayuki Takeda

Research output: Contribution to journalArticle

4 Citations (Scopus)

Abstract

The string-searching problem is to find all occurrences of pattern(s) in a text string. The Aho-Corasick string searching algorithm simultaneously finds all occurrences of multiple patterns in one pass through the text. On the other hand, the Boyer-Moore algorithm is understood to be the fastest algorithm for a single pattern. By combining the ideas of these two algorithms, we present an efficient string searching algorithm for multiple patterns. The algorithm runs in sublinear time, on the average, as the BM algorithm achieves, and its preprocessing time is linear proportional to the sum of the lengths of the patterns like the AC algorithm.

Original languageEnglish
Pages (from-to)775-791
Number of pages17
JournalInformation Processing and Management
Volume29
Issue number6
DOIs
Publication statusPublished - Jan 1 1993

Fingerprint

String searching algorithms

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Media Technology
  • Computer Science Applications
  • Management Science and Operations Research
  • Library and Information Sciences

Cite this

A fast string-searching algorithm for multiple patterns. / Uratani, Noriyoshi; Takeda, Masayuki.

In: Information Processing and Management, Vol. 29, No. 6, 01.01.1993, p. 775-791.

Research output: Contribution to journalArticle

@article{b5007eddd37341ce8ac9bc0679327c3f,
title = "A fast string-searching algorithm for multiple patterns",
abstract = "The string-searching problem is to find all occurrences of pattern(s) in a text string. The Aho-Corasick string searching algorithm simultaneously finds all occurrences of multiple patterns in one pass through the text. On the other hand, the Boyer-Moore algorithm is understood to be the fastest algorithm for a single pattern. By combining the ideas of these two algorithms, we present an efficient string searching algorithm for multiple patterns. The algorithm runs in sublinear time, on the average, as the BM algorithm achieves, and its preprocessing time is linear proportional to the sum of the lengths of the patterns like the AC algorithm.",
author = "Noriyoshi Uratani and Masayuki Takeda",
year = "1993",
month = "1",
day = "1",
doi = "10.1016/0306-4573(93)90106-N",
language = "English",
volume = "29",
pages = "775--791",
journal = "Information Processing and Management",
issn = "0306-4573",
publisher = "Elsevier Limited",
number = "6",

}

TY - JOUR

T1 - A fast string-searching algorithm for multiple patterns

AU - Uratani, Noriyoshi

AU - Takeda, Masayuki

PY - 1993/1/1

Y1 - 1993/1/1

N2 - The string-searching problem is to find all occurrences of pattern(s) in a text string. The Aho-Corasick string searching algorithm simultaneously finds all occurrences of multiple patterns in one pass through the text. On the other hand, the Boyer-Moore algorithm is understood to be the fastest algorithm for a single pattern. By combining the ideas of these two algorithms, we present an efficient string searching algorithm for multiple patterns. The algorithm runs in sublinear time, on the average, as the BM algorithm achieves, and its preprocessing time is linear proportional to the sum of the lengths of the patterns like the AC algorithm.

AB - The string-searching problem is to find all occurrences of pattern(s) in a text string. The Aho-Corasick string searching algorithm simultaneously finds all occurrences of multiple patterns in one pass through the text. On the other hand, the Boyer-Moore algorithm is understood to be the fastest algorithm for a single pattern. By combining the ideas of these two algorithms, we present an efficient string searching algorithm for multiple patterns. The algorithm runs in sublinear time, on the average, as the BM algorithm achieves, and its preprocessing time is linear proportional to the sum of the lengths of the patterns like the AC algorithm.

UR - http://www.scopus.com/inward/record.url?scp=28044472203&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=28044472203&partnerID=8YFLogxK

U2 - 10.1016/0306-4573(93)90106-N

DO - 10.1016/0306-4573(93)90106-N

M3 - Article

AN - SCOPUS:28044472203

VL - 29

SP - 775

EP - 791

JO - Information Processing and Management

JF - Information Processing and Management

SN - 0306-4573

IS - 6

ER -