Max-Shift BM and Max-Shift Horspool: Practical Fast Exact String Matching Algorithms

Mohammed, Sahli; Tetsuo, Shibuya; Mohammed, Sahli; Tetsuo, Shibuya

WEKO3

インデックスツリー

RootNode

アイテム

Max-Shift BM and Max-Shift Horspool: Practical Fast Exact String Matching Algorithms

https://ipsj.ixsq.nii.ac.jp/records/81778

名前 / ファイル	ライセンス	アクション
IPSJ-JNL5304020 (1.3 MB)	Copyright (c) 2012 by the Information Processing Society of Japan
オープンアクセス

Item type

Journal(1)

公開日

2012-04-15

タイトル

Max-Shift BM and Max-Shift Horspool: Practical Fast Exact String Matching Algorithms

タイトル

言語

タイトル

Max-Shift BM and Max-Shift Horspool: Practical Fast Exact String Matching Algorithms

言語

eng

キーワード

主題Scheme

Other

主題

一般論文

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_6501

資源タイプ

journal article

著者所属

Department of Computer Science, Graduate School of Information Science and Technology, The University of Tokyo

著者所属

Human Genome Center, Institute of Medical Science, The University of Tokyo

著者所属(英)

Department of Computer Science, Graduate School of Information Science and Technology, The University of Tokyo

著者所属(英)

Human Genome Center, Institute of Medical Science, The University of Tokyo

著者名

Mohammed, Sahli
Tetsuo, Shibuya

著者名(英)

Mohammed, Sahli
Tetsuo, Shibuya

論文抄録

内容記述タイプ

Other

内容記述

Exact string matching is the problem of finding all occurrences of a pattern P in a text T. The problem is well-known and many sophisticated algorithms have been proposed. Some fast exact string matching algorithms have been described since the 80s (e.g., the Boyer-Moore algorithm and its simplified version the Boyer-Moore-Horspool algorithm). They have been regarded as the standard benchmarks for the practical exact string search literature. In this paper, we propose two algorithms MSBM (Max-Shift BM) and MSH (Max-Shift BMH) both based on the combination of the bad-character rule of the right-most character used in the Boyer-Moore-Horspool algorithm, the extended bad-character rule and the good-suffix rule used in the Gusfield algorithm, which is a modification of the Boyer-Moore algorithm. Only a small extra space and preprocessing time are needed with respect to the BM and BMH algorithms. Nonetheless, empirical results on different data (DNA, Protein and Text Web) with different pattern lengths show that both MSBM and MSH are very fast in practice. MSBM algorithm usually won against other algorithms.
------------------------------
This is a preprint of an article intended for publication Journal of Information Processing(JIP). This preprint should not be cited. This article should be cited as: Journal of Information Processing Vol.20(2012) No.2 (online)
DOI　http://dx.doi.org/10.2197/ipsjjip.20.419
------------------------------

論文抄録(英)

内容記述タイプ

Other

内容記述

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN00116647

書誌情報

情報処理学会論文誌

巻 53, 号 4, 発行日 2012-04-15

ISSN

収録物識別子タイプ

ISSN

収録物識別子

1882-7764

戻る

views

See details

	Views

Versions

Ver.1

2025-01-20 06:51:53.625666

Show All versions

Cite as

Mohammed, Sahli, Tetsuo, Shibuya, 2012.

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

Max-Shift BM and Max-Shift Horspool: Practical Fast Exact String Matching Algorithms

× Mohammed, Sahli

× Tetsuo, Shibuya

× Mohammed, Sahli

× Tetsuo, Shibuya

Versions

Share

Cite as

エクスポート