ログイン 新規登録
言語:

WEKO3

  • トップ
  • ランキング


インデックスリンク

インデックスツリー

  • RootNode

メールアドレスを入力してください。

WEKO

One fine body…

WEKO

One fine body…

アイテム

  1. 論文誌(ジャーナル)
  2. Vol.53
  3. No.4

Max-Shift BM and Max-Shift Horspool: Practical Fast Exact String Matching Algorithms

https://ipsj.ixsq.nii.ac.jp/records/81778
https://ipsj.ixsq.nii.ac.jp/records/81778
3401371a-2a7c-42af-b557-140e8a40393f
名前 / ファイル ライセンス アクション
IPSJ-JNL5304020.pdf IPSJ-JNL5304020 (1.3 MB)
Copyright (c) 2012 by the Information Processing Society of Japan
オープンアクセス
Item type Journal(1)
公開日 2012-04-15
タイトル
タイトル Max-Shift BM and Max-Shift Horspool: Practical Fast Exact String Matching Algorithms
タイトル
言語 en
タイトル Max-Shift BM and Max-Shift Horspool: Practical Fast Exact String Matching Algorithms
言語
言語 eng
キーワード
主題Scheme Other
主題 一般論文
資源タイプ
資源タイプ識別子 http://purl.org/coar/resource_type/c_6501
資源タイプ journal article
著者所属
Department of Computer Science, Graduate School of Information Science and Technology, The University of Tokyo
著者所属
Human Genome Center, Institute of Medical Science, The University of Tokyo
著者所属(英)
en
Department of Computer Science, Graduate School of Information Science and Technology, The University of Tokyo
著者所属(英)
en
Human Genome Center, Institute of Medical Science, The University of Tokyo
著者名 Mohammed, Sahli

× Mohammed, Sahli

Mohammed, Sahli

Search repository
Tetsuo, Shibuya

× Tetsuo, Shibuya

Tetsuo, Shibuya

Search repository
著者名(英) Mohammed, Sahli

× Mohammed, Sahli

en Mohammed, Sahli

Search repository
Tetsuo, Shibuya

× Tetsuo, Shibuya

en Tetsuo, Shibuya

Search repository
論文抄録
内容記述タイプ Other
内容記述 Exact string matching is the problem of finding all occurrences of a pattern P in a text T. The problem is well-known and many sophisticated algorithms have been proposed. Some fast exact string matching algorithms have been described since the 80s (e.g., the Boyer-Moore algorithm and its simplified version the Boyer-Moore-Horspool algorithm). They have been regarded as the standard benchmarks for the practical exact string search literature. In this paper, we propose two algorithms MSBM (Max-Shift BM) and MSH (Max-Shift BMH) both based on the combination of the bad-character rule of the right-most character used in the Boyer-Moore-Horspool algorithm, the extended bad-character rule and the good-suffix rule used in the Gusfield algorithm, which is a modification of the Boyer-Moore algorithm. Only a small extra space and preprocessing time are needed with respect to the BM and BMH algorithms. Nonetheless, empirical results on different data (DNA, Protein and Text Web) with different pattern lengths show that both MSBM and MSH are very fast in practice. MSBM algorithm usually won against other algorithms.
------------------------------
This is a preprint of an article intended for publication Journal of Information Processing(JIP). This preprint should not be cited. This article should be cited as: Journal of Information Processing Vol.20(2012) No.2 (online)
DOI http://dx.doi.org/10.2197/ipsjjip.20.419
------------------------------
論文抄録(英)
内容記述タイプ Other
内容記述 Exact string matching is the problem of finding all occurrences of a pattern P in a text T. The problem is well-known and many sophisticated algorithms have been proposed. Some fast exact string matching algorithms have been described since the 80s (e.g., the Boyer-Moore algorithm and its simplified version the Boyer-Moore-Horspool algorithm). They have been regarded as the standard benchmarks for the practical exact string search literature. In this paper, we propose two algorithms MSBM (Max-Shift BM) and MSH (Max-Shift BMH) both based on the combination of the bad-character rule of the right-most character used in the Boyer-Moore-Horspool algorithm, the extended bad-character rule and the good-suffix rule used in the Gusfield algorithm, which is a modification of the Boyer-Moore algorithm. Only a small extra space and preprocessing time are needed with respect to the BM and BMH algorithms. Nonetheless, empirical results on different data (DNA, Protein and Text Web) with different pattern lengths show that both MSBM and MSH are very fast in practice. MSBM algorithm usually won against other algorithms.
------------------------------
This is a preprint of an article intended for publication Journal of Information Processing(JIP). This preprint should not be cited. This article should be cited as: Journal of Information Processing Vol.20(2012) No.2 (online)
DOI http://dx.doi.org/10.2197/ipsjjip.20.419
------------------------------
書誌レコードID
収録物識別子タイプ NCID
収録物識別子 AN00116647
書誌情報 情報処理学会論文誌

巻 53, 号 4, 発行日 2012-04-15
ISSN
収録物識別子タイプ ISSN
収録物識別子 1882-7764
戻る
0
views
See details
Views

Versions

Ver.1 2025-01-20 06:51:53.625666
Show All versions

Share

Mendeley Twitter Facebook Print Addthis

Cite as

Mohammed, Sahli, Tetsuo, Shibuya, 2012.

Loading...

エクスポート

OAI-PMH
  • OAI-PMH JPCOAR
  • OAI-PMH DublinCore
  • OAI-PMH DDI
Other Formats
  • JSON
  • BIBTEX

Confirm


Powered by WEKO3


Powered by WEKO3