手書き文字列読み取りのための単語列探索アルゴリズム　－文字タグ法－

福島, 俊一; 下村, 秀樹; 森, 義和; Toshikazu, Fukushima; Hideki, Shimomura; Yoshikazu, Mori

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

手書き文字列読み取りのための単語列探索アルゴリズム　－文字タグ法－

https://ipsj.ixsq.nii.ac.jp/records/13691

名前 / ファイル	ライセンス	アクション
IPSJ-JNL3704005.pdf (980.1 kB)	Copyright (c) 1996 by the Information Processing Society of Japan
オープンアクセス

Item type

Journal(1)

公開日

1996-04-15

タイトル

手書き文字列読み取りのための単語列探索アルゴリズム　－文字タグ法－

タイトル

言語

タイトル

A Word -Sequence Search Algorithm for a Hand- Written Character Reader

言語

jpn

キーワード

主題Scheme

Other

主題

論文

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_6501

資源タイプ

journal article

その他タイトル

その他のタイトル

パターン認識

著者所属

日本電気株式会社情報メディア研究所音声言語研究部

著者所属

日本電気株式会社情報メディア研究所音声言語研究部

著者所属

株式会社NEC情報システムズ

著者所属(英)

Human Language Research Laboratory, Information Technology Research Laboratories, NEC Corporation

著者所属(英)

Human Language Research Laboratory, Information Technology Research Laboratories, NEC Corporation

著者所属(英)

NEC Informatic Systems Ltd

著者名

福島, 俊一

著者名(英)

Toshikazu, Fukushima

論文抄録

内容記述タイプ

Other

内容記述

本論文ではフリーピッチ手書き文字列の読み取りのために新しい知識処理アルゴリズムである「文字タグ法」を提案する. 字形が多様で文字サイズ・文字ピッチにばらつきがあり文字の接触・入り組みなどもよく起きる手書き文字列の読み取りでは誤切り出しや誤認識によって欠落した正解文字を補完する知識処理が不可欠である. 従来の知識処理方式は単語辞書と候補文字列とを照合して単語候補を抽出したうえでその並びの妥当性を判定する2段構成である. このような従来法では単語境界が不確定なケースをうまく扱えないことや候補文字列と単語辞書との虫食い照合における組合せ爆発を避けると強引に候補を切り捨てることになって最良解を保証できないことなどが大きな問題になっている. これに対して本論文で提案する文字タグ法は文字を基本単位としてタグを付与しその位置関係をチェックしながら連結していく戦略をとる. 単語内の文字の連結と単語間の文字の連結とを同等に扱って動的計画法を適用することで最良解を保証しかつ入力文字列の長さLと候補多重度Mに対してO(L^2・M^2)またはO(L・M^2)の時間計算量を達成している. さらに手書き宛名住所の地名領域の読み取りに文字タグ法を応用し文字切り出しや個別文字認識のあらゆる組合せと正解文字欠落の可能性の中から最良解を高速に探索する文字タグ法の能力を確認した.

論文抄録(英)

内容記述タイプ

Other

内容記述

This paper proposes a new algorithm for post-processing in a hand-written character reader. Hand-written characters have such characteristics as various styles, irregularity in size and pitch, frequency of character overlapping, and so on. These characteristics bring difficulty into hand-written character reading systems. Post-processing to correct mis-segmentation and mis-recognition by linguistic information is an important approach to accurate reading. Conventional post-processing methods consist of two steps. In the first step, word candidates are extracted by word dictionary looking-up. In the second step, combinations of words are evaluated. These conventional methods have the following problems. The first problem is that they don't work well when word boundary segmentation is missed. The second one is combinational time complexity, required for examinations of all combinations of character segmentation candidates and character recognition candidates by approximate matching. In the algorithm proposed in this paper, character candidates are tagged with position-in-word information, and the position-in-word tags are connected by a dynamic programming method. This algorithm has the advantage of time complexity O(L^2・M^2) or O(L・M^2) for optimum path search, where L is input length, and M is average number of segmentation and recognition candidates per character. This paper also describes its implementation and evaluation results in hand-written Japanese address reading.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN00116647

書誌情報

情報処理学会論文誌

巻 37, 号 4, p. 500-510, 発行日 1996-04-15

ISSN

収録物識別子タイプ

ISSN

収録物識別子

1882-7764

戻る

views

See details

	Views

Versions

Ver.1

2025-01-23 01:07:09.292391

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

手書き文字列読み取りのための単語列探索アルゴリズム　－文字タグ法－

× 福島, 俊一

× Toshikazu, Fukushima

Versions

Share

Cite as

エクスポート

インデックスリンク

インデックスツリー

アイテム

手書き文字列読み取りのための単語列探索アルゴリズム －文字タグ法－

× 福島, 俊一

× Toshikazu, Fukushima

Versions

Share

Cite as

エクスポート

手書き文字列読み取りのための単語列探索アルゴリズム　－文字タグ法－