深層学習による物体検出モデルの文字認識器としての適性について---YOLOv6の日本語文字認識器としての適性--

畔柳, 愛子; 松田, 悠; 松本, 尚; Aiko, Kuroyanagi; Haruka, Matsuda; Takashi, Matsumoto

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

深層学習による物体検出モデルの文字認識器としての適性について---YOLOv6の日本語文字認識器としての適性--

https://ipsj.ixsq.nii.ac.jp/records/241495

名前 / ファイル	ライセンス	アクション
IPSJ-MPS24151015.pdf (2.4 MB) 2026年12月2日からダウンロード可能です。	Copyright (c) 2024 by the Information Processing Society of Japan
非会員：¥660, IPSJ:学会員：¥330, MPS:会員：¥0, DLIB:会員：¥0

Item type

SIG Technical Reports(1)

公開日

2024-12-02

タイトル

深層学習による物体検出モデルの文字認識器としての適性について---YOLOv6の日本語文字認識器としての適性--

タイトル

言語

タイトル

Suitability of Deep Learning Object Detection Model as a Character Recognizer --- YOLOv6’s Suitability as a Japanese Character Recognizer---

言語

jpn

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

奈良女子大学

著者所属

奈良女子大学

著者所属

奈良女子大学

著者所属(英)

Nara Women's University

著者所属(英)

Nara Women's University

著者所属(英)

Nara Women's University

著者名

畔柳, 愛子
松田, 悠
松本, 尚

著者名(英)

Aiko, Kuroyanagi
Haruka, Matsuda
Takashi, Matsumoto

論文抄録

内容記述タイプ

Other

内容記述

広く活用されている文字認識 (OCR) 技術は，文字領域の検出工程と文字の認識工程に分けて行っているため，前処理など多くの工程と文字単位や行単位の文字認識作業の繰り返しが必要となる．一方，深層学習による物体検出器を文字認識に流用することにより，前処理無しで複数の文字検出（物体検出と認識）を同時に行うことができる．松本研究室では，物体検出器のYOLO (You Only Look Once)v3 を利用した日本語文字認識が，学習方法や文字判定方法に工夫を施すことにより，実用レベルで実現可能なことを示した．YOLO には後継のバージョンが複数存在しているが，どのバージョンが文字認識に適しているかまでは精査されていない．本研究では，YOLOv6 に焦点を当てて，文字認識器としての適性を調べる．YOLOv6 は，畳込み層を含む基本構造に EfficientRep を採用し，推論の高速性，アンカーフリー方式，用途に合わせたモデル選択可能を特徴とする物体検出器である．YOLOv6 と YOLOv3 に関して，文字認識の推論および学習の性能を比較して，どちらがより日本語文字認識用の物体検出モデルとして優れているかを明らかにする．

論文抄録(英)

内容記述タイプ

Other

内容記述

While widely used Optical Character Recognition (OCR) technology divides the process into character region detection and character recognition, requiring many pre-processing steps and repeated character-by-character or line-by-line recognition tasks, deep learning-based object detectors can be applied to character recognition, enabling simultaneous detection and recognition of multiple characters without pre-processing. The Matsumoto Laboratory has demonstrated the feasibility of Japanese character recognition using the object detector YOLO (You Only Look Once) v3 with modifications to the learning method and character judgment method. Although there are multiple successor versions of YOLO, which version is most suitable for character recognition has not been thoroughly investigated. This study focuses on YOLOv6 to investigate its suitability as a character recognizer. YOLOv6 is an object detector featuring EfficientRep for the basic structure including convolutional layers, high inference speed, an anchor-free method, and the ability to select models for specific applications. We compare the inference and learning performance of YOLOv6 and YOLOv3 for character recognition to determine which is more suitable as an object detection model for Japanese character recognition.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10505667

書誌情報

研究報告数理モデル化と問題解決（MPS）

巻 2024-MPS-151, 号 15, p. 1-6, 発行日 2024-12-02

ISSN

収録物識別子タイプ

ISSN

収録物識別子

2188-8833

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-19 07:38:39.989288

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

深層学習による物体検出モデルの文字認識器としての適性について---YOLOv6の日本語文字認識器としての適性--

× 畔柳, 愛子

× 松田, 悠

× 松本, 尚

× Aiko, Kuroyanagi

× Haruka, Matsuda

× Takashi, Matsumoto

Versions

Share

Cite as

エクスポート