ログイン 新規登録
言語:

WEKO3

  • トップ
  • ランキング
To
lat lon distance
To

Field does not validate



インデックスリンク

インデックスツリー

メールアドレスを入力してください。

WEKO

One fine body…

WEKO

One fine body…

アイテム

  1. シンポジウム
  2. シンポジウムシリーズ
  3. じんもんこんシンポジウム
  4. 2024

Beyond OCR: Enhancing Classical Japanese Transcription with Large Language Models

https://ipsj.ixsq.nii.ac.jp/records/241512
https://ipsj.ixsq.nii.ac.jp/records/241512
ce318aa9-5c6e-4248-b23b-135adb7da71b
名前 / ファイル ライセンス アクション
IPSJ-CH2024011.pdf IPSJ-CH2024011.pdf (2.2 MB)
Copyright (c) 2024 by the Information Processing Society of Japan
オープンアクセス
Item type Symposium(1)
公開日 2024-11-30
タイトル
タイトル Beyond OCR: Enhancing Classical Japanese Transcription with Large Language Models
タイトル
言語 en
タイトル Beyond OCR: Enhancing Classical Japanese Transcription with Large Language Models
言語
言語 eng
キーワード
主題Scheme Other
主題 Large Language Model, Historical Document, Classical Japanese, OCR
資源タイプ
資源タイプ識別子 http://purl.org/coar/resource_type/c_5794
資源タイプ conference paper
著者所属
Sakana AI
著者所属
ROIS-DS Center for Open Data in the Humanities
著者所属
National Institute of Informatics
著者所属(英)
en
Sakana AI
著者所属(英)
en
ROIS-DS Center for Open Data in the Humanities
著者所属(英)
en
National Institute of Informatics
著者名 Clanuwat, Tarin

× Clanuwat, Tarin

Clanuwat, Tarin

Search repository
Zhao Tianyu

× Zhao Tianyu

Zhao Tianyu

Search repository
Imajuku, Yuki

× Imajuku, Yuki

Imajuku, Yuki

Search repository
Kitamoto, Asanobu

× Kitamoto, Asanobu

Kitamoto, Asanobu

Search repository
著者名(英) Tarin, Clanuwat

× Tarin, Clanuwat

en Tarin, Clanuwat

Search repository
Tianyu Zhao

× Tianyu Zhao

en Tianyu Zhao

Search repository
Yuki Imajuku

× Yuki Imajuku

en Yuki Imajuku

Search repository
Asanobu Kitamoto

× Asanobu Kitamoto

en Asanobu Kitamoto

Search repository
論文抄録
内容記述タイプ Other
内容記述 This paper presents a methodology for enhancing Optical Character Recognition (OCR) accuracy for historical Japanese documents, using Large Language Models (LLMs). We experimented with six open-source LLMs, ranging in size from 7 to 14 billion parameters, developing two models—a next-token prediction model and an OCR text refiner—both fine-tuned on classical Japanese text from the Minna de Honkoku project. Our approach significantly reduces the Character Error Rate (CER) by correcting misidentified characters and reordering incorrect sequences, particularly improving the recognition of Katakana and Kanji characters often misinterpreted by RURI Kuzushiji OCR model. The findings demonstrate the potential of advanced LLMs to improve the digitization and preservation of Japanese historical documents.
論文抄録(英)
内容記述タイプ Other
内容記述 This paper presents a methodology for enhancing Optical Character Recognition (OCR) accuracy for historical Japanese documents, using Large Language Models (LLMs). We experimented with six open-source LLMs, ranging in size from 7 to 14 billion parameters, developing two models—a next-token prediction model and an OCR text refiner—both fine-tuned on classical Japanese text from the Minna de Honkoku project. Our approach significantly reduces the Character Error Rate (CER) by correcting misidentified characters and reordering incorrect sequences, particularly improving the recognition of Katakana and Kanji characters often misinterpreted by RURI Kuzushiji OCR model. The findings demonstrate the potential of advanced LLMs to improve the digitization and preservation of Japanese historical documents.
書誌情報 じんもんこん2024論文集

巻 2024, p. 75-82, 発行日 2024-11-30
出版者
言語 ja
出版者 情報処理学会
戻る
0
views
See details
Views

Versions

Ver.1 2025-01-19 07:38:20.330017
Show All versions

Share

Mendeley Twitter Facebook Print Addthis

Cite as

エクスポート

OAI-PMH
  • OAI-PMH JPCOAR
  • OAI-PMH DublinCore
  • OAI-PMH DDI
Other Formats
  • JSON
  • BIBTEX

Confirm


Powered by WEKO3


Powered by WEKO3