連続音声認識コンソーシアムの活動報告及び最終版ソフトウェアの概要

河原, 達也; 武田, 一哉; 伊藤, 克亘; 李晃伸; 鹿野, 清宏; 山田, 篤; Tatsuya, Kawahara; Kazuya, Takeda; Katunobu, Itou; Akinobu, Lee; Kiyohiro, Shikano; Atsushi, Yamada

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

連続音声認識コンソーシアムの活動報告及び最終版ソフトウェアの概要

https://ipsj.ixsq.nii.ac.jp/records/57207

名前 / ファイル	ライセンス	アクション
IPSJ-SLP03049057 (544.8 kB)	Copyright (c) 2003 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2003-12-19

タイトル

連続音声認識コンソーシアムの活動報告及び最終版ソフトウェアの概要

タイトル

言語

タイトル

Overview of Activities and Software of Continuous Speech Recognition Consortium

言語

jpn

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

京都大学学術情報メディアセンター

著者所属

名古屋大学情報科学研究科

著者所属

名古屋大学情報科学研究科

著者所属

奈良先端科学技術大学院大学情報科学研究科

著者所属

奈良先端科学技術大学院大学情報科学研究科

著者所属

（財）京都高度技術研究所

著者所属(英)

Kyoto University, School of Informatics

著者所属(英)

Nagoya University, School of Information Science

著者所属(英)

Nagoya University, School of Information Science

著者所属(英)

Nara Institute of Science and Technology, School of Information Science

著者所属(英)

Nara Institute of Science and Technology, School of Information Science

著者所属(英)

ASTEM, Kyoto

著者名

河原, 達也武田, 一哉伊藤, 克亘李晃伸鹿野, 清宏山田, 篤

著者名(英)

Tatsuya, Kawahara Kazuya, Takeda Katunobu, Itou Akinobu, Lee Kiyohiro, Shikano Atsushi, Yamada

論文抄録

内容記述タイプ

Other

内容記述

連続音声認識コンソーシアム(CSRC)は、IPAプロジェクトで開発された「日本語ディクテーション基本ソフトウェア」の維持・発展をめざして、情報処理学会音声言語情報処理研究会のもとで2000年度から2002年度まで(2003年9月まで)活動を行ってきた。本稿では、この活動の報告を行うとともに、このたび編集した最終版ソフトウェアの概要を述べる。本プロジェクトでは、大語彙連続音声認識エンジンJuliusの機能拡張とWindowsSAPI対応を行うとともに、非常に大規模なデータベースを用いた高精度な音響モデル・言語モデルの構築を行った。また音響モデルについては、多様な話者層（高年齢・小児）や入力環境（電話・社内環境など）に対応したモデルを整備した。

論文抄録(英)

内容記述タイプ

Other

内容記述

Continuous Speech Recognition consortium (CSRC was founded under IPSJ SIG-SLP for further enhancement of Japanese Dictation Toolkit that had been developed by the IPA project, An overview of its activities and final version of the developed software is given in this report. The LVCSR (large vocabulary continuous speech recognition) engine Julius has been improved both in functionality and stability, and ported to Windows in compliance with SAPI (Speech API). A set of acoustic and language models are trained using very large-scale databases. We also set up a variety of acoustic models to cover wider user generations and speech-input environments.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10442647

書誌情報

情報処理学会研究報告音声言語情報処理（SLP）

巻 2003, 号 124(2003-SLP-049), p. 325-330, 発行日 2003-12-19

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-21 15:29:46.292109

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

連続音声認識コンソーシアムの活動報告及び最終版ソフトウェアの概要

× 河原, 達也武田, 一哉伊藤, 克亘李晃伸鹿野, 清宏山田, 篤

× Tatsuya, Kawahara Kazuya, Takeda Katunobu, Itou Akinobu, Lee Kiyohiro, Shikano Atsushi, Yamada

Versions

Share

Cite as

エクスポート

インデックスリンク

インデックスツリー

アイテム

連続音声認識コンソーシアムの活動報告及び最終版ソフトウェアの概要

× 河原, 達也 武田, 一哉 伊藤, 克亘 李晃伸 鹿野, 清宏 山田, 篤

× Tatsuya, Kawahara Kazuya, Takeda Katunobu, Itou Akinobu, Lee Kiyohiro, Shikano Atsushi, Yamada

Versions

Share

Cite as

エクスポート

× 河原, 達也武田, 一哉伊藤, 克亘李晃伸鹿野, 清宏山田, 篤