講義講演シーン検索における音声データの利用

岡本, 拓明; 小林, 隆志; 直井, 聡; 横田, 治夫; 古井, 貞煕; Hiroaki, OKAMOTO; Takashi, KOBAYASHI; Satochi, NAOI; Haruo, YOKOTA; Sadaoki, FURUI

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

講義講演シーン検索における音声データの利用

https://ipsj.ixsq.nii.ac.jp/records/19070

名前 / ファイル	ライセンス	アクション
IPSJ-DBS05137078.pdf (499.6 kB)	Copyright (c) 2005 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2005-07-15

タイトル

講義講演シーン検索における音声データの利用

タイトル

言語

タイトル

Application of Voice Data for Retrieving Unified Presentation Contents

言語

jpn

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

東京工業大学大学院情報理工学研究科計算工学専攻

著者所属

東京工業大学学術国際情報センター

著者所属

東京工業大学学術国際情報センター株式会社富士通研究所

著者所属

東京工業大学大学院情報理工学研究科計算工学専攻東京工業大学学術国際情報センター

著者所属

東京工業大学大学院情報理工学研究科計算工学専攻

著者所属(英)

Department of Computer Science Graduate School of Information Science and Engineering in Tokyo Institute of Technology

著者所属(英)

Global Scientic Information & Computing Center in Tokyo Institute of Technology

著者所属(英)

Global Scientic Information & Computing Center in Tokyo Institute of Technology,FUJITSU LABORATORIES LTD.

著者所属(英)

Department of Computer Science Graduate School of Information Science and Engineering in Tokyo Institute of Technology,Global Scientic Information & Computing Center in Tokyo Institute of Technology

著者所属(英)

Department of Computer Science Graduate School of Information Science and Engineering in Tokyo Institute of Technology

著者名

岡本, 拓明

著者名(英)

Hiroaki, OKAMOTO

論文抄録

内容記述タイプ

Other

内容記述

我々はこれまで，講義講演のプレゼンテーションにおける資料とその撮影動画をメタデータによる統合コンテンツとして蓄積し，その特性を利用して高度なシーン検索を提供するUPRISEを提案してきた．本稿では，このUPRISEのシーン検索機能の性能を向上させるために，講義講演の動画から抽出した音声データを利用する手法を提案する．音声認識エンジンにより抽出を行った音声データと従来の適合度手法をシーン毎に比較することにより，シーン検索に音声データを利用することの有用性を示す．

論文抄録(英)

内容記述タイプ

Other

内容記述

We have proposed unifying presentation contents, such as lecture video and presentation slides used in lectures, using metadata. For the unified contents, we have also proposed an advanced searching scene mechanism named UPRISE(Unified Presentation Slide Retrieval by Impression Search Engine). In this paper, for the performance , we pay attention to voice data abstracted from lecture videos. To confirm the availability of the voice data, we compared voice data with existing impression indicators.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10112482

書誌情報

情報処理学会研究報告データベースシステム（DBS）

巻 2005, 号 68(2005-DBS-137), p. 585-591, 発行日 2005-07-15

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-22 22:25:49.556349

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

講義講演シーン検索における音声データの利用

× 岡本, 拓明

× Hiroaki, OKAMOTO

Versions

Share

Cite as

エクスポート