談話標識の抽出に基づいた講演音声の自動インデキシング

長谷川, 将宏; 秋田, 祐哉; 河原, 達也; Masahiro, Hasegawa; Yuya, Akita; Tatsuya, Kawahara

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

談話標識の抽出に基づいた講演音声の自動インデキシング

https://ipsj.ixsq.nii.ac.jp/records/57427

名前 / ファイル	ライセンス	アクション
IPSJ-SLP01036006.pdf (1.4 MB)	Copyright (c) 2001 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2001-06-01

タイトル

談話標識の抽出に基づいた講演音声の自動インデキシング

タイトル

言語

タイトル

Automatic Indexing of Lecture Speech by Extracting Discourse Markers

言語

jpn

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

京都大学情報学研究科知能情報学専攻

著者所属

京都大学情報学研究科知能情報学専攻

著者所属

京都大学情報学研究科知能情報学専攻

著者所属(英)

Graduate School of Informatics, Kyoto University

著者所属(英)

Graduate School of Informatics, Kyoto University

著者所属(英)

Graduate School of Informatics, Kyoto University

著者名

長谷川, 将宏
秋田, 祐哉
河原, 達也

著者名(英)

Masahiro, Hasegawa
Yuya, Akita
Tatsuya, Kawahara

論文抄録

内容記述タイプ

Other

内容記述

パラグラフの先頭部分に頻出する特徴的な単語(談話標識)を用いて講演音声に対して自動インデキシングを行う手法を提案する。本研究では、種々の講演のなかでも流れが比較的明確で共通性のある学会講演を対象とする。学習セットの講演の書き起こしからポーズ情報を用いてパラグラフ境界を検出し、統計的言語モデルを用いて句点を挿入して各パラグラフの先頭の一文を抽出する。その中に含まれる名詞からtf・idfに基づいて談話標識を選定する。評価データの各文について談話標識のtf・idf値を計算し、その合計が閾値以上であればインデックスを付与する。実際の講演音声の書き起こしと認識結果に対して評価を行った結果、再現率は90%程度(適合率は20%程度)となり、高精度にインデキシングできた。

論文抄録(英)

内容記述タイプ

Other

内容記述

We address a method of automatic indexing for lecture speech by suggestive words that frequently appear in the initial sentences in each paragraph, and we define such words as discourse markers. We deal with academic presentations because these presentations can be segmented into relatively distinct parts. At first, we segment transcriptions into paragraphs and sentences by using aver age length of pauses during the lecture as a threshold. Next, each paragraph is segmented into sentences by using a statistical languag e model. Then, discourse markers are selected from nouns based on tf and idf statistics. We evaluated these discourse markers with recall and precision rate on indexing task of the lecture speech. Sentences are indexed if sum of the tf-idf value of detected discourse markers exceeds a threshold. As a result, we achieved a recall rate of 90%.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10442647

書誌情報

情報処理学会研究報告音声言語情報処理（SLP）

巻 2001, 号 55(2001-SLP-036), p. 35-42, 発行日 2001-06-01

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-22 04:30:25.036670

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

談話標識の抽出に基づいた講演音声の自動インデキシング

× 長谷川, 将宏

× 秋田, 祐哉

× 河原, 達也

× Masahiro, Hasegawa

× Yuya, Akita

× Tatsuya, Kawahara

Versions

Share

Cite as

エクスポート