視覚情報を話題の対象とする音声対話システム

山肩洋子; 河原, 達也; 奥乃, 博; Yoko, Yamakata; Tatsuya, Kawahara; Hiroshi, G.Okuno

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

視覚情報を話題の対象とする音声対話システム

https://ipsj.ixsq.nii.ac.jp/records/57381

名前 / ファイル	ライセンス	アクション
IPSJ-SLP01039014.pdf (1.1 MB)	Copyright (c) 2001 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2001-12-20

タイトル

視覚情報を話題の対象とする音声対話システム

タイトル

言語

タイトル

Spoken Dialogue System for Robot with Computer Vision

言語

jpn

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

京都大学情報学研究科知能情報学専攻

著者所属

京都大学情報学研究科知能情報学専攻

著者所属

京都大学情報学研究科知能情報学専攻

著者所属(英)

Graduate School of Informatics, Kyoto University

著者所属(英)

Graduate School of Informatics, Kyoto University

著者所属(英)

Graduate School of Informatics, Kyoto University

著者名

山肩洋子
河原, 達也
奥乃, 博

著者名(英)

Yoko, Yamakata
Tatsuya, Kawahara
Hiroshi, G.Okuno

論文抄録

内容記述タイプ

Other

内容記述

ユーザとの音声対話により実世界中でオブジェクトを探索するロボットの実現を目指す。音声認識や画像認識においては認識誤り、言語情報と視覚情報の対応づけには個人差によるあいまい性が生じる。また、ユーザの信念の誤りによって誤解が生じる可能性もある。そこで本研究では、信念ネットワーク及びユーザモデルを導入し、これらの確率的枠組みに基づいてユーザとの対話をプランニングすることで上記の問題の解決を図る。ユーザの視野外におけるオブジェクト探索タスクで実装を行った結果、ユーザの意図したオブジェクトを同定するまでに必要な対話回数を削減でき、また画像認識結果から音声認識結果を絞り込めることを示した。

論文抄録(英)

内容記述タイプ

Other

内容記述

A spoken dialogue system is developed with the aim of creating a robot which searches for an object in the real world through interacting with the user. Speech and image recognition errors may occur within the system and differences among individual users may cause errors when translating the speech into a image representation. Misunderstandings may also occur due to false user beliefs. These problems are solved using a dialogue planning mechanism based on the probablistic framework of the belief network and a user model. We design and implement a system which searches for the object that is specified by the user but is not within the user's view. We demonstrate that this system can reduce the number of interactions for identifying on the object, and improves the speech recognition result by using the results of image recognition.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10442647

書誌情報

情報処理学会研究報告音声言語情報処理（SLP）

巻 2001, 号 123(2001-SLP-039), p. 81-86, 発行日 2001-12-20

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-22 04:31:40.176561

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

視覚情報を話題の対象とする音声対話システム

× 山肩洋子

× 河原, 達也

× 奥乃, 博

× Yoko, Yamakata

× Tatsuya, Kawahara

× Hiroshi, G.Okuno

Versions

Share

Cite as

エクスポート