Linking videos and languages: Representations and Their Applications

https://ipsj.ixsq.nii.ac.jp/records/187500
File IPSJ-CVIM18212038.pdf (5.7 MB)
License Copyright (c) 2018 by the Information Processing Society of Japan
Access Open access
Item type SIG Technical Reports(1)
Publication date 2018-05-03
Title Linking videos and languages: Representations and Their Applications
Language eng
Keyword
Subject scheme Other
Subject Doctoral Dissertation Session 2
Resource type
Resource type identifier http://purl.org/coar/resource_type/c_18gh
Resource type technical report
Author affiliations
CyberAgent, Inc.
Osaka University
Tampere University of Technology
University of Oulu
Nara Institute of Science and Technology
Authors
Mayu Otani
Yuta Nakashima
Esa Rahtu
Janne Heikkilä
Naokazu Yokoya
Abstract
Description type Other
Description Mimicking the human ability to understand visual data (images or videos) is a long-standing goal of computer vision. To achieve visual content understanding in a computer, many recent works attempt to connect visual data with natural language data, including object labels and descriptions. This effort is important not only for visual understanding but also for broad applications such as content-based visual data retrieval and automatic description generation to help visually impaired people. The goal of this paper is to develop cross-modal representations that enable us to associate videos with natural language. We explore two directions for constructing cross-modal representations: hand-crafted representations and data-driven representation learning. The experiments demonstrate that the proposed representations can be applied to a wide range of practical applications, including query-focused video summarization and content-based video retrieval with natural language queries.
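Bibliographic record ID
Source identifier type NCID
Source identifier AA11131797
Bibliographic information Research Report on Computer Vision and Image Media (CVIM) (研究報告コンピュータビジョンとイメージメディア)

Vol. 2018-CVIM-212, No. 38, p. 1-16, issue date 2018-05-03
ISSN
Source identifier type ISSN
Source identifier 2188-8701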
Notice
SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.
Publisher
Language ja
Publisher Information Processing Society of Japan (情報処理学会)