WEKO3
アイテム
Automatic Generation of Photorealistic 3D Inner Mouth Animation only from Frontal Images
https://ipsj.ixsq.nii.ac.jp/records/145081
https://ipsj.ixsq.nii.ac.jp/records/1450814159b6b0-b0dd-4b1c-9ea9-34c8a6c7492c
名前 / ファイル | ライセンス | アクション |
---|---|---|
![]() |
Copyright (c) 2015 by the Information Processing Society of Japan
|
|
オープンアクセス |
Item type | Journal(1) | |||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
公開日 | 2015-09-15 | |||||||||||||
タイトル | ||||||||||||||
タイトル | Automatic Generation of Photorealistic 3D Inner Mouth Animation only from Frontal Images | |||||||||||||
タイトル | ||||||||||||||
言語 | en | |||||||||||||
タイトル | Automatic Generation of Photorealistic 3D Inner Mouth Animation only from Frontal Images | |||||||||||||
言語 | ||||||||||||||
言語 | eng | |||||||||||||
キーワード | ||||||||||||||
主題Scheme | Other | |||||||||||||
主題 | [一般論文] Multi-view Detai-lization, inner mouth, skull bone, phoneme combination, speech animation | |||||||||||||
資源タイプ | ||||||||||||||
資源タイプ識別子 | http://purl.org/coar/resource_type/c_6501 | |||||||||||||
資源タイプ | journal article | |||||||||||||
著者所属 | ||||||||||||||
Waseda University | ||||||||||||||
著者所属 | ||||||||||||||
Waseda University | ||||||||||||||
著者所属 | ||||||||||||||
Waseda University | ||||||||||||||
著者所属 | ||||||||||||||
Waseda Research Institute for Science and Engineering | ||||||||||||||
著者所属(英) | ||||||||||||||
en | ||||||||||||||
Waseda University | ||||||||||||||
著者所属(英) | ||||||||||||||
en | ||||||||||||||
Waseda University | ||||||||||||||
著者所属(英) | ||||||||||||||
en | ||||||||||||||
Waseda University | ||||||||||||||
著者所属(英) | ||||||||||||||
en | ||||||||||||||
Waseda Research Institute for Science and Engineering | ||||||||||||||
著者名 |
Masahide, Kawai
× Masahide, Kawai
× Tomoyori, Iwao
× Akinobu, Maejima
× Shigeo, Morishima
|
|||||||||||||
著者名(英) |
Masahide, Kawai
× Masahide, Kawai
× Tomoyori, Iwao
× Akinobu, Maejima
× Shigeo, Morishima
|
|||||||||||||
論文抄録 | ||||||||||||||
内容記述タイプ | Other | |||||||||||||
内容記述 | In this paper, we propose a novel method to generate highly photorealistic three-dimensional (3D) inner mouth animation that is well-fitted to an original ready-made speech animation using only frontal captured images and small-size databases. The algorithms are composed of quasi-3D model reconstruction and motion control of teeth and the tongue, and final compositing of photorealistic speech animation synthesis tailored to the original. In general, producing a satisfactory photorealistic appearance of the inner mouth that is synchronized with mouth movement is a very complicated and time-consuming task. This is because the tongue and mouth are too flexible and delicate to be modeled with the large number of meshes required. Therefore, in some cases, this process is omitted or replaced with a very simple generic model. Our proposed method, on the other hand, can automatically generate 3D inner mouth appearances by improving photorealism with only three inputs: an original tailor-made lip-sync animation, a single image of the speaker's teeth, and a syllabic decomposition of the desired speech. The key idea of our proposed method is to combine 3D reconstruction and simulation with two-dimensional (2D) image processing using only the above three inputs, as well as a tongue database and mouth database. The satisfactory performance of our proposed method is illustrated by the significant improvement in picture quality of several tailor-made animations to a degree nearly equivalent to that of camera-captured videos. \n------------------------------ This is a preprint of an article intended for publication Journal of Information Processing(JIP). This preprint should not be cited. This article should be cited as: Journal of Information Processing Vol.23(2015) No.5 (online) DOI http://dx.doi.org/10.2197/ipsjjip.23.693 ------------------------------ |
|||||||||||||
論文抄録(英) | ||||||||||||||
内容記述タイプ | Other | |||||||||||||
内容記述 | In this paper, we propose a novel method to generate highly photorealistic three-dimensional (3D) inner mouth animation that is well-fitted to an original ready-made speech animation using only frontal captured images and small-size databases. The algorithms are composed of quasi-3D model reconstruction and motion control of teeth and the tongue, and final compositing of photorealistic speech animation synthesis tailored to the original. In general, producing a satisfactory photorealistic appearance of the inner mouth that is synchronized with mouth movement is a very complicated and time-consuming task. This is because the tongue and mouth are too flexible and delicate to be modeled with the large number of meshes required. Therefore, in some cases, this process is omitted or replaced with a very simple generic model. Our proposed method, on the other hand, can automatically generate 3D inner mouth appearances by improving photorealism with only three inputs: an original tailor-made lip-sync animation, a single image of the speaker's teeth, and a syllabic decomposition of the desired speech. The key idea of our proposed method is to combine 3D reconstruction and simulation with two-dimensional (2D) image processing using only the above three inputs, as well as a tongue database and mouth database. The satisfactory performance of our proposed method is illustrated by the significant improvement in picture quality of several tailor-made animations to a degree nearly equivalent to that of camera-captured videos. \n------------------------------ This is a preprint of an article intended for publication Journal of Information Processing(JIP). This preprint should not be cited. This article should be cited as: Journal of Information Processing Vol.23(2015) No.5 (online) DOI http://dx.doi.org/10.2197/ipsjjip.23.693 ------------------------------ |
|||||||||||||
書誌レコードID | ||||||||||||||
収録物識別子タイプ | NCID | |||||||||||||
収録物識別子 | AN00116647 | |||||||||||||
書誌情報 |
情報処理学会論文誌 巻 56, 号 9, 発行日 2015-09-15 |
|||||||||||||
ISSN | ||||||||||||||
収録物識別子タイプ | ISSN | |||||||||||||
収録物識別子 | 1882-7764 |