情報学広場：情報処理学会電子図書館

WEKO3

To

lat lon distance

[[sub_check.contents]]

[[sub_check.contents]]

[[sub_radio.contents]]

To

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

StyleMapを用いた事前学習済みStyleGANによる画像編集

https://doi.org/10.20729/00231734

名前 / ファイル	ライセンス	アクション
IPSJ-JNL6501012.pdf (106.6 MB)	Copyright (c) 2024 by the Information Processing Society of Japan
オープンアクセス

Item type

Journal(1)

公開日

2024-01-15

タイトル

タイトル

StyleMapを用いた事前学習済みStyleGANによる画像編集

タイトル

言語

en

タイトル

Image Editing with Pre-trained StyleGAN Using StyleMap

言語

言語

jpn

キーワード

主題Scheme

Other

主題

[特集:エージェント理論・技術とその応用] GAN Inversion，StyleGAN，StyleMap，局所編集

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_6501

資源タイプ

journal article

ID登録

ID登録

10.20729/00231734

ID登録タイプ

JaLC

著者所属

電気通信大学大学院情報理工学研究科情報学専攻

著者所属

電気通信大学大学院情報理工学研究科情報学専攻

著者所属

電気通信大学大学院情報理工学研究科情報学専攻

著者所属

電気通信大学大学院情報理工学研究科情報学専攻

著者所属

電気通信大学大学院情報理工学研究科情報学専攻

著者所属(英)

en

University of Electro-Communications Graduate School of Informatics and Engineering Department of Informatics

著者所属(英)

en

University of Electro-Communications Graduate School of Informatics and Engineering Department of Informatics

著者所属(英)

en

University of Electro-Communications Graduate School of Informatics and Engineering Department of Informatics

著者所属(英)

en

University of Electro-Communications Graduate School of Informatics and Engineering Department of Informatics

著者所属(英)

en

University of Electro-Communications Graduate School of Informatics and Engineering Department of Informatics

著者名

本田, 爽
折原, 良平
清, 雄一
田原, 康之
大須賀, 昭彦

著者名(英)

So, Honda
Ryohei, Orihara
Yuichi, Sei
Yasuyuki, Tahara
Akihiko, Ohsuga

論文抄録

内容記述タイプ

Other

内容記述

近年，所望の画像を再現するようにGANの潜在変数を推定するGAN Inversionという分野が注目されている．入力画像を再現する潜在変数が得られると，この潜在変数を編集することにより画像を編集することができる．しかし，入力画像と再構成画像の差分である再構成品質と，編集画像のもっともらしさである編集品質の間にはトレードオフがあることが知られている．本研究では画像全体の性質を表す潜在変数を空間方向に拡張することで再構成品質の向上を図った．次に，このような拡張が編集品質を大幅に損なうことから，追加の正則化を課することで再構成品質と編集品質を兼ね備えたGAN Inversionを行った．その結果，提案手法は定量的・定性的な観点からベースラインに対して再構成品質と編集品質のトレードオフにおいてより良い結果を得た．

論文抄録(英)

内容記述タイプ

Other

内容記述

Recently, the field of GAN Inversion, which estimates the latent code of a GAN to reproduce the desired image, has attracted much attention. Once a latent variable that reproduces the input image is obtained, the image can be edited by manipulating the latent code. However, it is known that there is a trade-off between reconstruction quality, which is the difference between the input image and the reproduced image, and editability, which is the plausibility of the edited image. In our study, we attempted to improve reconstruction quality by extending latent code that represents the properties of the entire image in the spatial direction. Next, since such an expansion significantly impairs the editing quality, we performed a GAN Inversion that realizes both reconstruction quality and editability by imposing an additional regularization. As a result, the proposed method yielded a better trade-off between the reconstruction quality and the editability against the baseline from both quantitative and qualitative perspectives, and is comparable to state-of-the-art(SOTA) methods that adjust the weights of the generators.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN00116647

書誌情報

情報処理学会論文誌

巻 65, 号 1, p. 97-111, 発行日 2024-01-15

ISSN

収録物識別子タイプ

ISSN

収録物識別子

1882-7764

公開者

言語

ja

出版者

情報処理学会

戻る

0

views

	Views

Versions

Ver.1

2025-01-19 10:36:18.656048

Show All versions

Share

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX