WEKO3
アイテム
Corpus Augmentation Based on Pseudo-Chinese Generation for Chinese-Japanese Neural Machine Translation
https://ipsj.ixsq.nii.ac.jp/records/230196
https://ipsj.ixsq.nii.ac.jp/records/23019613528093-0832-483b-9b81-576c500b2f42
名前 / ファイル | ライセンス | アクション |
---|---|---|
![]() |
Copyright (c) 2023 by the Information Processing Society of Japan
|
Item type | National Convention(1) | |||||||||
---|---|---|---|---|---|---|---|---|---|---|
公開日 | 2023-02-16 | |||||||||
タイトル | ||||||||||
タイトル | Corpus Augmentation Based on Pseudo-Chinese Generation for Chinese-Japanese Neural Machine Translation | |||||||||
言語 | ||||||||||
言語 | eng | |||||||||
キーワード | ||||||||||
主題Scheme | Other | |||||||||
主題 | 人工知能と認知科学 | |||||||||
資源タイプ | ||||||||||
資源タイプ識別子 | http://purl.org/coar/resource_type/c_5794 | |||||||||
資源タイプ | conference paper | |||||||||
著者所属 | ||||||||||
筑波大 | ||||||||||
著者所属 | ||||||||||
筑波大 | ||||||||||
著者名 |
魏, 徴
× 魏, 徴
× 関, 洋平
|
|||||||||
論文抄録 | ||||||||||
内容記述タイプ | Other | |||||||||
内容記述 | The performance of neural machine translation model depends on the size of training datasets. This paper presents a novel corpus augmentation for Chinese-Japanese neural machine translation. Unlike conventional corpus augmentation methods, we perform corpus augmentation by generating pseudo-Chinese sentences directly from Japanese monolingual corpus. Rules are set up to replace particles and auxiliaries written in kana characters with Chinese kanji words that have the same meanings. In addition, using word embedding, nouns, verbs, adjectives written in kana characters are converted into words written in kanji characters with the similar meaning. We evaluate the proposed method using BLEU evaluation metric and compare it with back-translation method. | |||||||||
書誌レコードID | ||||||||||
収録物識別子タイプ | NCID | |||||||||
収録物識別子 | AN00349328 | |||||||||
書誌情報 |
第85回全国大会講演論文集 巻 2023, 号 1, p. 821-822, 発行日 2023-02-16 |
|||||||||
出版者 | ||||||||||
言語 | ja | |||||||||
出版者 | 情報処理学会 |