Support Vector Machineを用いた決定性上昇型構文解析

山田, 寛康; 松本, 裕治; Yamada, Hiroyasu; Matsumoto, Yuji

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

Support Vector Machineを用いた決定性上昇型構文解析

https://ipsj.ixsq.nii.ac.jp/records/48415

名前 / ファイル	ライセンス	アクション
IPSJ-NL02149009.pdf (1.4 MB)	Copyright (c) 2002 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2002-05-23

タイトル

Support Vector Machineを用いた決定性上昇型構文解析

タイトル

言語

タイトル

Deterministic Bottom - up Parsing with Support Vector Machines

言語

jpn

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

北陸先端科学技術大学院大学情報科学研究科

著者所属

奈良先端科学技術大学院大学情報科学研究科

著者所属(英)

Japan Advanced Institute of Science and Technology

著者所属(英)

Graduate School of Information Science, Nara Institute Science and Technology

著者名

山田, 寛康

著者名(英)

Yamada, Hiroyasu

論文抄録

内容記述タイプ

Other

内容記述

本稿では機械学習アルゴリズム Support Vector Machine を用いた英語構文解析法を提案する. 高精度な構文解析を行うには句のラベルだけでなく句の主辞がもつ語彙情報をも考慮する必要がある. しかし従来の統計的構文解析モデルはデータスパースネスの問題から主辞の語彙情報を素性として大量に使用することは逆に精度低下の要因となっていた. 機械学習アルゴリズム Support Vector Machine は素性空間の次元数に依存しない高い汎化性能と Kernel 関数によって素性の組合せまでも考慮した学習が可能である. そのため主辞の語彙情報を含めた多くの素性とその組合わせを考慮した学習が行える. しかし SVM は確率を推定するのではなく 2つのクラスを識別する分類器であり従来多くの統計的構文解析モデルが採用している確率モデルへの直接的な適用が困難である.本稿では上昇型解析アルゴリズムを用い構文解析の各段階を文脈に適切な解析木構築手続きへの分類問題とみなすことでSVMを適用し解析木構築規則の学習を行う. 解析木は SVMが分類器であることから決定的に構築される. 本手法を Penn Treebank コーパスを用いて評価した結果 labeledrecall/precision で 88.2/89.0％という高い精度を得ることができた.

論文抄録(英)

内容記述タイプ

Other

内容記述

In this paper, we propose a parsing method for English sentences with machine learning algorithm called Support Vector Machines (SVMs). The performance of statistical parsing strongly depends on how to deal with lexical information and incorporate them into the statistics for parsing. Data sparseness problem arises when using large number of features like head words. As a result, we cannot estimate correct statistics for construction of parse trees. SVMs not only have high generalization performance in sparse data using a large number of features like head words, but also can take into account the combinations of features by virtue of polynomial kernel functions. However, SVMs are classifiers, not probabilistic estimator. Thus, it is difficult to apply SVMs to the probabilistic parsing model directly. Our parser constructs a parse tree for an input sentence with a deterministic bottom-up algorithm. Each parsing process is regarded as a classification task which classifies the context into a procedure for constructing parsed trees. We evaluated our parser using the Penn Treebank corpus, and the result attained over the 88.2/89.0% labeled recall/precision.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10115061

書誌情報

情報処理学会研究報告自然言語処理（NL）

巻 2002, 号 44(2002-NL-149), p. 57-64, 発行日 2002-05-23

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-22 08:30:54.961970

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

Support Vector Machineを用いた決定性上昇型構文解析

× 山田, 寛康

× Yamada, Hiroyasu

Versions

Share

Cite as

エクスポート