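Information Extraction From an English Contract: A statistics-based computation of word similarity for making a thesaurus for English contracts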

Sagara, Kaoru (相良 かおる); Watanabe, Katsumasa (渡邊 勝正)

https://ipsj.ixsq.nii.ac.jp/records/48817

Name / File	License	Action
IPSJ-NL98128022.pdf (896.2 kB)	Copyright (c) 1998 by the Information Processing Society of Japan
Open access

Item type

SIG Technical Reports(1)

Publication date

1998-11-05

Title

英文契約書における内容の抽出　－シソーラス作成のための統計情報を用いた類似度計算－

Title (English)

Information Extraction From an English Contract: A statistics-based computation of word similarity for making a thesaurus for English contracts

Language

jpn

Resource type identifier

http://purl.org/coar/resource_type/c_18gh

Resource type

technical report

Author affiliation

Graduate School of Information Science, Nara Institute of Science and Technology (listed once for each of the two authors)

Author names

Sagara, Kaoru (相良 かおる); Watanabe, Katsumasa (渡邊 勝正)

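Abstract

Description type

Other

Description

This paper proposes a method for computing the similarity of words used in English contracts. The method is a statistical approach based on word co-occurrence frequencies, and it defines similarity by an inner product. Its distinguishing feature is that it extracts syntactically motivated two-tuples (noun and verb, verb and noun, adjective and noun, preposition and noun) from the clauses of a collection of English contract forms, and uses the degree of association between these two-tuples as the elements of the vector. Using this method, we computed similarities for 209,274 pairs formed from 898 distinct nouns in a collection of English contract forms (162,298 words) and, by classification with Hayashi's quantification method IV, produced class-concept data consisting of 81 classes. This study is preparatory work for extracting the content of English contracts.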

Abstract (English)

Description type

Other

Description

This paper proposes an approach for measuring the similarity of words used in a collection of English contracts. The approach is a statistics-based computation of word similarity using vectors of co-occurrence statistics. Its distinguishing feature is that each vector consists of correlations between two-tuples of terms chosen on the basis of syntactic behavior (such as the ordered pair verb, noun). We computed similarity data for 209,274 pairs from 898 nouns in the collection of English contracts, and derived 81 classes from these similarity data using multidimensional scaling. This work is preparatory to information extraction from English contracts.
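The abstract describes word similarity as an inner product over vectors built from syntactic co-occurrence pairs. A minimal Python sketch of that idea follows. The function names `build_vectors` and `inner_product`, the tuple layout, and the sample pairs are hypothetical, and raw co-occurrence counts stand in for the paper's association measure, which the abstract does not specify.

```python
from collections import Counter, defaultdict


def build_vectors(pairs):
    """Map each noun to a sparse vector keyed by (relation, partner word).

    `pairs` is an iterable of (relation, partner, noun) tuples, for example
    ("verb-noun", "terminate", "agreement"). Each vector element is a raw
    co-occurrence count; this is an assumed stand-in for the association
    measure used in the paper.
    """
    vectors = defaultdict(Counter)
    for relation, partner, noun in pairs:
        vectors[noun][(relation, partner)] += 1
    return vectors


def inner_product(u, v):
    """Inner product of two sparse vectors stored as Counters."""
    if len(u) > len(v):
        u, v = v, u  # iterate over the shorter vector
    return sum(weight * v[key] for key, weight in u.items())


# Invented sample pairs, not data from the paper.
pairs = [
    ("verb-noun", "terminate", "agreement"),
    ("verb-noun", "terminate", "contract"),
    ("adjective-noun", "written", "agreement"),
    ("adjective-noun", "written", "contract"),
    ("adjective-noun", "written", "notice"),
    ("preposition-noun", "under", "agreement"),
    ("verb-noun", "give", "notice"),
]
vectors = build_vectors(pairs)

print(inner_product(vectors["agreement"], vectors["contract"]))  # 2
print(inner_product(vectors["agreement"], vectors["notice"]))    # 1
```

In this toy data, "agreement" and "contract" share two contexts and so score higher than "agreement" and "notice", which share one. The classification of the resulting similarities into classes is a separate step that this sketch does not attempt.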

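Bibliographic record ID

Source identifier type

NCID

Source identifier

AN10115061

Bibliographic information

情報処理学会研究報告自然言語処理（NL） (IPSJ SIG Technical Reports, Natural Language Processing)

Vol. 1998, No. 99 (1998-NL-128), pp. 159-166, issued 1998-11-05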

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

Publisher

Information Processing Society of Japan (情報処理学会)
