LLMにおける個人特性に基づくステレオタイプの定量的分析手法の提案

青島,達大; 秋山,満昭; Tatsuhiro Aoshima; Mitsuaki Akiyama

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

LLMにおける個人特性に基づくステレオタイプの定量的分析手法の提案

https://ipsj.ixsq.nii.ac.jp/records/2008780

名前 / ファイル	ライセンス	アクション
IPSJ-CSS2025009.pdf (725.5 KB) 2027年10月20日からダウンロード可能です。	Copyright (c) 2025 by the Information Processing Society of Japan
非会員：¥660, IPSJ:学会員：¥330, CSEC:会員：¥0, SPT:会員：¥0, DLIB:会員：¥0

Item type

Symposium(1)

公開日

2025-10-20

タイトル

言語

タイトル

LLMにおける個人特性に基づくステレオタイプの定量的分析手法の提案

タイトル

言語

タイトル

Towards Quantifying Individual-Attribute-Based Stereotypes in LLMs

言語

jpn

キーワード

主題Scheme

Other

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_5794

資源タイプ

conference paper

著者所属

NTT社会情報研究所

著者所属

NTT社会情報研究所

著者所属(英)

NTT Social Informatics Laboratories

著者所属(英)

NTT Social Informatics Laboratories

著者名

青島,達大
秋山,満昭

著者名(英)

Tatsuhiro Aoshima
Mitsuaki Akiyama

論文抄録

内容記述タイプ

Other

内容記述

LLM の出力が人々の行動や社会活動へ影響を与える場面が増加している．特に，年齢，性別，人種等の個人特性による影響として，そのステレオタイプを評価することは重要である．
本論文では，個人特性が質問文に含まれる明示的な評価として，その選択肢は「はい」か「いいえ」の二択となるが，その正解に関する解釈が分かれるような状況を想定する．既存研究では，線形な統計モデルを当てはめ，その回帰係数を平均化した結果も報告されているが，例えば，年齢による非線形な変化を見逃す可能性や，人種ごとの異なる方向への偏りを過小評価する可能性がある．そこで我々は，個人特性の変化による応答傾向の差や一致度合いを測るための評価手法を提案し，9 個の LLM を 70 種類の質問で評価した結果を報告する．最後に，LLM の信頼性評価として，各ステークホルダーが実施すべきことについて議論する．

論文抄録(英)

内容記述タイプ

Other

内容記述

As large language models (LLMs) increasingly influence human behavior and social activities, it becomes crucial to assess how individual attributes, such as age, gender, and race, affect their outputs.
This paper focuses on quantifying stereotypes that arise when explicit evaluations involving individual attributes are embedded in input prompts.
We focus on yes/no questions that explicitly include individual attributes, where no universally accepted correct answer exists, and interpretations may vary from person to person.
While previous studies have employed linear statistical models and averaged regression coefficients, such approaches may overlook non-linear effects of age, and underestimate divergent biases across racial groups.
To address these limitations, we propose an evaluation method that measures differences and consistencies in response patterns as individual attributes vary.
We apply our methodology to evaluate nine LLMs across 70 distinct questions.
Finally, we discuss the implications of our findings for trustworthiness evaluations and outline key responsibilities for relevant stakeholders.

書誌情報

コンピュータセキュリティシンポジウム2025論文集

p. 60-67, 発行日 2025-10-20

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2026-03-25 04:52:56.813869

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

LLMにおける個人特性に基づくステレオタイプの定量的分析手法の提案

× 青島,達大

× 秋山,満昭

× Tatsuhiro Aoshima

× Mitsuaki Akiyama

Versions

Share

Cite as

エクスポート