Integrated Use of Internal and External Evidence in the Alignment of Multi-Word Named Entities

Kutsumi, Takeshi; Yoshimi, Takehiko; Kotani, Katsunori; Sata, Ichiko; Isahara, Hitoshi; 九津見, 毅; 吉見, 毅彦; 小谷, 克則; 佐田, いち子; 井佐原, 均

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

{"_buckets": {"deposit": "788f7bb3-9955-4db9-867b-b475d00d80bd"}, "_deposit": {"created_by": 3, "id": "28855", "owners": [3], "pid": {"revision_id": 0, "type": "depid", "value": "28855"}, "status": "published"}, "_oai": {"id": "oai:waseda.repo.nii.ac.jp:00028855", "sets": ["2080"]}, "author_link": ["50002", "49995", "50000", "49998", "50001", "49999", "50003", "49994", "49996", "49997"], "item_10003_biblio_info_90": {"attribute_name": "書誌情報", "attribute_value_mlt": [{"bibliographicIssueDates": {"bibliographicIssueDate": "2005-11-16", "bibliographicIssueDateType": "Issued"}, "bibliographicPageEnd": "196", "bibliographicPageStart": "187", "bibliographic_titles": [{}]}]}, "item_10003_creator_87": {"attribute_name": "著者別名", "attribute_type": "creator", "attribute_value_mlt": [{"creatorNames": [{"creatorName": "Kutsumi, Takeshi"}], "nameIdentifiers": [{"nameIdentifier": "49999", "nameIdentifierScheme": "WEKO"}]}, {"creatorNames": [{"creatorName": "Yoshimi, Takehiko"}], "nameIdentifiers": [{"nameIdentifier": "50000", "nameIdentifierScheme": "WEKO"}]}, {"creatorNames": [{"creatorName": "Kotani, Katsunori"}], "nameIdentifiers": [{"nameIdentifier": "50001", "nameIdentifierScheme": "WEKO"}]}, {"creatorNames": [{"creatorName": "Sata, Ichiko"}], "nameIdentifiers": [{"nameIdentifier": "50002", "nameIdentifierScheme": "WEKO"}]}, {"creatorNames": [{"creatorName": "Isahara, Hitoshi"}], "nameIdentifiers": [{"nameIdentifier": "50003", "nameIdentifierScheme": "WEKO"}]}]}, "item_10003_description_123": {"attribute_name": "資源タイプ", "attribute_value_mlt": [{"subitem_description": "text", "subitem_description_type": "Other"}]}, "item_10003_description_88": {"attribute_name": "抄録", "attribute_value_mlt": [{"subitem_description": "This paper proposes a method of extracting English multi-word named entities and their Japanese equivalents from a parallel corpus. The aim of our research is to extract multi-word named entities which are not listed in a dictionary of an English-to-Japanese MT system and appear infrequently in a parallel corpus. Our method makes its alignment on the basis of two kinds of external evidence provided by the context in which a bilingual pair appears, as well as two kinds of internal evidence within the pair. Each evidence is accompanied by a score, and the aggregate score is computed as a weighted sum of the scores. The appropriate weights are estimated with the logistic regression analysis. An experiment using a parallel corpus of Yomiuri Shimbun and The Daily Yomiuri satisfactorily found that 86.36% of the extracted bilingual pairs with the highest scores were judged to be correct.", "subitem_description_type": "Abstract"}]}, "item_10003_publisher_116": {"attribute_name": "出版者", "attribute_value_mlt": [{"subitem_publisher": "Logico-Linguistic Society of Japan"}]}, "item_10003_relation_124": {"attribute_name": "シリーズ", "attribute_value_mlt": [{"subitem_relation_name": [{"subitem_relation_name_text": "Oral Session"}]}]}, "item_10003_relation_125": {"attribute_name": "関係URI", "attribute_value_mlt": [{"subitem_relation_name": [{"subitem_relation_name_text": "http://www.decode.waseda.ac.jp/PACLIC18/"}]}]}, "item_10003_subject_100": {"attribute_name": "日本十進分類法", "attribute_value_mlt": [{"subitem_subject": "801.06", "subitem_subject_scheme": "NDC"}]}, "item_10003_subject_110": {"attribute_name": "米国議会図書館件名標目", "attribute_value_mlt": [{"subitem_subject": "Computational linguistics--Congresses", "subitem_subject_scheme": "LCSH"}]}, "item_10003_text_144": {"attribute_name": "URI", "attribute_value_mlt": [{"subitem_text_value": "http://hdl.handle.net/2065/572"}]}, "item_creator": {"attribute_name": "著者", "attribute_type": "creator", "attribute_value_mlt": [{"creatorNames": [{"creatorName": "九津見, 毅"}], "nameIdentifiers": [{"nameIdentifier": "49994", "nameIdentifierScheme": "WEKO"}]}, {"creatorNames": [{"creatorName": "吉見, 毅彦"}], "nameIdentifiers": [{"nameIdentifier": "49995", "nameIdentifierScheme": "WEKO"}]}, {"creatorNames": [{"creatorName": "小谷, 克則"}], "nameIdentifiers": [{"nameIdentifier": "49996", "nameIdentifierScheme": "WEKO"}]}, {"creatorNames": [{"creatorName": "佐田, いち子"}], "nameIdentifiers": [{"nameIdentifier": "49997", "nameIdentifierScheme": "WEKO"}]}, {"creatorNames": [{"creatorName": "井佐原, 均"}], "nameIdentifiers": [{"nameIdentifier": "49998", "nameIdentifierScheme": "WEKO"}]}]}, "item_files": {"attribute_name": "ファイル情報", "attribute_type": "file", "attribute_value_mlt": [{"accessrole": "open_date", "date": [{"dateType": "Available", "dateValue": "2016-11-28"}], "displaytype": "detail", "download_preview_message": "", "file_order": 0, "filename": "oral-16.pdf", "filesize": [{"value": "413.1 kB"}], "format": "application/pdf", "future_date_message": "", "is_thumbnail": false, "licensetype": "license_free", "mimetype": "application/pdf", "size": 413100.0, "url": {"label": "oral-16.pdf", "url": "https://waseda.repo.nii.ac.jp/record/28855/files/oral-16.pdf"}, "version_id": "23103c1f-8e48-43f1-95b2-d0e3e4073069"}]}, "item_language": {"attribute_name": "言語", "attribute_value_mlt": [{"subitem_language": "eng"}]}, "item_resource_type": {"attribute_name": "資源タイプ", "attribute_value_mlt": [{"resourcetype": "conference paper", "resourceuri": "http://purl.org/coar/resource_type/c_5794"}]}, "item_title": "Integrated Use of Internal and External Evidence in the Alignment of Multi-Word Named Entities", "item_titles": {"attribute_name": "タイトル", "attribute_value_mlt": [{"subitem_title": "Integrated Use of Internal and External Evidence in the Alignment of Multi-Word Named Entities", "subitem_title_language": "en"}]}, "item_type_id": "10003", "owner": "3", "path": ["2080"], "permalink_uri": "http://hdl.handle.net/2065/572", "pubdate": {"attribute_name": "公開日", "attribute_value": "2008-04-28"}, "publish_date": "2008-04-28", "publish_status": "0", "recid": "28855", "relation": {}, "relation_version_is_last": true, "title": ["Integrated Use of Internal and External Evidence in the Alignment of Multi-Word Named Entities"], "weko_shared_id": -1}

Integrated Use of Internal and External Evidence in the Alignment of Multi-Word Named Entities

http://hdl.handle.net/2065/572

名前 / ファイル	ライセンス	アクション
oral-16.pdf (413.1 kB)

Item type

会議発表論文 / Conference Paper(1)

公開日

2008-04-28

タイトル

言語

タイトル

Integrated Use of Internal and External Evidence in the Alignment of Multi-Word Named Entities

言語

eng

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_5794

資源タイプ

conference paper

著者

著者別名

Sata, Ichiko
Isahara, Hitoshi

抄録

内容記述タイプ

Abstract

内容記述

This paper proposes a method of extracting English multi-word named entities and their Japanese equivalents from a parallel corpus. The aim of our research is to extract multi-word named entities which are not listed in a dictionary of an English-to-Japanese MT system and appear infrequently in a parallel corpus. Our method makes its alignment on the basis of two kinds of external evidence provided by the context in which a bilingual pair appears, as well as two kinds of internal evidence within the pair. Each evidence is accompanied by a score, and the aggregate score is computed as a weighted sum of the scores. The appropriate weights are estimated with the logistic regression analysis. An experiment using a parallel corpus of Yomiuri Shimbun and The Daily Yomiuri satisfactorily found that 86.36% of the extracted bilingual pairs with the highest scores were judged to be correct.

書誌情報

p. 187-196, 発行日 2005-11-16

件名

主題Scheme

NDC

主題

801.06

件名

主題Scheme

LCSH

主題

Computational linguistics--Congresses

出版者

Logico-Linguistic Society of Japan

データタイプ

内容記述タイプ

Other

内容記述

text

HDL URI

http://hdl.handle.net/2065/572

戻る

views

See details

	Views

Versions

Ver.1

2023-07-28 03:31:08.343547

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

Integrated Use of Internal and External Evidence in the Alignment of Multi-Word Named Entities

× 九津見, 毅

× 吉見, 毅彦

× 小谷, 克則

× 佐田, いち子

× 井佐原, 均

× Kutsumi, Takeshi

× Yoshimi, Takehiko

× Kotani, Katsunori

× Sata, Ichiko

× Isahara, Hitoshi

Versions

Share

Cite as

エクスポート