Kodo Keiryogaku (The Japanese Journal of Behaviormetrics)
Online ISSN : 1880-4705
Print ISSN : 0385-5481
ISSN-L : 0385-5481
Original Article
Accuracy of Author Identification by Text Mining and Standardization of the Judgment Procedure
Wataru Zaitsu, Mingzhe Jin
Free access

2018, Volume 45, Issue 1, pp. 39-47

Abstract

This study examined the accuracy of author identification by text mining. We conducted 16 analyses (four stylistic features × four multivariate analyses) on texts written by 100 bloggers, each approximately 1,000 characters long. Specifically, we applied (1) principal component analysis, (2) correspondence analysis, (3) multidimensional scaling, and (4) hierarchical cluster analysis to each of the following stylistic features: (a) the usage rate of non-independent words, (b) part-of-speech bigrams, (c) bigrams of postpositional particles, and (d) the positioning of commas. Identification accuracy was high: sensitivity was 100% and specificity was 95.1%. Furthermore, neither age nor gender affected the accuracy of author identification.
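As a rough illustration of two ideas named in the abstract, the sketch below computes a simple comma-positioning profile and the sensitivity and specificity of a set of same-author judgments. It is not the authors' procedure: the abstract does not define the comma feature or the decision rule, so the feature definition (relative frequency of the character immediately before each comma), the function names `comma_profile` and `sensitivity_specificity`, and the toy inputs are all assumptions made for this example.

```python
from collections import Counter


def comma_profile(text, comma="、"):
    """Relative frequency of the character immediately before each comma.

    Assumed feature definition for illustration; the paper may define
    comma positioning differently (for example, by preceding morpheme).
    """
    preceding = [text[i - 1] for i, ch in enumerate(text) if ch == comma and i > 0]
    total = len(preceding)
    if total == 0:
        return {}
    counts = Counter(preceding)
    return {ch: n / total for ch, n in counts.items()}


def sensitivity_specificity(truth, predicted):
    """Return (sensitivity, specificity) for boolean same-author judgments.

    truth[i] is True when pair i really shares an author;
    predicted[i] is True when the procedure judged "same author".
    Assumes at least one true positive case and one true negative case.
    """
    pairs = list(zip(truth, predicted))
    tp = sum(1 for t, p in pairs if t and p)
    fn = sum(1 for t, p in pairs if t and not p)
    tn = sum(1 for t, p in pairs if not t and not p)
    fp = sum(1 for t, p in pairs if not t and p)
    return tp / (tp + fn), tn / (tn + fp)


# Toy usage with made-up inputs (not data from the study).
profile = comma_profile("私は、猫が、私は、")
sens, spec = sensitivity_specificity(
    [True, True, False, False, False],
    [True, True, False, True, False],
)
```

In this framing, sensitivity is the share of genuinely same-author cases that are judged as such, and specificity is the share of different-author cases that are correctly rejected, which is how the 100% and 95.1% figures in the abstract would be read.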

© 2018 The Behaviormetric Society of Japan (日本行動計量学会)