Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
37th (2023)
Session ID: 1U5-IS-2b-01
Conference information

Comparing Feature Extraction Methods for Sarcasm Detection in Twitter
*Jenq-Haur WANG, Rahmat Fadli ISNANTO
Conference proceedings and abstracts, free access

Abstract

Sarcasm detection is a challenging task: it aims to identify expressions whose intended meaning is the opposite of what is literally written. Most previous work measures only the sentiment polarity of sentences, but more features are needed to improve the results. In this paper, we compare different feature extraction methods for sarcasm detection, including n-gram, sentiment, punctuation, and part-of-speech features. First, sarcastic data are collected using the Twitter API and preprocessed by removing all hashtags, mentions, and URLs. Then, after all features are extracted, they are combined using one-hot encoding. Finally, we use two classification methods, Support Vector Machine and Logistic Regression, for comparison. In our experimental results, the n-gram feature gives the best performance among the individual features. Support Vector Machine performs better than Logistic Regression, with an F1-measure of 79.64%. This shows the potential of combining different features for sarcasm detection.
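
The preprocessing and two of the feature types named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the regular expressions, the function names (`preprocess`, `ngram_counts`, `punctuation_features`), the choice of punctuation cues, and the sample tweet are all assumptions made for illustration, and only the standard library is used.

```python
import re
from collections import Counter

# Assumed patterns for the three token types the abstract says are removed.
# URLs are stripped first so that a '#' or '@' inside a URL is not left behind.
URL_RE = re.compile(r"https?://\S+|www\.\S+")
MENTION_RE = re.compile(r"@\w+")
HASHTAG_RE = re.compile(r"#\w+")


def preprocess(tweet: str) -> str:
    """Remove URLs, mentions, and hashtags, then collapse whitespace."""
    for pattern in (URL_RE, MENTION_RE, HASHTAG_RE):
        tweet = pattern.sub(" ", tweet)
    return " ".join(tweet.split())


def ngram_counts(text: str, n: int = 2) -> Counter:
    """Count word n-grams over lowercased, whitespace-split tokens."""
    tokens = text.lower().split()
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def punctuation_features(text: str) -> dict:
    """Count a few punctuation cues (a hypothetical feature set)."""
    return {
        "exclamations": text.count("!"),
        "questions": text.count("?"),
        "ellipses": text.count("..."),
    }


# Made-up example tweet, not taken from the paper's data.
tweet = "Oh great, another Monday!!! @boss #sarcasm https://t.co/xyz"
clean = preprocess(tweet)
print(clean)
print(ngram_counts(clean, n=2))
print(punctuation_features(clean))
```

Feature dictionaries like these would then be turned into fixed-length vectors and passed to a classifier such as a Support Vector Machine or Logistic Regression; how the paper encodes and combines them beyond "one-hot encoding" is not specified in the abstract.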

© 2023 The Japanese Society for Artificial Intelligence