Neurocomputing

Volume 388, 7 May 2020, Pages 135-143

Aspect-based sentiment classification with multi-attention network

https://doi.org/10.1016/j.neucom.2020.01.024

Abstract

Aspect-based sentiment classification aims to predict the sentiment polarity of an aspect term in a sentence rather than the sentiment polarity of the entire sentence. Neural networks have been used for this task, and most existing methods adopt sequence models, which require more training time than other models. When an aspect term comprises several words, most methods model the aspect with a coarse-level attention mechanism, which may result in information loss. In this paper, we propose a multi-attention network (MAN) to address these problems. The proposed model uses intra- and inter-level attention mechanisms. In the former, MAN employs a transformer encoder instead of a sequence model to reduce training time; the transformer encoder encodes the input sentence in parallel and preserves long-distance sentiment relations. In the latter, MAN uses global and local attention modules to capture interactive information between aspect and context at different granularities. The global attention module captures the entire aspect-context relation, whereas the local attention module considers interactions at the word level, which were often neglected in previous studies. Experiments demonstrate that the proposed model achieves superior results compared to the baseline models.

Introduction

Aspect-based sentiment classification is a fine-grained task in aspect-based sentiment analysis (ABSA). Instead of predicting the sentiment polarity of an entire sentence, it determines the sentiment polarity of a specific aspect in the sentence [1]. For example, in the sentence ‘This is a high-speed computer, but it has short battery life’, the sentiment polarities of the aspects ‘speed’ and ‘battery life’ are positive and negative, respectively. Aspect-based sentiment classification overcomes a limitation of sentence-level sentiment classification: when a sentence contains more than one aspect, the sentiment polarity of each aspect may differ. ABSA consists of two stages: aspect extraction [2], [3], [4], [5], [6], [7] and sentiment classification [8], [9]. The former identifies the aspects that appear in reviews, and the latter classifies the opinions about these aspects. In this study, we focus only on sentiment classification.

Recently, sequence models such as long short-term memory (LSTM) [10] and gated recurrent units [11] have been successfully applied to aspect-based sentiment classification [9], [12], [13]. Despite their effectiveness, sequence models encode words one at a time, which is time-consuming. To overcome this, Xue and Li [14] proposed a parallelisable solution based on convolutional neural networks (CNNs). Although CNNs are effective in reducing training time, they cannot capture long-distance relations in sentences. In addition, aspect-level sentiment polarity depends on both the review context and the aspect. Some models utilise an attention mechanism to incorporate aspect information [15], [16], [17]. However, most of them treat all aspect words as a whole. When an aspect contains several words, these approaches ignore the differing importance of the words in the aspect phrase, resulting in information loss. For example, the aspect in the sentence ‘This place has many different styles of pizza, and they are all amazing’ contains three words. In the aspect phrase ‘styles of pizza’, ‘of’ contributes less than ‘styles’ and ‘pizza’, so treating the three aspect words as equally important is inappropriate.

In this paper, we propose a multi-attention network (MAN) to address the aforementioned issues. MAN is a parallelisable model, as no sequence model is involved. It contains an intra-level and an inter-level attention mechanism. The former learns word representations through a transformer encoder [18], which is based on a self-attention mechanism that processes context and aspect in parallel. Self-attention also allows MAN to handle long-distance dependencies because it considers every pair of words in a sentence. The latter employs global and local attention to capture coarse- and fine-grained interactive information between aspect and context: global attention captures the entire interaction, whereas local attention captures the word-level interaction between aspect and context words. The main contributions of this study can be summarised as follows:

  • We propose a novel model (MAN) that processes the words in review sentences in parallel using an attention mechanism. The proposed model requires significantly less training time than sequence models and can effectively capture long-distance dependencies in sentences through self-attention.

  • MAN introduces global and local attention modules to capture interactions between aspect and context at different levels. The local attention module accounts for the differing importance of aspect words.

  • We evaluated MAN on several datasets, namely laptop, restaurant, and Twitter reviews. Experiments demonstrate that the proposed model achieves superior results compared to the baseline models.
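The intra-level mechanism described above is the scaled dot-product self-attention of the transformer encoder [18]. The following is a minimal single-head sketch in plain Python, omitting the learned query/key/value projections and multi-head machinery of the full encoder for brevity:

```python
import math

def self_attention(X):
    """Single-head scaled dot-product self-attention over a sequence.

    X is a list of d-dimensional word vectors (lists of floats).
    Every position attends to every other position, so long-distance
    relations are captured in a single parallel step.
    """
    d = len(X[0])
    n = len(X)
    # Attention scores: dot product of every pair of positions, scaled by sqrt(d).
    scores = [[sum(q * k for q, k in zip(X[i], X[j])) / math.sqrt(d)
               for j in range(n)] for i in range(n)]
    # Row-wise softmax turns scores into attention weights.
    weights = []
    for row in scores:
        m = max(row)
        exps = [math.exp(s - m) for s in row]
        z = sum(exps)
        weights.append([e / z for e in exps])
    # Each output vector is a weighted sum of all input vectors.
    return [[sum(w * X[j][k] for j, w in enumerate(row)) for k in range(d)]
            for row in weights]

# Three toy word vectors (d = 2).
out = self_attention([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
```

Because all pairwise scores are computed independently, the whole sequence can be encoded in parallel, and the interaction between two words does not weaken with their distance in the sentence.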

The rest of this paper is organised as follows. In Section 2, we review related work. In Section 3, we define the problem of aspect-based sentiment classification and present the proposed model in detail. Section 4 reports experiments and evaluations. Section 5 concludes this paper.

Section snippets

Related work

In this section, we review related work in three parts. First, we discuss the particularities of aspect-based sentiment classification and existing methods for it. Second, we present recent neural network approaches to the task. Third, we describe attention mechanisms used for aspect-based sentiment classification.

Multi-attention network for aspect-based sentiment classification

This section presents the structure of MAN. The overall architecture is shown in Fig. 1. It consists of input embedding, multi-attention and output layers.
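The global and local attention modules in the multi-attention layer can be illustrated with a minimal sketch. The scoring and pooling choices below (unscaled dot-product scores, mean pooling of aspect words and of per-word interaction vectors) are assumptions for illustration, not the paper's exact formulation:

```python
import math

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    z = sum(exps)
    return [e / z for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def global_attention(context, aspect):
    """Coarse-grained: the averaged aspect vector attends over the whole context."""
    d = len(aspect[0])
    avg = [sum(v[k] for v in aspect) / len(aspect) for k in range(d)]
    w = softmax([dot(avg, c) for c in context])
    return [sum(wi * c[k] for wi, c in zip(w, context)) for k in range(d)]

def local_attention(context, aspect):
    """Fine-grained: each aspect word attends to the context separately,
    so less important aspect words (e.g. 'of') can receive lower weight."""
    d = len(aspect[0])
    pooled = []
    for a in aspect:
        w = softmax([dot(a, c) for c in context])
        pooled.append([sum(wi * c[k] for wi, c in zip(w, context)) for k in range(d)])
    # Aggregate the per-word interaction vectors (mean pooling, an assumption).
    return [sum(p[k] for p in pooled) / len(pooled) for k in range(d)]

ctx = [[0.2, 0.8], [0.9, 0.1]]          # two context word vectors
asp = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]  # a three-word aspect phrase
g = global_attention(ctx, asp)
l = local_attention(ctx, asp)
```

The contrast is that global attention collapses the aspect phrase into a single query before attending, whereas local attention keeps one query per aspect word, preserving the word-level interactions motivated in the introduction.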

Datasets

We evaluate MAN on five datasets: laptop2014, restaurant2014, restaurant2015, restaurant2016, and twitter. The first two datasets are from SemEval 2014 Task 4 and consist of reviews of laptops and restaurants. Restaurant2015 and restaurant2016 are reviews of restaurants from SemEval 2015 Task 12 and SemEval 2016 Task 5, respectively.

Conclusion and future work

We proposed a novel multi-attention network (MAN) for aspect-based sentiment classification. MAN requires less training time than sequence models because it processes the input sentence in parallel. Compared with convolutional models, MAN can effectively capture long-range sentiment relations. Moreover, it uses global and local attention mechanisms to capture interactive relations between aspect and context at different granularities. The global attention mechanism computes the entire interaction between aspect and context.

CRediT authorship contribution statement

Qiannan Xu: Conceptualization, Methodology, Software, Data curation, Writing - original draft, Writing - review & editing. Li Zhu: Resources, Writing - review & editing, Funding acquisition. Tao Dai: Writing - original draft, Writing - review & editing. Chengbing Yan: Project administration, Writing - original draft.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgment

This research was supported by the National Key Research and Development Projects (Nos. 2018AAA0101100 and 2019YFB2102500).

Qiannan Xu received her B.E. degree in Computer Science and Technology from Southwestern University of Finance and Economics, China, in 2017. She is currently pursuing the M.S. degree in the School of Software Engineering at Xi’an Jiaotong University. Her main research interests include sentiment analysis and natural language processing.

References (57)

  • D. Ma et al., Exploring sequence-to-sequence learning in aspect term extraction, in: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 2019.
  • H. Xu et al., Double embeddings and CNN-based sequence labeling for aspect extraction, in: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2018.
  • J. Wang et al., Aspect sentiment classification with both word-level and clause-level attention networks, in: Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018.
  • X. Li et al., Transformation networks for target-oriented sentiment classification, in: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018.
  • S. Hochreiter et al., Long short-term memory, Neural Comput. (1997).
  • J. Chung, Ç. Gülçehre, K. Cho, Y. Bengio, Empirical evaluation of gated recurrent neural networks on sequence modeling, ...
  • Y. Wang et al., Attention-based LSTM for aspect-level sentiment classification, in: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016.
  • D. Tang et al., Effective LSTMs for target-dependent sentiment classification, in: Proceedings of the 26th International Conference on Computational Linguistics: Technical Papers, 2016.
  • W. Xue et al., Aspect based sentiment analysis with gated convolutional networks, in: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018.
  • D. Tang et al., Aspect level sentiment classification with deep memory network, in: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016.
  • D. Ma et al., Interactive attention networks for aspect-level sentiment classification, in: Proceedings of the International Joint Conference on Artificial Intelligence, 2017.
  • Y. Cui et al., Attention-over-attention neural networks for reading comprehension, in: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017.
  • A. Vaswani et al., Attention is all you need, in: Proceedings of Neural Information Processing Systems, 2017.
  • K. Schouten et al., Survey on aspect-level sentiment analysis, IEEE Trans. Knowl. Data Eng. (2016).
  • M. Dragoni et al., A neural word embeddings approach for multi-domain sentiment analysis, IEEE Trans. Affect. Comput. (2017).
  • A. Tripathy et al., Document-level sentiment classification using hybrid machine learning approach, Knowl. Inf. Syst. (2017).
  • D. Ma et al., Cascading multiway attentions for document-level sentiment classification, in: Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017.
  • F. Wu et al., Sentence-level sentiment classification with weak supervision, in: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017.

    Li Zhu received his Ph.D. degree in Computer System Architecture from Xi’an Jiaotong University, China, in 2000. He is currently a Professor in the School of Software Engineering at Xi’an Jiaotong University. His research interests include machine learning and computer networking.

    Tao Dai received his B.E. and M.S. degree in Software Engineering from Xi’an Jiaotong University, China, in 2008 and 2011, respectively. He is currently a Ph.D. candidate in the School of Software Engineering at Xi’an Jiaotong University. His main research interests include machine learning and information retrieval.

    Chengbing Yan received her B.E. degree in Computer Science and Technology from Sun Yat-Sen University, China, in 2017. She is currently pursuing the M.S. degree in the School of Software Engineering at Xi’an Jiaotong University. Her main research interests include machine learning and image processing.
