Published April 6, 2022 | Version 3.1
Journal article Open

Automatic Topic Title Predicting from News Articles Using Semantic-Based NMF Model

  • 1. Department of Computer Science & I.T University of Balochistan, Quetta, Pakistan.

Description

Social medical being a predominant form of communication, millions of texts in terms of news articles, tweets, and snippets are generated worldwide every hour. From them discovering concise and useful knowledge has caught the interest from both academia and the business industry. Since the text document has an infinite amount of contextual data and it is sparse and ambiguous, therefore, learning topics automatically from them is a significant issue. To address this problem, this research paper proposes a semantic-based non-negative matrix factorization (NMF) model for extracting concise and meaningful topic titles for the text to grasp the whole text theme. The model is efficiently integrated with the semantic correlations between words and their context, which are learned through skip-gram. The NMF method is used to tackle this issue by using a block coordinate algorithm. In terms of topic coherence, extensive quantitative evaluations of the proposed models on a variety of real-world text datasets show that they outperform various state-of-the-art methods. The interpretability of these models demonstrated by qualitative semantic analysis, which identifies significant and consistent topics. It is an effective standard topic model for unstructured sparse text due to its superior performance and simple construction.

Files

LCJSTEM-123.pdf

Files (1.0 MB)

Name Size Download all
md5:ec66232754f3570eb591492239c33d2a
1.0 MB Preview Download