Archives

Text Analysis Via Composite Feature Extraction


Dr.V. Sellam, Aditya Rohan Das, Ashu Kumar and Yashaswi Rahut,
Abstract

Words are fundamental semantic units in content, expressions, and articulations contain extra data, which is significant for content grouping. To separate this data, customary calculations search for composite highlights, for example, word successions or co-events, utilizing techniques, for example, bigrams and sets of terms, however overlook the impact of punctuation and accentuation, which diminishes the quality and precision of the final product. Right now, utilization of composite component extraction is proposed and choice of the vital highlights is done as needs be to characterize content. Termsets that sidestep accentuation stamps or punctuations in the content get rejected. To kill prolixity, a new measure consisting of two variables for sentence-level talk examination is proposed. One of the variables is utilized to quantify the pertinence; the other factor is utilized to expand the estimations of composite highlights, whose class frequencies are a lot less than their sub-highlights.

Volume 12 | Issue 4

Pages: 310-320

DOI: 10.5373/JARDCS/V12I4/20201445