Research Article Open Access

An Improved Approach for Topic Ontology Based Categorization of Blogs Using Support Vector Machine

V. Subramaniyaswamy and S. Chenthur Pandian

Abstract

Problem statement: Information search, collection and categorization from the blogosphere are still one of the important issues to be resolved. Mainly, the blogs assist the variety of interesting and useful information. Because of its increasing growth, blogs can not be categorized effectively. Therefore it is difficult to find relevant topics from the blogs. Hence blogs need to be categorized topically to make easy for readers. Approach: Blog contents are associated with a set of predefined topic ontology keywords. This study proposes categorization of blogs to facilitate easy identification of user expected topic from the massive collection of blogs. Tags, page contents were collected as inputs from the blogs and the blogs were categorized using Support Vector Machine (SVM) algorithm. Most frequent occurrences of topic ontological keywords are used to train the classifier. This approach has effectively improved blog categorization process using SVM. Results: The performance was evaluated for precision and recall for blog categorization based on topic ontology using SVM with Naive Bayes algorithm. It was proved that topic ontology assisted SVM improves the classification accuracy than Naïve Bayes algorithm. Conclusion: This study has effectively improved the classification of blogs based on topic ontology assisted SVM. Experiments showed the effectiveness of the blog categorization.

Journal of Computer Science
Volume 8 No. 2, 2012, 251-258

DOI: https://doi.org/10.3844/jcssp.2012.251.258

Submitted On: 22 August 2011 Published On: 29 November 2011

How to Cite: Subramaniyaswamy, V. & Pandian, S. C. (2012). An Improved Approach for Topic Ontology Based Categorization of Blogs Using Support Vector Machine. Journal of Computer Science, 8(2), 251-258. https://doi.org/10.3844/jcssp.2012.251.258

  • 3,449 Views
  • 3,800 Downloads
  • 8 Citations

Download

Keywords

  • Blogosphere
  • blog content and tagging
  • topic ontology
  • machine learning
  • blog categorization
  • Support Vector Machine (SVM)
  • naive bayes algorithm