ABSTRACT
We describe a system that monitors social and mainstream media to determine shifts in what people are thinking about a product or company. We process over 100,000 news articles, blog posts, review sites, and tweets a day for mentions of items (e.g., products) of interest, extract phrases that are mentioned near them, and determine which of the phrases are of greatest possible interest to, for example, brand managers. Case studies show a good ability to rapidly pinpoint emerging subjects buried deep in large volumes of data and then highlight those that are rising or falling in significance as they relate to the firms interests. The tool and algorithm improves the signal-to-noise ratio and pinpoints precisely the opportunities and risks that matter most to communications professionals and their organizations.
Supplemental Material
- J. Allen. Topic detection and tracking: event-based information organization, Kluwer. 2002. Google ScholarDigital Library
- D. Blei and J. Lafferty. Dynamic topic models. In Proceedings of the 23rd International Conference on Machine Learning, 2006 Google ScholarDigital Library
- R. Feldman, M. Fresko, J. Goldenberg, O. Netzer and L. Ungar. "Extracting Product Comparisons from Discussion Boards," Proceedings of ICDM-2007, Omaha, NB. 2007. Google ScholarDigital Library
- N Glance, M Hurst, T Tomokiyo BlogPulse: Automated trend discovery for weblogs. WWW 2004 Workshop on the Weblogging Ecosystem. 2004.Google Scholar
- S. Guha, et al., "Clustering Data Streams: Theory and Practice," IEEE Trans. Knowl. Data Eng. 15(3): 515--528. 2003. Google ScholarDigital Library
- M. Hu and B. Liu. "Mining and Summarizing Customer Reviews," Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 168--177. 2004. Google ScholarDigital Library
- N. Jindal and B. Liu. "Mining comparative sentences and relations" Proceedings of AAAI'06. 2006. Google ScholarDigital Library
- A. Kontostathis, L. Galitsky, W. M. Pottenger, S. Roy, and D. J. Phelps. A survey of emerging trend detection in textual data mining. in Survey of Text Mining, pp 185--224. 2003Google Scholar
- J. Leskovec and J. Kleinberg. "Meme-tracking and the Dynamics of the News Cycle," KDD-09. 2009. Google ScholarDigital Library
- Loulwah AlSumait, Daniel Barbará, Carlotta Domeniconi, "On-line LDA: Adaptive Topic Models for Mining Text Streams with Applications to Topic Detection and Tracking," Eighth IEEE International Conference on Data Mining (ICDM), pp. 3--12. 2008. Google ScholarDigital Library
- A. McCallum. "Information Extraction: Distilling Structured Data from Unstructured Text," ACM Queue, 3(9). 2005. Google ScholarDigital Library
- A. McCallum and W. Li. "Early Results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-Enhanced Lexicons," Proceedings of CoNLL-2003, Edmonton, Canada, pp. 188--191. 2003 Google ScholarDigital Library
- D. Miller, R. Schwartz, R. Weischedel and R. Stone. "Named Entity Extraction from Broadcast News," Proceedings of DARPA Broadcast News Workshop, Herndon, VA. 1999.Google Scholar
- B. Pang, L. Lee and S. Vaithyanathan. "Thumbs up? Sentiment Classification Using Machine Learning Techniques," Proceedings of EMNLP-02, 7th Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Morristown, US, pp. 79--86. 2002. Google ScholarDigital Library
- A.-M. Popescu and O. Etzioni. "Extracting Product Features and Opinions from Reviews," Proceedings of HLT-EMNLP, pp. 339--346. 2005. Google ScholarDigital Library
- M.F. Porter. "An algorithm for suffix stripping," Program, 14(3) pp 130--137. 1980.Google ScholarCross Ref
- Wang, X. and McCallum, A. Topics over time: a non-Markov continuous-time model of topical trends, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining (KDD) pp. 424--433. 2006. Google ScholarDigital Library
Index Terms
- Discovery of significant emerging trends
Recommendations
Identify Emergent Trends Based on the Blogosphere
WI-IAT '13: Proceedings of the 2013 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 03Information about upcoming trends is a valuable knowledge for both, companies and individuals. Detecting trends for a certain topic is of special interest. According to the latest information over 200 million blogs exist in the World Wide Web. Hence, ...
Monitoring Trends on Facebook
DASC '11: Proceedings of the 2011 IEEE Ninth International Conference on Dependable, Autonomic and Secure ComputingThe public nature of user generated content on social media platforms offers the possibility for trend monitoring as an insight into the topics that attract the attention of a large fraction of users. While Google Trends and Twitter have already been ...
Buzz in Social Media: Detection of Short-lived Viral Phenomena
WWW '18: Companion Proceedings of the The Web Conference 2018Social media interaction happens in a broad variety of context and magnitude. The vast majority of posts cause little to no discussion, while some start trends and become viral. We study the virality, explicitly of "Buzzes" - posts that evoke intense ...
Comments