skip to main content
10.1145/1835804.1835815acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Discovery of significant emerging trends

Published:25 July 2010Publication History

ABSTRACT

We describe a system that monitors social and mainstream media to determine shifts in what people are thinking about a product or company. We process over 100,000 news articles, blog posts, review sites, and tweets a day for mentions of items (e.g., products) of interest, extract phrases that are mentioned near them, and determine which of the phrases are of greatest possible interest to, for example, brand managers. Case studies show a good ability to rapidly pinpoint emerging subjects buried deep in large volumes of data and then highlight those that are rising or falling in significance as they relate to the firms interests. The tool and algorithm improves the signal-to-noise ratio and pinpoints precisely the opportunities and risks that matter most to communications professionals and their organizations.

Skip Supplemental Material Section

Supplemental Material

kdd2010_ungar_dset_01.mov

mov

186.6 MB

References

  1. J. Allen. Topic detection and tracking: event-based information organization, Kluwer. 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. D. Blei and J. Lafferty. Dynamic topic models. In Proceedings of the 23rd International Conference on Machine Learning, 2006 Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. R. Feldman, M. Fresko, J. Goldenberg, O. Netzer and L. Ungar. "Extracting Product Comparisons from Discussion Boards," Proceedings of ICDM-2007, Omaha, NB. 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. N Glance, M Hurst, T Tomokiyo BlogPulse: Automated trend discovery for weblogs. WWW 2004 Workshop on the Weblogging Ecosystem. 2004.Google ScholarGoogle Scholar
  5. S. Guha, et al., "Clustering Data Streams: Theory and Practice," IEEE Trans. Knowl. Data Eng. 15(3): 515--528. 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. Hu and B. Liu. "Mining and Summarizing Customer Reviews," Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 168--177. 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. N. Jindal and B. Liu. "Mining comparative sentences and relations" Proceedings of AAAI'06. 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. Kontostathis, L. Galitsky, W. M. Pottenger, S. Roy, and D. J. Phelps. A survey of emerging trend detection in textual data mining. in Survey of Text Mining, pp 185--224. 2003Google ScholarGoogle Scholar
  9. J. Leskovec and J. Kleinberg. "Meme-tracking and the Dynamics of the News Cycle," KDD-09. 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Loulwah AlSumait, Daniel Barbará, Carlotta Domeniconi, "On-line LDA: Adaptive Topic Models for Mining Text Streams with Applications to Topic Detection and Tracking," Eighth IEEE International Conference on Data Mining (ICDM), pp. 3--12. 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. McCallum. "Information Extraction: Distilling Structured Data from Unstructured Text," ACM Queue, 3(9). 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A. McCallum and W. Li. "Early Results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-Enhanced Lexicons," Proceedings of CoNLL-2003, Edmonton, Canada, pp. 188--191. 2003 Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. D. Miller, R. Schwartz, R. Weischedel and R. Stone. "Named Entity Extraction from Broadcast News," Proceedings of DARPA Broadcast News Workshop, Herndon, VA. 1999.Google ScholarGoogle Scholar
  14. B. Pang, L. Lee and S. Vaithyanathan. "Thumbs up? Sentiment Classification Using Machine Learning Techniques," Proceedings of EMNLP-02, 7th Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Morristown, US, pp. 79--86. 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. A.-M. Popescu and O. Etzioni. "Extracting Product Features and Opinions from Reviews," Proceedings of HLT-EMNLP, pp. 339--346. 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. M.F. Porter. "An algorithm for suffix stripping," Program, 14(3) pp 130--137. 1980.Google ScholarGoogle ScholarCross RefCross Ref
  17. Wang, X. and McCallum, A. Topics over time: a non-Markov continuous-time model of topical trends, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining (KDD) pp. 424--433. 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Discovery of significant emerging trends

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader