Abstract
The TREC Blog track aims to explore information seeking behaviour in the blogosphere, by building reusable test collections for blog-related search tasks. Since, its advent in TREC 2006, the Blog track has led to much research in this growing field, and encapsulated cross-pollination from natural language processing research. This paper recaps on the tasks addressed at the TREC Blog track thus far, covering the period 2006 - 2009. In particular, we describe the used corpora, the tasks addressed within the track, and the resulting published research.
- G. Amati, E. Ambrosi, M. Bianchi, C. Gaibisso, and G. Gambosi. Automatic construction of an opinion-term vocabulary for ad hoc retrieval. In Proceedings of the 30th European Conference on IR Research on Advances in Information Retrieval (ECIR 2008), pages 89--100, 2008. Google ScholarDigital Library
- J. Arguello, J. Elsas, J. Callan, and J. Carbonell. Document representation and query expansion models for blog recommendation. In Proceedings of the International Conference on Weblogs and Social Media (ICWSM 2007). AAAI, 2008.Google Scholar
- P. Bailey, N. Craswell, A. P. de Vries, and I. Soboroff. Overview of the TREC-2007 Enterprise track. In Proceedings of the 16th Text REtrieval Conference (TREC 2007), 2007.Google Scholar
- K. Balog, L. Azzopardi, and M. de Rijke. Formal models for expert finding in enterprise corpora. In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2006), pages 43--50. ACM, 2006. Google ScholarDigital Library
- K. Balog, M. de Rijke, and W. Weerkamp. Bloggers as experts: Feed distillation using expert retrieval models. In Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), pages 753--754. ACM, 2008. Google ScholarDigital Library
- F. Cacheda, V. Plachouras, and I. Ounis. A case study of distributed information retrieval architectures to index one terabyte of text. Information Processing and Management, 41(5):1141--1161, 2005. Google ScholarDigital Library
- J. Callan. Distributed information retrieval. In W. B. Croft, editor, Advances in Information Retrieval, chapter 5, pages 127--150. Kluwer Academic Publishers, 2000.Google Scholar
- C. L. A. Clarke, N. Craswell, and I. Soboroff. Overview of the TREC 2009 Web track. In Proceedings of the 18th Text REtrieval Conference (TREC 2009), 2010.Google Scholar
- C. L. A. Clarke, M. Kolla, G. V. Cormack, O. Vechtomova, A. Ashkan, S. Büttcher, and I. MacKinnon. Novelty and diversity in information retrieval evaluation. In Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2008), pages 659--666. ACM, 2008. Google ScholarDigital Library
- J. L. Elsas, J. Arguello, J. Callan, and J. G. Carbonell. Retrieval and feedback models for blog feed search. In Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), pages 347--354. ACM, 2008. Google ScholarDigital Library
- S. Gerani, M. Carman, and F. Crestani. Proximity based opinion retrieval. In Proceedings of the 33rd annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2010), 2010. Google ScholarDigital Library
- D. Gruhl, R. Guha, D. Liben-Nowell, and A. Tomkins. Information diffusion through blogspace. In Proceedings of the 13th international conference on World Wide Web (WWW 2004), pages 491--501. ACM, 2004. Google ScholarDigital Library
- B. He, C. Macdonald, J. He, and I. Ounis. An effective statistical approach to blog post opinion retrieval. In Proceeding of the 17th ACM Conference on Information and Knowledge Management (CIKM 2008), pages 1063--1072. ACM, 2008. Google ScholarDigital Library
- B. He, C. Macdonald, and I. Ounis. Ranking opinionated blog posts using OpinionFinder. In Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), pages 727--728. ACM, 2008. Google ScholarDigital Library
- M. A. Hearst, M. Hurst, and S. T. Dumais. What should blog search look like? In Proceedings of the 1st International Workshop on Search in Social Media (SSM 2008), pages 95--98. ACM, 2008. Google ScholarDigital Library
- X. Huang and W. B. Croft. A unified relevance model for opinion retrieval. In Proceedings of the 18th ACM conference on Information and knowledge management (CIKM 2009), pages 947--956. ACM, 2009. Google ScholarDigital Library
- A. Java, P. Kolari, T. Finin, A. Joshi, and T. Oates. Feeds That Matter: A Study of Bloglines Subscriptions. In Proceedings of the International Conference on Weblogs and Social Media (ICWSM 2007). Computer Science and Electrical Engineering, University of Maryland, Baltimore County, March 2007.Google Scholar
- M. Keikha, M. J. Carman, and F. Crestani. Blog distillation using random walks. In Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2009), pages 638--639. ACM, 2009. Google ScholarDigital Library
- M. Keikha and F. Crestani. Effectiveness of aggregation methods in blog distillation. In Proceedings of the 8th International Conference on Flexible Query Answering Systems (FQAS 2009), pages 157--167. Springer-Verlag, 2009. Google ScholarDigital Library
- A. C. König, M. Gamon, and Q. Wu. Click-through prediction for news queries. In Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2009), pages 347--354. ACM, 2009. Google ScholarDigital Library
- R. Kumar, J. Novak, P. Raghavan, and A. Tomkins. On the bursty evolution of blogspace. In Proceedings of the 12th international conference on World Wide Web (WWW 2003), pages 568--576. ACM, 2003. Google ScholarDigital Library
- J. Lafferty and C. Zhai. Document language models, query models, and risk minimization for information retrieval. In Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2001), pages 111--119. ACM, 2001. Google ScholarDigital Library
- V. Lavrenko and W. B. Croft. Relevance based language models. In Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2001), pages 120--127. ACM, 2001. Google ScholarDigital Library
- Y. Lee, H.-Y. Jung, W. Song, and J.-H. Lee. Mining the blogosphere for top news stories identification. In Proceedings of the 33rd annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2010), 2010. Google ScholarDigital Library
- Y. Lee, S.-H. Na, and J.-H. Lee. An improved feedback approach using relevant local posts for blog feed retrieval. In Proceeding of the 18th ACM conference on Information and knowledge management (CIKM 2009), pages 1971--1974. ACM, 2009. Google ScholarDigital Library
- C. Macdonald and I. Ounis. The TREC Blogs06 collection: creating and analysing a blog test collection. Technical Report TR-2006-224, Department of Computing Science, University of Glasgow, 2006.Google Scholar
- C. Macdonald and I. Ounis. Voting for candidates: adapting data fusion techniques for an expert search task. In Proceedings of the 15th ACM International Conference on Information and Knowledge Management (CIKM 2006), pages 387--396. ACM, 2006. Google ScholarDigital Library
- C. Macdonald and I. Ounis. Key blog distillation: ranking aggregates. In Proceedings of the 17th ACM conference on Information and knowledge management (CIKM 2008), pages 1043--1052. ACM, 2008. Google ScholarDigital Library
- C. Macdonald and I. Ounis. Searching for expertise: Experiments with the voting model. Computer Journal: Special Focus on Profiling Expertise and Behaviour, 52(7):729--748, 2009. Google ScholarDigital Library
- C. Macdonald, I. Ounis, and I. Soboroff. Overview of the TREC 2007 Blog track. In Proceedings of the 16th Text REtrieval Conference (TREC 2007), 2007.Google Scholar
- C. Macdonald, I. Ounis, and I. Soboroff. Is spam an issue for opinionated blog post search? In Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2009), pages 710--711. ACM, 2009. Google ScholarDigital Library
- C. Macdonald, I. Ounis, and I. Soboroff. Overview of the TREC 2009 Blog track. In Proceedings of the 18th Text REtrieval Conference (TREC 2009), 2009.Google Scholar
- R. M. C. McCreadie, C. Macdonald, and I. Ounis. News article ranking: Leveraging the wisdom of bloggers. In Proceedings of the 9th International Conference on Computer-Assisted Information Retrieval (RIAO 2010), 2010. Google ScholarDigital Library
- J. McLean. State of the Blogosphere, introduction, 2009. http://technorati.com/blogging/article/state-of-the-blogosphere-2009-introduction.Google Scholar
- D. Metzler and W. B. Croft. A Markov random field model for term dependencies. In Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2005), pages 472--479. ACM, 2005. Google ScholarDigital Library
- G. Mishne and M. de Rijke. A study of blog search. In Proceedings of the 28th European Conference on Information Retrieval (ECIR 2006), pages 289--301. Springer, 2006. Google ScholarDigital Library
- S.-H. Na, Y. Lee, S.-H. Nam, and J.-H. Lee. Improving opinion retrieval based on query-specific sentiment lexicon. In Proceedings of the 31st European Conference on IR Research on Advances in Information Retrieval (ECIR 2009), pages 734--738. Springer-Verlag, 2009. Google ScholarDigital Library
- I. Ounis, C. Macdonald, M. de Rijke, G. Mishne, and I. Soboroff. Overview of the TREC 2006 Blog track. In Proceedings of the 15th Text REtrieval Conference, 2006.Google Scholar
- I. Ounis, C. Macdonald, and I. Soboroff. On TREC Blog track. In Proceedings of the International Conference on Weblogs and Social Media (ICWSM 2008). AAAI, 2008.Google Scholar
- I. Ounis, C. Macdonald, and I. Soboroff. Overview of the TREC 2008 Blog track. In Proceedings of the 17th Text REtrieval Conference (TREC 2008), 2008.Google Scholar
- B. Pang and L. Lee. Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval, 2(1-2):1--135, 2008. Google ScholarDigital Library
- R. L. T. Santos, B. He, C. Macdonald, and I. Ounis. Integrating proximity to subjective sentences for blog opinion retrieval. In Proceedings of the 31st European Conference on IR Research on Advances in Information Retrieval (ECIR 2009), pages 325--336. Springer, 2009. Google ScholarDigital Library
- H. Sayyadi, M. Hurst, and A. Maykov. Event detection and tracking in social streams. In Proceedings of the International Conference on Weblogs and Social Media (ICWSM 2009). AAAI, 2009.Google ScholarCross Ref
- K. Seki and K. Uehara. Adaptive subjective triggers for opinionated document retrieval. In Proceedings of the Second ACM International Conference on Web Search and Data Mining (WSDM 2009), pages 25--33. ACM, 2009. Google ScholarDigital Library
- J. Seo and W. B. Croft. Blog site search using resource selection. In Proceeding of the 17th ACM Conference on Information and Knowledge Management (CIKM 2008), pages 1053--1062. ACM, 2008. Google ScholarDigital Library
- M. Thelwall. Bloggers during the London attacks: Top information sources and topics. In Proceedings of the 3rd International Workshop on the Weblogging Ecosystem (WWE 2006), 2006.Google Scholar
- O. Vechtomova. Facet-based opinion retrieval from blogs. Information Processing and Management, 46(1):71--88, 2010. Google ScholarDigital Library
- E. M. Voorhees. TREC: Continuing information retrieval's tradition of experimentation. Commun. ACM, 50(11):51--54, 2007. Google ScholarDigital Library
- E. M. Voorhees and L. P. Buckland, editors. Proceedings of the 15th Text REtrieval Conference (TREC 2006), 2007.Google ScholarCross Ref
- E. M. Voorhees and L. P. Buckland, editors. Proceedings of the 16th Text REtrieval Conference (TREC 2007), 2008.Google ScholarCross Ref
- E. M. Voorhees and L. P. Buckland, editors. Proceedings of the 17th Text REtrieval Conference (TREC 2008), 2009.Google Scholar
- E. M. Voorhees and L. P. Buckland, editors. Proceedings of the 18th Text REtrieval Conference (TREC 2009), 2010.Google Scholar
- E. M. Voorhees and D. K. Harman. TREC: Experiment and Evaluation in Information Retrieval. MIT Press, 2005. Google ScholarDigital Library
- W. Weerkamp, K. Balog, and M. de Rijke. Finding key bloggers, one post at a time. In Proceedings of the 18th Conference on Artificial Intelligence (ECAI 2008), pages 318--322. IOS Press, 2008. Google ScholarDigital Library
- T. Wilson, P. Hoffmann, S. Somasundaran, J. Kessler, J. Wiebe, Y. Choi, C. Cardie, E. Riloff, and S. Patwardhan. OpinionFinder: A system for subjectivity analysis. In Proceedings of HLT/EMNLP on Interactive Demonstrations, pages 34--35. Association for Computational Linguistics, 2005. Google ScholarDigital Library
- P. Winn. State of the Blogosphere, introduction, 2008. http://technorati.com/blogging/article/state-of-the-blogosphere-introduction.Google Scholar
- Y. Yang, T. Pierce, and J. Carbonell. A study of retrospective and on-line event detection. In Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR 1998), pages 28--36. ACM, 1998. Google ScholarDigital Library
- M. Zhang and X. Ye. A generation model to unify topic relevance and lexicon-based sentiment for opinion retrieval. In Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR 2008), pages 411--418. ACM, 2008. Google ScholarDigital Library
- W. Zhang, L. Jia, C. Yu, and W. Meng. Improve the effectiveness of the opinion retrieval and opinion polarity classification. In Proceedings of the 17th ACM conference on Information and knowledge management (CIKM 2008), pages 1415--1416. ACM, 2008. Google ScholarDigital Library
- W. Zhang, C. Yu, and W. Meng. Opinion retrieval from blogs. In Proceedings of the sixteenth ACM conference on Conference on information and knowledge management (CIKM 2007), pages 831--840. ACM, 2007. Google ScholarDigital Library
- X. Zhang, Z. Zhou, and M. Wu. Positive, negative, or mixed? Mining blogs for opinions. In Proceedings of the 14th Australasian Document Computing Symposium (ADCS 2009), 2009.Google Scholar
Index Terms
- Blog track research at TREC
Recommendations
To Blog or Not to Blog: Characterizing and Predicting Retention in Community Blogs
SocialCom '14: Proceedings of the 2014 International Conference on Social ComputingCommunity blogging is a medium for publishing daily journals, expressing opinions or ideas, and sharing knowledge. Blogging has a high impact on marketing, shaping public opinions, and informing the world about major events from a grassroots point of ...
Blog Trust Model for Blog Readers
ITC '10: Proceedings of the 2010 International Conference on Recent Trends in Information, Telecommunication and ComputingSocial network plays a major role in today’s internet technology. FOAF, community, Blogs etc are the few top level social network today. Weblogs (blogs) are today’s prominent social media on the internet which allows blogger and the user to interact ...
Comments