review-article

Blog track research at TREC

Published: 18 August 2010

Abstract

The TREC Blog track aims to explore information-seeking behaviour in the blogosphere by building reusable test collections for blog-related search tasks. Since its advent in TREC 2006, the Blog track has led to much research in this growing field and has fostered cross-pollination with natural language processing research. This paper recaps the tasks addressed at the TREC Blog track thus far, covering the period 2006-2009. In particular, we describe the corpora used, the tasks addressed within the track, and the resulting published research.



    • Published in

      ACM SIGIR Forum  Volume 44, Issue 1
      June 2010
      88 pages
      ISSN: 0163-5840
      DOI: 10.1145/1842890

      Copyright © 2010 Authors

      Publisher

      Association for Computing Machinery

      New York, NY, United States
