ABSTRACT
News coverage profoundly affects how countries and individuals behave in international relations. Yet, we have little empirical evidence of how news coverage varies across countries. To enable studies of global news coverage, we develop an efficient computational methodology that comprises three components: (i) a transformer model to estimate multilingual news similarity; (ii) a global event identification system that clusters news based on a similarity network of news articles; and (iii) measures of news synchrony across countries and news diversity within a country, based on country-specific distributions of news coverage of the global events. Each component achieves state-of-the art performance, scaling seamlessly to massive datasets of millions of news articles.
We apply the methodology to 60 million news articles published globally between January 1 and June 30, 2020, across 124 countries and 10 languages, detecting 4357 news events. We identify the factors explaining diversity and synchrony of news coverage across countries. Our study reveals that news media tend to cover a more diverse set of events in countries with larger Internet penetration, more official languages, larger religious diversity, higher economic inequality, and larger populations. Coverage of news events is more synchronized between countries that not only actively participate in commercial and political relations---such as, pairs of countries with high bilateral trade volume, and countries that belong to the NATO military alliance or BRICS group of major emerging economies---but also countries that share certain traits: an official language, high GDP, and high democracy indices.
Supplemental Material
- Victoria D Alexander, Grant Blank, and Scott A Hale. 2018. Digital traces of distinction? Popular orientation and user-engagement with status hierarchies in TripAdvisor reviews of cultural organizations. New Media & Society 20, 11 (2018), 4218--4236. https://doi.org/10.1177/1461444818769448Google ScholarCross Ref
- Matthew A Baum and Yuri M Zhukov. 2015. Filtering revolution: Reporting bias in international newspaper coverage of the Libyan civil war. Journal of Peace Research 52, 3 (2015), 384--400. https://doi.org/10.1177/0022343314554791 arXiv:https://doi.org/10.1177/0022343314554791Google ScholarCross Ref
- Frank R Baumgartner and Bryan D Jones. 2010. Agendas and instability in American politics. University of Chicago Press.Google Scholar
- Hila Becker, Mor Naaman, and Luis Gravano. 2011. Beyond Trending Topics: Real-World Event Identification on Twitter. ICWSM 5, 1 (2011), 438--441.Google ScholarCross Ref
- Kathleen Beckers, Andrea Masini, Julie Sevenans, Miriam van der Burg, Julie De Smedt, Hilde Van den Bulck, and Stefaan Walgrave. 2019. Are newspapers' news stories becoming more alike? Media content diversity in Belgium, 1983-- 2013. Journalism 20, 12 (Dec. 2019), 1665--1683.Google ScholarCross Ref
- Pablo J Boczkowski and Martin de Santos. 2007. When More Media Equals Less News: Patterns of Content Homogenization in Argentina's Leading Print and Online Newspapers. Political Communication 24, 2 (May 2007), 167--180.Google ScholarCross Ref
- Hajo G Boomgaarden, Rens Vliegenthart, Claes H De Vreese, and Andreas RT Schuck. 2010. News on the move: Exogenous events and news coverage of the European Union. Journal of European Public Policy 17, 4 (2010), 506--526.Google ScholarCross Ref
- Pierre Bourdieu. 1993. The Field of Cultural Production. Columbia University Press.Google Scholar
- Janez Brank, Gregor Leban, and Marko Grobelnik. 2017. Annotating documents with relevant wikipedia concepts. Proceedings of SiKDD 472 (2017).Google Scholar
- Thorsten Brants, Francine Chen, and Ayman Farahat. 2003. A system for new event detection. In Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval. 330--337.Google ScholarDigital Library
- Erik P Bucy, Walter Gantz, and Zheng Wang. 2014. Media technology and the 24-hour news cycle. In Communication technology and social change. Routledge, 143--163.Google Scholar
- Dallas Card, Amber Boydstun, Justin H Gross, Philip Resnik, and Noah A Smith. 2015. The media frames corpus: Annotations of frames across issues. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers). 438--444.Google ScholarCross Ref
- Jonathan Chang, Sean Gerrish, Chong Wang, Jordan Boyd-Graber, and David Blei. 2009. Reading tea leaves: How humans interpret topic models. Advances in neural information processing systems 22 (2009).Google Scholar
- Xi Chen, Ali Zeynali, Chico Camargo, Fabian Flöck, Devin Ga?ney, Przemyslaw Grabowicz, Scott Hale, David Jurgens, and Mattia Samory. 2022. SemEval-2022 Task 8: Multilingual news article similarity. In SemEval-2022. 1094--1106.Google Scholar
- Xi Chen, Ali Zeynali, Chico Camargo, Fabian Flöck, Devin Ga?ney, Przemyslaw Grabowicz, Scott Hale, David Jurgens, and Mattia Samory. 2022. SemEval-2022 Task 8: Multilingual news article similarity. In SemEval-2022. Association for Computational Linguistics, Seattle, United States, 1094--1106. https://doi.org/10. 18653/v1/2022.semeval-1.155Google Scholar
- Jihyang Choi. 2009. Diversity in Foreign News in US Newspapers Before and After the Invasion of Iraq. International Communication Gazette 71, 6 (Oct. 2009), 525--542.Google ScholarCross Ref
- Bernard C. Cohen. 1963. Press and Foreign Policy. Princeton University Press. http://www.jstor.org/stable/j.ctt183q0fpGoogle Scholar
- Alexis Conneau, Guillaume Lample, Marc'Aurelio Ranzato, Ludovic Denoyer, and Hervé Jégou. 2017. Word Translation Without Parallel Data. arXiv preprint arXiv:1710.04087 (2017).Google Scholar
- John David Dupree. 1971. International communication: View from'a window on the world'. Gazette (Leiden, Netherlands) 17, 4 (1971), 224--235.Google Scholar
- Thomas Eisensee and David Strömberg. 2007. News droughts, news "oods, and US disaster relief. The Quarterly Journal of Economics 122, 2 (2007), 693--728.Google ScholarCross Ref
- Chong Feng, Muzammil Khan, Arif Ur Rahman, and Arshad Ahmad. 2020. News Recommendation Systems - Accomplishments, Challenges & Future Directions. IEEE Access 8 (2020), 16702--16725.Google ScholarCross Ref
- Fangxiaoyu Feng, Yinfei Yang, Daniel Cer, Naveen Arivazhagan, and Wei Wang. 2020. Language-agnostic BERT sentence embedding. arXiv preprint arXiv:2007.01852 (2020).Google Scholar
- Fangxiaoyu Feng, Yinfei Yang, Daniel Cer, Naveen Arivazhagan, and Wei Wang. 2022. Language-agnostic BERT Sentence Embedding. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Dublin, Ireland, 878--891. https://doi.org/10.18653/v1/2022.acl-long.62Google ScholarCross Ref
- Bent Fuglede and Flemming Topsoe. 2004. Jensen-Shannon divergence and Hilbert space embedding. In International Symposium on Information Theory, 2004. ISIT 2004. Proceedings. IEEE, 31.Google ScholarCross Ref
- Johan Galtung and Mari Holmboe Ruge. 1965. The structure of foreign news: The presentation of the Congo, Cuba and Cyprus crises in four Norwegian newspapers. Journal of peace research 2, 1 (1965), 64--90.Google ScholarCross Ref
- George Gerbner and George Marvanyi. 1977. The many worlds of the world's press. Journal of communication 27, 1 (1977), 52--66.Google ScholarCross Ref
- Guy J Golan and Itai Himelboim. 2016. Can World System Theory predict news ?ow on twitter? The case of government-sponsored broadcasting. Information, Communication & Society 19, 8 (2016), 1150--1170.Google ScholarCross Ref
- Przemyslaw A. Grabowicz, José J. Ramasco, Esteban Moro, Josep M. Pujol, and Victor M. Eguiluz. 2012. Social Features of Online Networks: The Strength of Intermediary Ties in Online Social Media. PLoS ONE 7, 1 (Jan. 2012), e29358. https://doi.org/10.1371/journal.pone.0029358Google ScholarCross Ref
- Claude Grasland. 2020. International news "ow theory revisited through a space-- time interaction model: Application to a sample of 320,000 international news stories published through RSS "ows by 31 daily newspapers in 2015. International Communication Gazette 82, 3 (2020), 231--259.Google ScholarCross Ref
- Chris Greer. 2003. Sex Crime and the Media: Sex O?ending and the Press in a Divided Society. Willan. https://doi.org/10.4324/9781843924869Google ScholarCross Ref
- Lei Guo and Chris J Vargo. 2017. Global intermedia agenda setting: A big data analysis of international news ?ow. Journal of Communication 67, 4 (2017), 499--520.Google ScholarCross Ref
- Lei Guo and Chris J Vargo. 2020. Predictors of international news ?ow: Exploring a networked global media system. Journal of Broadcasting & Electronic Media 64, 3 (2020), 418--437.Google ScholarCross Ref
- Ilya Gusev, Moscow Institute of Physics and Technology, Ivan Smurov, and ABBYY. 2021. Russian News Clustering and Headline Selection Shared Task.Google Scholar
- Albert L Hester. 1973. Theoretical considerations in predicting volume and direction of international information ?ow. Gazette (Leiden, Netherlands) 19, 4 (1973), 239--247.Google Scholar
- Lifu Huang, Taylor Cassidy, Xiaocheng Feng, Heng Ji, Clare Voss, Jiawei Han, and Avirup Sil. 2016. Liberal event extraction and event schema induction. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 258--268.Google ScholarCross Ref
- Sallie Hughes and Paola Prado. 2011. Media diversity and social inequality in Latin America. The great gap: Inequality and the politics of redistribution in Latin America (2011), 109--146.Google Scholar
- A Severin Jansen, Beatrice Eugster, Michaela Maier, and Silke Adam. 2019. Who drives the agenda: Media or parties? A seven-country comparison in the run-up to the 2014 European Parliament elections. The International Journal of Press/Politics 24, 1 (2019), 7--26.Google Scholar
- Journalism.org. 2016. The state of the news media. https://assets.pewresearch. org/wp-content/uploads/sites/13/2016/06/30143308/state-of-the-news-mediareport- 2016-?nal.pdf. Accessed: 2023--12--1.Google Scholar
- Herbert G Kariel and Lynn A Rosenvall. 1984. Factors in?uencing international news ?ow. Journalism Quarterly 61, 3 (1984), 509--666.Google ScholarCross Ref
- Kyungmo Kim and George A Barnett. 1996. The determinants of international news ?ow: A network analysis. Communication Research 23, 3 (1996), 323--352.Google ScholarCross Ref
- Eric Klinenberg. 2005. Convergence: News Production in a Digital Age. Ann. Am. Acad. Pol. Soc. Sci. 597, 1 (Jan. 2005), 48--64.Google Scholar
- Giridhar Kumaran and James Allan. 2004. Text classi?cation and named entities for new event detection. In Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval. 297--304.Google ScholarDigital Library
- Wai Lam, HML Meng, KL Wong, and JCH Yen. 2001. Using contextual analysis for news event detection. International Journal of Intelligent Systems 16, 4 (2001), 525--546.Google ScholarCross Ref
- Andrea Lancichinetti, Filippo Radicchi, José J Ramasco, and Santo Fortunato. 2011. Finding statistically signi?cant communities in networks. PloS one 6, 4 (2011), e18961.Google ScholarCross Ref
- Angela M Lee. 2013. News audiences revisited: Theorizing the link between audience motivations and news consumption. Journal of Broadcasting & Electronic Media 57, 3 (2013), 300--317.Google ScholarCross Ref
- Suman Lee. 2007. International public relations as a predictor of prominence of US news coverage. Public Relations Review 33, 2 (2007), 158--165.Google ScholarCross Ref
- Kalev Leetaru and Philip A Schrodt. 2013. Gdelt: Global data on events, location, and tone, 1979--2012. In ISA annual convention, Vol. 2. Citeseer, 1--49.Google Scholar
- Jure Leskovec, Lars Backstrom, and Jon Kleinberg. 2009. Meme-tracking and the dynamics of the news cycle. In KDD (Paris, France) (KDD '09). 497--506.Google Scholar
- Zhiwei Li, Bin Wang, Mingjing Li, and Wei-Ying Ma. 2005. A probabilistic model for retrospective news event detection. In Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval. 106--113.Google ScholarDigital Library
- Benjamin Litterer, David Jurgens, and Dallas Card. 2023. When it Rains, it Pours: Modeling Media Storms and the News Ecosystem. In Findings of the Association for Computational Linguistics: EMNLP 2023. 6346--6361.Google ScholarCross Ref
- Maxwell McCombs and Amy Reynolds. 2002. News in?uence on our pictures of the world. In Media e?ects. Routledge, 11--28.Google Scholar
- Maxwell McCombs and Sebastian Valenzuela. 2021. Setting the Agenda: Mass Media and Public Opinion. Wiley.Google Scholar
- Maxwell E McCombs and Donald L Shaw. 1972. The agenda-setting function of mass media. Public opinion quarterly 36, 2 (1972), 176--187.Google Scholar
- Maxwell E McCombs and Donald L Shaw. 1993. The evolution of agenda-setting research: Twenty-?ve years in the marketplace of ideas. Journal of communication 43, 2 (1993), 58--67.Google ScholarCross Ref
- Shannon C McGregor. 2019. Social media as public opinion: How journalists use social media to represent public opinion. Journalism 20, 8 (2019), 1070--1086.Google ScholarCross Ref
- T Nicholls. 2019. Detecting textual reuse in news stories, at scale. Int. J. Commun. Syst. 13, 2019 (2019).Google Scholar
- Tony Nnaemeka and Jim Richstad. 1981. Internal Controls and Foreign News Coverage: Pacific Press Systems. Communication Research 8, 1 (1981), 97--135.Google ScholarCross Ref
- Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd. 1999. The PageRank citation ranking: Bringing order to the web. Technical Report. Stanford InfoLab.Google Scholar
- Steve Paulussen and Peter Van Aelst. 2021. News values in audience-oriented journalism: Criteria, angles, and cues of Newsworthiness in the (Digital) media context. News values from an audience perspective (2021), 37--55.Google Scholar
- Horst Po¨ ttker. 2003. News and its communicative quality: the inverted pyramid- when and why did it appear? Journalism Studies 4, 4 (2003), 501--511.Google ScholarCross Ref
- Marko Pranjic, Vid Podpecan, Marko Robnik-?ikonja, and Senja Pollak. 2020. Evaluation of related news recommendations using document similarity methods. In Proceedings of the Conference on Language Technologies and Digital Humanities, Ljubljana. nl.ijs.si, 81--86.Google Scholar
- Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. In EMNLP. Association for Computational Linguistics. https://arxiv.org/abs/1908.10084Google Scholar
- Hal Roberts, Rahul Bhargava, Linas Valiukas, Dennis Jen, Momin M Malik, Cindy Sherman Bishop, Emily B Ndulue, Aashka Dave, Justin Clark, Bruce Etling, et al. 2021. Media cloud: Massive open source collection of global news on the open web. ICWSM (2021).Google Scholar
- Gertrude Joch Robinson and Vernone M Sparkes. 1976. International news in the Canadian and American press: A comparative news flow study. Gazette (Leiden, Netherlands) 22, 4 (1976), 203--218.Google Scholar
- Ron Scollon. 2000. Generic variability in news stories in Chinese and English: A contrastive discourse study of ?ve days' newspapers. J. Pragmat. 32, 6 (May 2000), 761--791.Google ScholarCross Ref
- Elad Segev. 2015. Visible and invisible countries: News flow theory revised. Journalism 16, 3 (2015), 412--428.Google ScholarCross Ref
- Elad Segev. 2016. The group-sphere model of international news flow: A crossnational comparison of news sites. International Communication Gazette 78, 3 (2016), 200--222.Google ScholarCross Ref
- Elad Segev and Menahem Blondheim. 2013. America's global standing according to popular news sites from around the world. Political Communication 30, 1 (2013), 139--161.Google ScholarCross Ref
- Andrew K. Semmel. 1977. The elite press, the global system, and foreign news attention. International Interactions 3, 4 (1977), 317--328. https://doi.org/10.1080/ 03050627708434471Google ScholarCross Ref
- M Ángeles Serrano, Marián Boguná, and Alessandro Vespignani. 2009. Extracting the multiscale backbone of complex weighted networks. PNAS 106, 16 (2009), 6483--6488.Google ScholarCross Ref
- Claude Elwood Shannon. 1948. A mathematical theory of communication. The Bell system technical journal 27, 3 (1948), 379--423.Google Scholar
- Pamela J Shoemaker and Stephen D Reese. 1996. Mediating the Message: Theories of Influences on Mass Media Content. Longman.Google Scholar
- Pamela J Shoemaker and Timothy Vos. 2009. Gatekeeping theory. Routledge.Google Scholar
- Iknoor Singh, Yue Li, Melissa Thong, and Carolina Scarton. 2022. GateNLP-UShef at SemEval-2022 Task 8: Entity-Enriched Siamese Transformer for Multilingual News Article Similarity. In SemEval-2022. Association for Computational Linguistics, Seattle, United States, 1121--1128. https://doi.org/10.18653/v1/2022.semeval- 1.158Google ScholarCross Ref
- Kaitao Song, Xu Tan, Tao Qin, Jianfeng Lu, and Tie-Yan Liu. 2020. MPNet: Masked and Permuted Pre-training for Language Understanding. ArXiv abs/2004.09297 (2020).Google Scholar
- Elizabeth A Thomson, Peter RR White, and Philip Kitley. 2008. "Objectivity" and "hard news" reporting across cultures: Comparing the news report in English, French, Japanese and Indonesian journalism. Journalism studies 9, 2 (2008), 212--228.Google ScholarCross Ref
- A S Vatolin, SberBank / Moscow, Russia, E Y Smirnova, S S Shkarin, and SberBank / Moscow, Russia. 2021. Russian News Similarity Detection with SBERT: pretraining and "ne-tuning. Russian State University for the Humanities.Google Scholar
- Sanne Vrijenhoek, Mesut Kaya, Nadia Metoui, Judith Möller, Daan Odijk, and Natali Helberger. 2021. Recommenders with a Mission: Assessing Diversity in News Recommendations. In CHIIR (Canberra ACT, Australia) (CHIIR '21). 173--183.Google Scholar
- Michael D Ward, Andreas Beger, Josh Cutler, Matthew Dickenson, Cassy Dor?, and Ben Radford. 2013. Comparing GDELT and ICEWS event data. Analysis 21, 1 (2013), 267--297.Google Scholar
- GabrielWeimann and Hans-Bernd Brosius. 2017. Redirecting the agenda: Agendasetting in the online Era. The Agenda Setting Journal 1, 1 (2017), 63--102.Google ScholarCross Ref
- David Wilkinson and Mike Thelwall. 2012. Trending Twitter topics in English: An international comparison. Journal of the American Society for Information Science and Technology 63, 8 (2012), 1631--1646. https://doi.org/10.1002/asi.22713 arXiv:https://onlinelibrary.wiley.com/doi/pdf/10.1002/asi.22713Google ScholarDigital Library
- Haoming Denis Wu. 1998. The systemic determinants of international news coverage. The University of North Carolina at Chapel Hill.Google Scholar
- H Denis Wu. 2000. Systemic determinants of international news coverage: A comparison of 38 countries. Journal of communication 50, 2 (2000), 110--130.Google ScholarCross Ref
- H Denis Wu. 2003. Homogeneity around the world? Comparing the systemic determinants of international news ?ow between developed and developing countries. Gazette (Leiden, Netherlands) 65, 1 (2003), 9--24.Google Scholar
- H Denis Wu. 2007. A brave new world for international news? Exploring the determinants of the coverage of foreign news on US websites. International Communication Gazette 69, 6 (2007), 539--551.Google ScholarCross Ref
- Kejing Xiao, Zhaopeng Qian, and Biao Qin. 2021. A graphical decomposition and similarity measurement approach for topic detection from online news. Inf. Sci. 570 (Sept. 2021), 262--277.Google Scholar
- Guixian Xu, Yueting Meng, Zhan Chen, Xiaoyu Qiu, ChangzhiWang, and Haishen Yao. 2019. Research on Topic Detection and Tracking for Online News Texts. IEEE Access 7 (2019), 58407--58418.Google ScholarCross Ref
- Zihang Xu, Ziqing Yang, Yiming Cui, and Zhigang Chen. 2022. HFL at SemEval- 2022 Task 8: A Linguistics-inspired Regression Model with Data Augmentation for Multilingual News Similarity. In SemEval-2022. Association for Computational Linguistics, Seattle, United States, 1114--1120. https://doi.org/10.18653/v1/2022. semeval-1.157Google ScholarCross Ref
- Yinfei Yang, Daniel Cer, Amin Ahmad, Mandy Guo, Jax Law, Noah Constant, Gustavo Hernandez Abrego, Steve Yuan, Chris Tar, Yun-hsuan Sung, Brian Strope, and Ray Kurzweil. 2020. Multilingual Universal Sentence Encoder for Semantic Retrieval. In ACL: System Demonstrations. Association for Computational Linguistics, Online, 87--94. https://doi.org/10.18653/v1/2020.acl-demos.12Google ScholarCross Ref
- Yiming Yang, Jian Zhang, Jaime Carbonell, and Chun Jin. 2002. Topic-conditioned novelty detection. In Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining. 688--693.Google ScholarDigital Library
- Veysel Yesilbas, Jose J Padilla, and Erika Frydenlund. 2021. An analysis of global news coverage of refugees using a big data Approach. In SBP-BRiMS 2021, Proceedings 14. Springer, 111--120.Google Scholar
- Ethan Zuckerman. 2003. Global Attention Pro?les-A working paper: First steps towards a quantitative approach to the study of media attention. Berkman Center Research Publication 2003-06 (2003).Google Scholar
- Ethan Zuckerman. 2013. Rewire: Digital cosmopolitans in the age of connection. WW Norton & Company.Google Scholar
Index Terms
- Global News Synchrony and Diversity During the Start of the COVID-19 Pandemic
Recommendations
Covid-19 Economic Vulnerability Index: EU Evidence
AbstractThe COVID-19 pandemic outbreak caused many negative effects on both the global and national economies. To implement effective policies to mitigate the negative impact of a pandemic, it is necessary to identify particularly vulnerable areas. The ...
Are Africa on the Right Track to Prevent COVID-19? Reflections from African Business Magazine's Coverage of the COVID-19 Pandemic
ICMHI '22: Proceedings of the 6th International Conference on Medical and Health InformaticsAfrican Business Magazine is one of market leaders in providing country supplements, industry reports and market intelligence on Africa. African Business was first published in January 1982. Its headquarters are in London. The monthly magazine covers ...
Measuring the Resilience to the Covid-19 Pandemic of Eurozone Economies with Their 2050 Forecasts
AbstractThis paper measures the resilience of Eurozone economies following the economic shock of the Covid-19 pandemic that hit the global economy. Q2 2022 to Q4 2050 real GDP forecasts of 17 countries of the Eurozone are generated with wavelet analysis ...
Comments