ABSTRACT
Algorithmic harms are commonly categorized as either allocative or representational. This study specifically addresses the latter, examining current definitions of representational harms to discern what is included and what is not. This analysis motivates our expansion beyond behavioral definitions to encompass harms to cognitive and affective states. The paper outlines high-level requirements for measurement: identifying the necessary expertise to implement this approach and illustrating it through a case study. Our work highlights the unique vulnerabilities of large language models to perpetrating representational harms, particularly when these harms go unmeasured and unmitigated. The work concludes by presenting proposed mitigations and delineating when to employ them. The overarching aim of this research is to establish a framework for broadening the definition of representational harms and to translate insights from fairness research into practical measurement and mitigation praxis.
- Frances E Aboud and Morton J Mendelson. 1996. Determinants of friendship selection and quality: Developmental perspectives. The company they keep: Friendship in childhood and adolescence (1996), 87–112.Google Scholar
- Gerald R Adams, Judy Shea, and Steven A Fitch. 1979. Toward the development of an objective assessment of ego-identity status. Journal of youth and adolescence 8, 2 (1979), 223–237.Google ScholarCross Ref
- Ifeoma Ajunwa, Sorelle Friedler, Carlos E Scheidegger, and Suresh Venkatasubramanian. 2016. Hiring by algorithm: predicting and preventing disparate impact. Available at SSRN (2016).Google ScholarCross Ref
- Naeem Akhtar, Muhammad Nadeem Akhtar, Muhammad Usman, Moazzam Ali, and Umar Iqbal Siddiqi. 2020. COVID-19 restrictions and consumers’ psychological reactance toward offline shopping freedom restoration. The Service Industries Journal 40, 13-14 (2020), 891–913.Google ScholarCross Ref
- Iuliia Alieva. 2023. How American media framed 2016 presidential election using data visualization: The case study of the New York times and the Washington post. Journalism Practice 17, 4 (2023), 814–840.Google ScholarCross Ref
- Mike Ananny and Kate Crawford. 2018. Seeing without knowing: Limitations of the transparency ideal and its application to algorithmic accountability. new media & society 20, 3 (2018), 973–989.Google Scholar
- Nazanin Andalibi, Cassidy Pyle, Kristen Barta, Lu Xian, Abigail Z Jacobs, and Mark S Ackerman. 2023. Conceptualizing Algorithmic Stigmatization. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. 1–18.Google ScholarDigital Library
- Albert Bandura, Dorothea Ross, and Sheila A Ross. 1963. Imitation of film-mediated aggressive models.The Journal of Abnormal and Social Psychology 66, 1 (1963), 3.Google Scholar
- Albert Bandura, C Barr Taylor, S Lloyd Williams, Ivan N Mefford, and Jack D Barchas. 1985. Catecholamine secretion as a function of perceived coping self-efficacy.Journal of consulting and clinical psychology 53, 3 (1985), 406.Google Scholar
- Melissa Bell and Nichole Bayliss. 2015. The Tough Guise: Teaching Violent Masculinity as the Only Way to Be a Man: Tough Guise 2: Violence, Manhood and American Culture. Sex Roles 72 (2015), 566–568.Google Scholar
- Jenn K Bergen, Sharissa Unger Hantke, and Verna St Denis. 2023. Contemporary challenges and approaches in anti-racist teacher education. (2023).Google Scholar
- John W Berry, Joseph E Trimble, and Esteban L Olmedo. 1986. Assessment of acculturation. (1986).Google Scholar
- Federico Bianchi, Pratyusha Kalluri, Esin Durmus, Faisal Ladhak, Myra Cheng, Debora Nozza, Tatsunori Hashimoto, Dan Jurafsky, James Zou, and Aylin Caliskan. 2023. Easily accessible text-to-image generation amplifies demographic stereotypes at large scale. In Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency. 1493–1504.Google ScholarDigital Library
- Daniel Boduszek and Agata Debowska. 2020. Feelings of Inadequacy Scale. In Encyclopedia of Personality and Individual Differences. Springer, 1574–1576.Google Scholar
- Robert Böhm, Hannes Rusch, and Jonathan Baron. 2020. The psychology of intergroup conflict: A review of theories and measures. Journal of Economic Behavior & Organization 178 (2020), 947–962.Google ScholarCross Ref
- Kenneth A Bollen and Rick H Hoyle. 1990. Perceived cohesion: A conceptual and empirical examination. Social forces 69, 2 (1990), 479–504.Google Scholar
- Thomas W Britt, Winny Shen, Robert R Sinclair, Matthew R Grossman, and David M Klieger. 2016. How much do we really know about employee resilience?Industrial and Organizational Psychology 9, 2 (2016), 378–404.Google Scholar
- Samuel C Bullock and Earline Houston. 1987. Perceptions of racism by Black medical students attending White medical schools. Journal of the National Medical Association 79, 6 (1987), 601.Google Scholar
- Antonio Byrd. 2023. Truth-Telling: Critical Inquiries on LLMs and the Corpus Texts That Train Them.Composition Studies 51, 1 (2023), 135–142.Google Scholar
- Albert V Carron, W Neil Widmeyer, and Lawrence R Brawley. 1985. The development of an instrument to assess cohesion in sport teams: The Group Environment Questionnaire. Journal of Sport and Exercise psychology 7, 3 (1985), 244–266.Google ScholarCross Ref
- Charles S Carver. 1997. You want to measure coping but your protocol’too long: Consider the brief cope. International journal of behavioral medicine 4, 1 (1997), 92–100.Google Scholar
- Matthew Chalmers and Ian MacColl. 2003. Seamful and seamless design in ubiquitous computing. In Workshop at the crossroads: The interaction of HCI and systems issues in UbiComp, Vol. 8.Google Scholar
- Myra Cheng, Esin Durmus, and Dan Jurafsky. 2023. Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models. arXiv preprint arXiv:2305.18189 (2023).Google Scholar
- Rodney Clark, Norman B Anderson, Vernessa R Clark, and David R Williams. 1999. Racism as a stressor for African Americans: A biopsychosocial model.American psychologist 54, 10 (1999), 805.Google Scholar
- Kathryn M Connor, Jonathan RT Davidson, L Erik Churchill, Andrew Sherwood, Richard H Weisler, and Edna Foa. 2000. Psychometric properties of the Social Phobia Inventory (SPIN): New self-rating scale. The British Journal of Psychiatry 176, 4 (2000), 379–386.Google ScholarCross Ref
- John N Constantino, Christian P Gruber, 2012. Social responsiveness scale: SRS-2. (2012).Google Scholar
- Anna L Cox, Sandy JJ Gould, Marta E Cecchinato, Ioanna Iacovides, and Ian Renfree. 2016. Design frictions for mindful interactions: The case for microboundaries. In Proceedings of the 2016 CHI conference extended abstracts on human factors in computing systems. 1389–1397.Google ScholarDigital Library
- Laura Craig-Bray and Gerald R Adams. 1986. Different methodologies in the assessment of identity: Congruence between self-report and interview techniques?Journal of Youth and Adolescence 15, 3 (1986), 191–204.Google Scholar
- Isiaah Crawford, Kevin W Allison, Brian D Zamboni, and Tomas Soto. 2002. The influence of dual-identity development on the psychosocial functioning of African-American gay and bisexual men. Journal of Sex Research 39, 3 (2002), 179–189.Google ScholarCross Ref
- Ewart J De Visser, Samuel S Monfort, Ryan McKendrick, Melissa AB Smith, Patrick E McKnight, Frank Krueger, and Raja Parasuraman. 2016. Almost human: Anthropomorphism increases trust resilience in cognitive agents.Journal of Experimental Psychology: Applied 22, 3 (2016), 331.Google Scholar
- Pieter Delobelle and Bettina Berendt. 2022. Fairdistillation: mitigating stereotyping in language models. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 638–654.Google Scholar
- Gregory J Digirolamo and Douglas L Hintzman. 1997. First impressions are lasting impressions: A primacy effect in memory for repetitions. Psychonomic Bulletin & Review 4, 1 (1997), 121–124.Google ScholarCross Ref
- Catherine D’ignazio and Lauren F Klein. 2023. Data feminism. MIT press.Google Scholar
- Pierluigi Diotaiuti, Giuseppe Valente, Stefania Mancone, Angela Grambone, Andrea Chirico, and Fabio Lucidi. 2022. The use of the Decision Regret Scale in non-clinical contexts. Frontiers in Psychology 13 (2022), 945669.Google ScholarCross Ref
- Upol Ehsan and Mark O Riedl. 2021. Explainability pitfalls: Beyond dark patterns in explainable AI. arXiv preprint arXiv:2109.12480 (2021).Google Scholar
- Julia Elad-Strenger, Michal Reifen Tagar, Thomas Kessler, Yossi Hasson, Deborah Shulman, Kea Brahms, and Eran Halperin. 2022. Out of sight, out of mind: The emotional determinant of “harmful inaction” intergroup conflict. Journal of Experimental Social Psychology 101 (2022), 104304.Google ScholarCross Ref
- Dalia Elsouhag, Bengt Arnetz, Hikmet Jamil, Mark A Lumley, Carissa L Broadbridge, and Judy Arnetz. 2015. Factors associated with healthcare utilization among Arab immigrants and Iraqi refugees. Journal of Immigrant and Minority Health 17 (2015), 1305–1312.Google ScholarCross Ref
- Jean Endicott, John Nee, Wilma Harrison, and Richard Blumenthal. 1993. Quality of Life Enjoyment and Satisfaction Questionnaire: a new measure.Psychopharmacology bulletin 29, 2 (1993), 321–326.Google Scholar
- Donna Farland-Smith, Kevin Finson, William J Boone, and Melissa Yale. 2014. An investigation of media influences on elementary students representations of scientists. Journal of Science Teacher Education 25, 3 (2014), 355–366.Google ScholarCross Ref
- Lisa K Fazio, Nadia M Brashier, B Keith Payne, and Elizabeth J Marsh. 2015. Knowledge does not protect against illusory truth.Journal of experimental psychology: general 144, 5 (2015), 993.Google Scholar
- Sumam Fernando. 1984. Racism as a cause of depression. International Journal of Social Psychiatry 30, 1-2 (1984), 41–49.Google ScholarCross Ref
- Vinitha Gadiraju, Shaun Kane, Sunipa Dev, Alex Taylor, Ding Wang, Emily Denton, and Robin Brewer. 2023. " I wouldn’t say offensive but...": Disability-Centered Perspectives on Large Language Models. In Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency. 205–216.Google ScholarDigital Library
- David Gauntlett. 2008. Media, gender and identity: An introduction. Routledge.Google Scholar
- Avijit Ghosh, Matthew Jagielski, and Christo Wilson. 2022. Subverting Fair Image Search with Generative Adversarial Perturbations. In Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency. 637–650.Google ScholarDigital Library
- Amelia Glaese, Nat McAleese, Maja Trębacz, John Aslanides, Vlad Firoiu, Timo Ewalds, Maribeth Rauh, Laura Weidinger, Martin Chadwick, Phoebe Thacker, 2022. Improving alignment of dialogue agents via targeted human judgements. arXiv preprint arXiv:2209.14375 (2022).Google Scholar
- Shelly Grabe and Anjali Dutt. 2015. Counter narratives, the psychology of liberation, and the evolution of a women’s social movement in Nicaragua.Peace and Conflict: Journal of Peace Psychology 21, 1 (2015), 89.Google ScholarCross Ref
- Lana Ruvolo Grasser. 2022. Addressing mental health concerns in refugees and displaced populations: is enough being done?Risk management and healthcare policy (2022), 909–922.Google Scholar
- Sophia Harrison, Eleonora Gualdoni, and Gemma Boleda. 2023. Run Like a Girl! Sports-Related Gender Bias in Language and Vision. arXiv preprint arXiv:2305.14468 (2023).Google Scholar
- Aumyo Hassan and Sarah J Barber. 2021. The effects of repetition frequency on the illusory truth effect. Cognitive research: principles and implications 6, 1 (2021), 1–12.Google Scholar
- Leslie RM Hausmann, Janet Ward Schofield, and Rochelle L Woods. 2007. Sense of belonging as a predictor of intentions to persist among African American and White first-year college students. Research in higher education 48 (2007), 803–839.Google Scholar
- Marc W Heerdink, Gerben A Van Kleef, Astrid C Homan, and Agneta H Fischer. 2013. On the social influence of emotions in groups: interpersonal effects of anger and happiness on conformity versus deviance.Journal of Personality and Social Psychology 105, 2 (2013), 262.Google Scholar
- Anne S Helsdingen, Karel Van den Bosch, Tamara Van Gog, and Jeroen JG van Merriënboer. 2010. The effects of critical thinking instruction on training complex decision making. Human factors 52, 4 (2010), 537–545.Google Scholar
- Ingvild Oxås Henriksen, Ingunn Ranøyen, Marit Sæbø Indredavik, and Frode Stenseng. 2017. The role of self-esteem in the development of psychiatric problems: a three-year prospective study in a clinical sample of adolescents. Child and adolescent psychiatry and mental health 11 (2017), 1–9.Google Scholar
- Fernanda Herrera and Jeremy N Bailenson. 2021. Virtual reality perspective-taking at scale: Effect of avatar representation, choice, and head movement on prosocial behaviors. new media & society 23, 8 (2021), 2189–2209.Google Scholar
- Kimberly Hively and Amani El-Alayli. 2014. “You throw like a girl:” The effect of stereotype threat on women’s athletic performance and gender stereotypes. Psychology of Sport and Exercise 15, 1 (2014), 48–55.Google ScholarCross Ref
- Kelly M Hoffman, Sophie Trawalter, Jordan R Axt, and M Norman Oliver. 2016. Racial bias in pain assessment and treatment recommendations, and false beliefs about biological differences between blacks and whites. Proceedings of the National Academy of Sciences 113, 16 (2016), 4296–4301.Google ScholarCross Ref
- Saghar Hosseini, Hamid Palangi, and Ahmed Hassan Awadallah. 2023. An Empirical Study of Metrics to Measure Representational Harms in Pre-Trained Language Models. arXiv preprint arXiv:2301.09211 (2023).Google Scholar
- Sarah Inman and David Ribes. 2019. " Beautiful Seams" Strategic Revelations and Concealments. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. 1–14.Google ScholarDigital Library
- Oliver Jacobs, Farid Pazhoohi, and Alan Kingstone. 2023. Brief exposure increases mind perception to ChatGPT and is moderated by the individual propensity to anthropomorphize. (2023).Google Scholar
- Justin Jagosh, Ann C Macaulay, Pierre Pluye, JON Salsberg, Paula L Bush, JIM Henderson, Erin Sirett, Geoff Wong, Margaret Cargo, Carol P Herbert, 2012. Uncovering the benefits of participatory research: implications of a realist review for health research and practice. The Milbank Quarterly 90, 2 (2012), 311–346.Google ScholarCross Ref
- Rebecca L Johnson, Giada Pistilli, Natalia Menédez-González, Leslye Denisse Dias Duran, Enrico Panai, Julija Kalpokiene, and Donald Jay Bertulfo. 2022. The Ghost in the Machine has an American accent: value conflict in GPT-3. arXiv preprint arXiv:2203.07785 (2022).Google Scholar
- Suzanne B Johnson and Page L Anderson. 2014. Stereotype confirmation concern and fear of negative evaluation among African Americans and Caucasians with Social Anxiety Disorder. Journal of anxiety disorders 28, 4 (2014), 390–393.Google ScholarCross Ref
- Stephanie T Jones and Natalie Araujo Melo. 2021. We tell these stories to survive: Towards abolition in computer science education. Canadian Journal of Science, Mathematics and Technology Education 21 (2021), 290–308.Google ScholarCross Ref
- PK Kannan, Werner Reinartz, and Peter C Verhoef. 2016. The path to purchase and attribution modeling: Introduction to special section., 449–456 pages.Google Scholar
- Jared Katzman, Angelina Wang, Morgan Scheuerman, Su Lin Blodgett, Kristen Laird, Hanna Wallach, and Solon Barocas. 2023. Taxonomizing and Measuring Representational Harms: A Look at Image Tagging. arXiv preprint arXiv:2305.01776 (2023).Google Scholar
- Ibram X Kendi. 2023. How to be an antiracist. One world.Google Scholar
- Nitasha Tiku Kevin Schaul, Szu Yu Chen. 2023. Inside the secret list of websites that make ai like chatgpt sound smart. https://www.washingtonpost.com/technology/interactive/2023/ai-chatbot-learning/Google Scholar
- Youjeong Kim and S Shyam Sundar. 2012. Anthropomorphism of computers: Is it mindful or mindless?Computers in Human Behavior 28, 1 (2012), 241–250.Google Scholar
- Hadas Kotek, Rikker Dockum, and David Sun. 2023. Gender bias and stereotypes in Large Language Models. In Proceedings of The ACM Collective Intelligence Conference. 12–24.Google ScholarDigital Library
- Kevin T Larkin, Elizabeth M Semenchuk, Nicole L Frazer, Sonia Suchday, and Robert L Taylor. 1998. Cardiovascular and behavioral response to social confrontation: Measuring real-life stress in the laboratory. Annals of Behavioral Medicine 20 (1998), 294–301.Google ScholarCross Ref
- Mark R Leary. 2015. Emotional responses to interpersonal rejection. Dialogues in clinical neuroscience (2015).Google Scholar
- Robert W Lent and Steven D Brown. 2006. On conceptualizing and assessing social cognitive constructs in career research: A measurement guide. Journal of career assessment 14, 1 (2006), 12–35.Google ScholarCross Ref
- Robert W Lent, Steven D Brown, Gail Hackett, 2002. Social cognitive career theory. Career choice and development 4, 1 (2002), 255–311.Google Scholar
- Q Vera Liao and Jennifer Wortman Vaughan. 2023. AI Transparency in the Age of LLMs: A Human-Centered Research Roadmap. arXiv preprint arXiv:2306.01941 (2023).Google Scholar
- Oliver Lindhiem, Isaac T Petersen, Lucas K Mentch, and Eric A Youngstrom. 2020. The importance of calibration in clinical psychology. Assessment 27, 4 (2020), 840–854.Google ScholarCross Ref
- Anne Maass, Daniela Salvi, Luciano Arcuri, and Gün R Semin. 1989. Language use in intergroup contexts: The linguistic intergroup bias.Journal of personality and social psychology 57, 6 (1989), 981.Google Scholar
- Steven C Martino, Deborah M Scharf, Claude M Setodji, and William G Shadel. 2011. Measuring exposure to protobacco marketing and media: a field study using ecological momentary assessment. Nicotine & Tobacco Research 14, 4 (2011), 398–406.Google ScholarCross Ref
- Allison Master, Sapna Cheryan, and Andrew N Meltzoff. 2016. Computing whether she belongs: Stereotypes undermine girls’ interest and sense of belonging in computer science.Journal of educational psychology 108, 3 (2016), 424.Google Scholar
- Rich McCormick. 2016. The NYT’s election forecast needle is stressing people out with fake jitter. https://www.theverge.com/2016/11/8/13571216/new-york-times-election-forecast-jitter-needleGoogle Scholar
- Ebony McGee. 2018. “Black genius, Asian fail”: The detriment of stereotype lift and stereotype threat in high-achieving Asian and Black STEM students. AERA Open 4, 4 (2018), 2332858418816658.Google ScholarCross Ref
- Johanna Christina Neumann, Thomas Berger, and Jan Ilhan Kizilhan. 2021. Development of a questionnaire to measure the perceived injustice of people who have experienced violence in war and conflict areas: Perceived Injustice Questionnaire (PIQ). International journal of environmental research and public health 18, 23 (2021), 12357.Google ScholarCross Ref
- Terrence Neumann, Maria De-Arteaga, and Sina Fazelpour. 2022. Justice in misinformation detection systems: An analysis of algorithms, stakeholders, and potential harms. In Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency. 1504–1515.Google ScholarDigital Library
- Jennifer Jane Newson, Vladyslav Pastukh, and Tara C Thiagarajan. 2021. Poor separation of clinical symptom profiles by DSM-5 disorder criteria. Frontiers in Psychiatry 12 (2021), 775762.Google ScholarCross Ref
- Kin Wai Ng, Frederick Mubang, Lawrence O Hall, John Skvoretz, and Adriana Iamnitchi. 2023. Experimental evaluation of baselines for forecasting social media timeseries. EPJ Data Science 12, 1 (2023), 8.Google ScholarCross Ref
- Thinh On, Subhodeep Ghosh, Mengnan Du, and Senjuti Basu Roy. 2002. Proportionate Diversification of Top-k LLM Results using Database Queries. DEF 2 (2002), 1.Google Scholar
- Bill Ottman, CEO Co-founder, Minds Daryl Davis, Race Reconciliator, Jack Ottman, COO Co-founder, Minds Jesse Morton, Sophia Moskalenko, James Daly, Julian Rapaport, [n. d.]. The Censorship Effect. ([n. d.]).Google Scholar
- Anaelia Ovalle, Palash Goyal, Jwala Dhamala, Zachary Jaggers, Kai-Wei Chang, Aram Galstyan, Richard Zemel, and Rahul Gupta. 2023. “I’m fully who I am”: Towards Centering Transgender and Non-Binary Voices to Measure Biases in Open Language Generation. In Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency. 1246–1266.Google ScholarDigital Library
- Cory Owen. 2015. The" model Minority" Myth and Its Impact on Anxiety and Stress on Asian American Students. Ph. D. Dissertation. University of Houston.Google Scholar
- Jennifer O’Meara. 2016. What “The Bechdel Test” doesn’t tell us: examining women’s verbal and vocal (dis) empowerment in cinema. Feminist media studies 16, 6 (2016), 1120–1123.Google Scholar
- Lauv Patel, Tripti Shukla, Xiuzhen Huang, David W Ussery, and Shanzhi Wang. 2020. Machine learning methods in drug discovery. Molecules 25, 22 (2020), 5277.Google ScholarCross Ref
- Leonard I Pearlin. 1989. The sociological study of stress. Journal of health and social behavior (1989), 241–256.Google Scholar
- Christopher Peterson, Steven F Maier, and Martin EP Seligman. 1993. Learned helplessness: A theory for the age of personal control. Oxford University Press, USA.Google Scholar
- Jean S Phinney. 1992. The multigroup ethnic identity measure: A new scale for use with diverse groups. Journal of adolescent research 7, 2 (1992), 156–176.Google ScholarCross Ref
- Jean S Phinney, Bruce T Lochner, and Rodolfo Murphy. 1990. Ethnic identity development and psychological adjustment in adolescence.Sage Publications, Inc.Google Scholar
- Dimitrios Pnevmatikos, Panagiota Christodoulou, and Nikolaos Fachantidis. 2020. STAKEHOLDERS’INVOLVEMENT IN PARTICIPATORY DESIGN APPROACHES OF LEARNING ENVIRONMENTS: A SYSTEMATIC REVIEW OF THE LITERATURE. EDULEARN20 proceedings (2020), 5543–5552.Google Scholar
- Geoff Potvin, Zahra Hazari, Raina Khatri, Hemeng Cheng, T Blake Head, Robynne M Lock, Anne F Kornahrens, Kathryne Sparks Woodle, Rebecca E Vieyra, Beth A Cunningham, 2023. Examining the effect of counternarratives about physics on women’s physics career intentions. Physical Review Physics Education Research 19, 1 (2023), 010126.Google ScholarCross Ref
- Emily Jane Prince and Julie Hadwin. 2013. The role of a sense of school belonging in understanding the effectiveness of inclusion of children with special educational needs. International Journal of Inclusive Education 17, 3 (2013), 238–262.Google ScholarCross Ref
- Kurtis Pykes. 2023. Promoting responsible AI: Content moderation in chatgpt. https://www.datacamp.com/blog/promoting-responsible-ai-content-moderation-in-chatgptGoogle Scholar
- Rida Qadri, Renee Shelby, Cynthia L Bennett, and Emily Denton. 2023. AI’s Regimes of Representation: A Community-centered Study of Text-to-Image Models in South Asia. In Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency. 506–517.Google ScholarDigital Library
- M Afzalur Rahim. 1983. Rahim Organizational Conflict Inventory–II. Journal of Applied Psychology (1983).Google Scholar
- Charvi Rastogi, Marco Tulio Ribeiro, Nicholas King, Harsha Nori, and Saleema Amershi. 2023. Supporting human-ai collaboration in auditing llms with llms. In Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society. 913–926.Google ScholarDigital Library
- P Neal Ritchey and Harold D Fishbein. 2001. The lack of an association between adolescent friends’ prejudices and stereotypes. Merrill-Palmer Quarterly (1982-) (2001), 188–206.Google Scholar
- Michael Ritsner, Ilan Modai, and Alexander Ponizovsky. 2002. Assessing psychological distress in psychiatric patients: validation of the Talbieh Brief Distress Inventory. Comprehensive Psychiatry 43, 3 (2002), 229–234.Google ScholarCross Ref
- Michael Ritsner, Jonathan Rabinowitz, and Michael Slyuzberg. 1995. The Talbieh Brief Distress Inventory: a brief instrument to measure psychological distress among immigrants. Comprehensive Psychiatry 36, 6 (1995), 448–453.Google ScholarCross Ref
- Steven O Roberts, Carmelle Bareket-Shavit, Forrest A Dollins, Peter D Goldie, and Elizabeth Mortenson. 2020. Racial inequality in psychological research: Trends of the past and recommendations for the future. Perspectives on psychological science 15, 6 (2020), 1295–1309.Google Scholar
- Steven G Rogelberg, Gwenith G Fisher, Douglas C Maynard, Milton D Hakel, and Michael Horvath. 2001. Attitudes toward surveys: Development of a measure and its relationship to respondent behavior. Organizational Research Methods 4, 1 (2001), 3–25.Google ScholarCross Ref
- Georg Schomerus, Holger Muehlan, Charlotte Auer, Philip Horsfield, Samuel Tomczyk, Simone Freitag, Sara Evans-Lacko, Silke Schmidt, and Susanne Stolzenburg. 2019. Validity and psychometric properties of the self-identification as having a mental illness scale (SELF-I) among currently untreated persons with mental health problems. Psychiatry Research 273 (2019), 303–308.Google ScholarCross Ref
- M Seligman. 1975. Helplessness: On Depression, Development, and Death.” an Francisco.Google Scholar
- Gün R Semin and Klaus Fiedler. 1988. The cognitive functions of linguistic categories in describing persons: Social cognition and language.Journal of personality and Social Psychology 54, 4 (1988), 558.Google Scholar
- Mohammad Ahmad Sheikh, Amit Kumar Goel, and Tapas Kumar. 2020. An approach for prediction of loan approval using machine learning algorithm. In 2020 International Conference on Electronics and Sustainable Communication Systems (ICESC). IEEE, 490–494.Google ScholarCross Ref
- Renee Shelby, Shalaleh Rismani, Kathryn Henne, AJung Moon, Negar Rostamzadeh, Paul Nicholas, N’Mah Yilla-Akbari, Jess Gallegos, Andrew Smart, Emilio Garcia, 2023. Sociotechnical Harms of Algorithmic Systems: Scoping a Taxonomy for Harm Reduction. In Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society. 723–741.Google ScholarDigital Library
- Tom Simonite. 2021. Ai and the list of dirty, naughty, obscene, and otherwise bad words. https://www.wired.com/story/ai-list-dirty-naughty-obscene-bad-words/Google Scholar
- Vijai P Singh. 1977. Some theoretical and methodological problems in the study of ethnic identity: a cross-cultural perspective. Annals of the New York Academy of Sciences 285, 1 (1977), 32–45.Google ScholarCross Ref
- Michael D Slater. 2007. Reinforcing spirals: The mutual influence of media selectivity and media effects and their impact on individual behavior and social identity. Communication theory 17, 3 (2007), 281–303.Google Scholar
- Irene Solaiman, Zeerak Talat, William Agnew, Lama Ahmad, Dylan Baker, Su Lin Blodgett, Hal Daumé III, Jesse Dodge, Ellie Evans, Sara Hooker, 2023. Evaluating the Social Impact of Generative AI Systems in Systems and Society. arXiv preprint arXiv:2306.05949 (2023).Google Scholar
- Nicolas Spatola, Clément Belletier, Pierre Chausse, Maria Augustinova, Alice Normand, Vincent Barra, Ludovic Ferrand, and Pascal Huguet. 2019. Improved cognitive control in presence of anthropomorphized robots. International Journal of Social Robotics 11 (2019), 463–476.Google ScholarCross Ref
- Russell Spears, Bertjan Doosje, and Naomi Ellemers. 1997. Self-stereotyping in the face of threats to group status and distinctiveness: The role of group identification. Personality and social psychology bulletin 23, 5 (1997), 538–553.Google Scholar
- Elizabeth Stephens. 2023. The mechanical Turk: A short history of ‘artificial artificial intelligence’. Cultural Studies 37, 1 (2023), 65–87.Google ScholarCross Ref
- Harini Suresh and John Guttag. 2021. A framework for understanding sources of harm throughout the machine learning life cycle. In Equity and access in algorithms, mechanisms, and optimization. 1–9.Google Scholar
- Clarice Tang, Liz Thyer, Rosalind Bye, Belinda Kenny, Nikki Tulliani, Nicole Peel, Rebecca Gordon, Stefania Penkala, Caterina Tannous, Yu-Ting Sun, 2023. Impact of online learning on sense of belonging among first year clinical health students during COVID-19: student and academic perspectives. BMC Medical Education 23, 1 (2023), 100.Google ScholarCross Ref
- Cristina Teresa-Morales, Margarita Rodríguez-Pérez, Miriam Araujo-Hernández, and Carmen Feria-Ramírez. 2022. Current stereotypes associated with nursing and nursing professionals: An integrative review. International journal of environmental research and public health 19, 13 (2022), 7640.Google ScholarCross Ref
- Shannon Vallor. 2022. The AI Mirror: Reclaiming our Humanity in an Age of Machine Thinking. In Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society. 6–6.Google ScholarDigital Library
- Angelina Wang, Solon Barocas, Kristen Laird, and Hanna Wallach. 2022. Measuring representational harms in image captioning. In Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency. 324–335.Google ScholarDigital Library
- Hanchen Wang, Tianfan Fu, Yuanqi Du, Wenhao Gao, Kexin Huang, Ziming Liu, Payal Chandak, Shengchao Liu, Peter Van Katwyk, Andreea Deac, 2023. Scientific discovery in the age of artificial intelligence. Nature 620, 7972 (2023), 47–60.Google Scholar
- John Broadus Watson. 1919. Psychology: From the standpoint of a behaviorist. JB Lippincott.Google ScholarCross Ref
- Adam Waytz, Nicholas Epley, and John T Cacioppo. 2010. Social cognition unbound: Insights into anthropomorphism and dehumanization. Current Directions in Psychological Science 19, 1 (2010), 58–62.Google ScholarCross Ref
- Laura Weidinger, John Mellor, Maribeth Rauh, Conor Griffin, Jonathan Uesato, Po-Sen Huang, Myra Cheng, Mia Glaese, Borja Balle, Atoosa Kasirzadeh, 2021. Ethical and social risks of harm from language models. arXiv preprint arXiv:2112.04359 (2021).Google Scholar
- Laura Weidinger, Jonathan Uesato, Maribeth Rauh, Conor Griffin, Po-Sen Huang, John Mellor, Amelia Glaese, Myra Cheng, Borja Balle, Atoosa Kasirzadeh, 2022. Taxonomy of risks posed by language models. In Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency. 214–229.Google ScholarDigital Library
- Marc Weiser. 1994. The world is not a desktop. interactions 1, 1 (1994), 7–8.Google Scholar
- Thomas A Widiger and Cristina Crego. 2019. The Five Factor Model of personality structure: an update. World Psychiatry 18, 3 (2019), 271.Google ScholarCross Ref
- GC Williams, ZR Freedman, EL Deci, and J Leone. 1998. Perceived competence scales. Diabetes Care 21, 10 (1998), 1644–1651.Google ScholarCross Ref
- Katharina Wolff, Svein Larsen, and Torvald Øgaard. 2019. How to define and measure risk perceptions. Annals of Tourism Research 79 (2019), 102759.Google ScholarCross Ref
- Frieda Wong and Richard Halgin. 2006. The “model minority”: Bane or blessing for Asian Americans?Journal of Multicultural Counseling and Development 34, 1 (2006), 38–49.Google Scholar
- Haolun Wu, Bhaskar Mitra, Chen Ma, Fernando Diaz, and Xue Liu. 2022. Joint multisided exposure fairness for recommendation. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. 703–714.Google ScholarDigital Library
- Kyra Yee, Uthaipon Tantipongpipat, and Shubhanshu Mishra. 2021. Image cropping on twitter: Fairness metrics, their limitations, and the importance of representation, design, and agency. Proceedings of the ACM on Human-Computer Interaction 5, CSCW2 (2021), 1–24.Google ScholarDigital Library
- Yue Yu, Yuchen Zhuang, Jieyu Zhang, Yu Meng, Alexander Ratner, Ranjay Krishna, Jiaming Shen, and Chao Zhang. 2023. Large language model as attributed training data generator: A tale of diversity and bias. arXiv preprint arXiv:2306.15895 (2023).Google Scholar
- Kaitlyn Zhou, Kawin Ethayarajh, and Dan Jurafsky. 2022. Richer countries and richer representations. arXiv preprint arXiv:2205.05093 (2022).Google Scholar
Index Terms
- Beyond Behaviorist Representational Harms: A Plan for Measurement and Mitigation
Recommendations
Measuring Representational Harms in Image Captioning
FAccT '22: Proceedings of the 2022 ACM Conference on Fairness, Accountability, and TransparencyPrevious work has largely considered the fairness of image captioning systems through the underspecified lens of “bias.” In contrast, we present a set of techniques for measuring five types of representational harms, as well as the resulting ...
Taxonomizing and measuring representational harms: a look at image tagging
AAAI'23/IAAI'23/EAAI'23: Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence and Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence and Thirteenth Symposium on Educational Advances in Artificial IntelligenceIn this paper, we examine computational approaches for measuring the "fairness" of image tagging systems, finding that they cluster into five distinct categories, each with its own analytic foundation. We also identify a range of normative concerns that ...
Beyond the skin bag: on the moral responsibility of extended agencies
The growing prominence of computers in contemporary life, often seemingly with minds of their own, invites rethinking the question of moral responsibility. If the moral responsibility for an act lies with the subject that carried it out, it follows that ...
Comments