Beyond Behaviorist Representational Harms: A Plan for Measurement and Mitigation

Authors:
Jennifer Chien

University of California San Diego, UCSD, United States of America

0009-0009-8768-1761

David Danks

University of California San Diego, UCSD, USA

0000-0003-4541-5966

FAccT '24: Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, June 2024, Pages 933–946. https://doi.org/10.1145/3630106.3658946

Published: 05 June 2024

ABSTRACT

Algorithmic harms are commonly categorized as either allocative or representational. This study specifically addresses the latter, examining current definitions of representational harms to discern what is included and what is not. This analysis motivates our expansion beyond behavioral definitions to encompass harms to cognitive and affective states. The paper outlines high-level requirements for measurement: identifying the necessary expertise to implement this approach and illustrating it through a case study. Our work highlights the unique vulnerabilities of large language models to perpetrating representational harms, particularly when these harms go unmeasured and unmitigated. The work concludes by presenting proposed mitigations and delineating when to employ them. The overarching aim of this research is to establish a framework for broadening the definition of representational harms and to translate insights from fairness research into practical measurement and mitigation praxis.

Published in

FAccT '24: Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency
June 2024
2580 pages
ISBN: 9798400704505
DOI: 10.1145/3630106

Copyright © 2024 Owner/Author
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 5 June 2024
Qualifiers
- research-article
- Research
- Refereed limited