J-Guard: Journalism Guided Adversarially Robust Detection of AI-generated News

Kumarage, Tharindu; Bhattacharjee, Amrita; Padejski, Djordje; Roschke, Kristy; Gillmor, Dan; Ruston, Scott; Liu, Huan; Garland, Joshua

Computer Science > Computation and Language

arXiv:2309.03164 (cs)

[Submitted on 6 Sep 2023]

Title:J-Guard: Journalism Guided Adversarially Robust Detection of AI-generated News

Authors:Tharindu Kumarage, Amrita Bhattacharjee, Djordje Padejski, Kristy Roschke, Dan Gillmor, Scott Ruston, Huan Liu, Joshua Garland

View PDF

Abstract:The rapid proliferation of AI-generated text online is profoundly reshaping the information landscape. Among various types of AI-generated text, AI-generated news presents a significant threat as it can be a prominent source of misinformation online. While several recent efforts have focused on detecting AI-generated text in general, these methods require enhanced reliability, given concerns about their vulnerability to simple adversarial attacks. Furthermore, due to the eccentricities of news writing, applying these detection methods for AI-generated news can produce false positives, potentially damaging the reputation of news organizations. To address these challenges, we leverage the expertise of an interdisciplinary team to develop a framework, J-Guard, capable of steering existing supervised AI text detectors for detecting AI-generated news while boosting adversarial robustness. By incorporating stylistic cues inspired by the unique journalistic attributes, J-Guard effectively distinguishes between real-world journalism and AI-generated news articles. Our experiments on news articles generated by a vast array of AI models, including ChatGPT (GPT3.5), demonstrate the effectiveness of J-Guard in enhancing detection capabilities while maintaining an average performance decrease of as low as 7% when faced with adversarial attacks.

Comments:	This Paper is Accepted to The 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (IJCNLP-AACL 2023)
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2309.03164 [cs.CL]
	(or arXiv:2309.03164v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2309.03164

Submission history

From: Tharindu Kumarage [view email]
[v1] Wed, 6 Sep 2023 17:06:31 UTC (137 KB)

Computer Science > Computation and Language

Title:J-Guard: Journalism Guided Adversarially Robust Detection of AI-generated News

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:J-Guard: Journalism Guided Adversarially Robust Detection of AI-generated News

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators