Women Are Underrepresented Among Authors of Retracted Publications: Retrospective Study of 134 Medical Journals

We examined the gender distribution of authors of retracted articles in 134 medical journals across 10 disciplines, compared it with the gender distribution of authors of all published articles, and found that women were underrepresented among authors of retracted articles, and, in particular, of articles retracted for misconduct.


Introduction
There is extensive literature highlighting the inequalities experienced by female researchers throughout their academic careers [1][2][3]. By contrast, there is insufficient data on the association between article retractions and gender. A study of 113 PubMed retraction notices from 2016 showed that fraud and plagiarism were found mainly in articles authored by men and errors in data and analysis were seen mainly in articles authored by women [4]. Another study using a database of retracted articles showed that women represented 27% of first authors and 24% of last authors, but there was no comparison group (ie, the representation of women and men as authors of publications) [5]. There was also no comparison group in a US study that examined 228 cases of misconduct (1994-2012) and found that 149 (65%) were authored by men [6]. Finally, a study assessing factors associated with 611 retractions (2010-2011) found no association with gender, but gender was not determined using a validated tool [7].
In this study, we compared the representation of female first and last authors in retracted articles and all publications by examining 134 medical journals.

Methods
Multimedia Appendix 1 describes the methods in detail. For publications, we used the results of Hart and Perlis [1], who calculated the proportion of female first and last authors of publications in 134 journals across 10 medical specialties for 2008 and 2017. For retractions, we retrieved all PubMed articles published in these journals between January 2003 and December 2022 that were retracted. We evaluated the 2003-2022 period to have a sufficiently large sample size. We retrieved the reason(s) for retraction using the Retraction Watch database and grouped the 102 reasons into 4 main reasons: scientific misconduct only, error(s) only, scientific misconduct and error(s), and reason not related to the author(s). We used the Gender API software to determine first and last authors' gender [8] and, if inference accuracy was <80%, checked the gender manually by consulting websites with photos. Data extraction was done in duplicate by authors PS and MA. Discrepancies were resolved through discussion among research team members. We assessed first and last authorship as these positions indicate the greatest involvement in the article in most biomedical disciplines.
We computed the proportion of retractions and stratified the results by gender and discipline. To exclude ambiguous names that could skew the gender distribution, we repeated the analyses with retractions whose authors' gender was determined with >60% or >70% accuracy [1,3]. Data were analyzed descriptively. Since this study did not involve the collection of personal health-related data, it did not require ethical review, according to current Swiss law.
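The thresholding described above can be illustrated with a minimal sketch. It assumes each author record carries an inferred gender and an inference accuracy in percent. The record type, the function names, and the toy data are hypothetical and are not taken from the study's analysis code or from the Gender API client.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class AuthorRecord:
    """Hypothetical record for one first or last author."""
    gender: str       # "female" or "male", as inferred
    accuracy: float   # inference accuracy in percent (0-100)


def needs_manual_check(record: AuthorRecord, cutoff: float = 80.0) -> bool:
    """Flag a record for manual verification when accuracy is below the cutoff."""
    return record.accuracy < cutoff


def proportion_female(
    records: Sequence[AuthorRecord], min_accuracy: Optional[float] = None
) -> Optional[float]:
    """Share of women among records, optionally restricted to accuracy > min_accuracy."""
    kept = [
        r for r in records
        if min_accuracy is None or r.accuracy > min_accuracy
    ]
    if not kept:
        return None  # no records left after filtering
    return sum(r.gender == "female" for r in kept) / len(kept)


# Toy data for illustration only (not study data).
toy = [
    AuthorRecord("female", 95.0),
    AuthorRecord("male", 99.0),
    AuthorRecord("female", 65.0),
    AuthorRecord("male", 55.0),
]

flagged = [needs_manual_check(r) for r in toy]      # records below 80% accuracy
overall = proportion_female(toy)                    # all records
above_60 = proportion_female(toy, min_accuracy=60)  # sensitivity subsample, >60%
above_70 = proportion_female(toy, min_accuracy=70)  # sensitivity subsample, >70%
```

Comparing `overall` with `above_60` and `above_70` mirrors the sensitivity analysis: if ambiguous names were driving the result, the proportions would shift noticeably once low-accuracy records are dropped.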

Results
After excluding anonymous retracted articles and those with first names given only as initials, gender could be determined for 398 first authors and 395 last authors. Women were first or last authors of 100 (25.1%) and 55 (13.9%) retractions, respectively, while their proportion as first or last authors of all publications was 41.3% and 26.1% in 2008 and 45.4% and 33.4% in 2017, respectively.
The proportion of female first and last authors of all publications was higher in 2017 than in 2008 for all 10 disciplines. The proportion of women was lower for retractions compared to all publications for all 10 disciplines for first authors and 7 disciplines for last authors.
The results were similar when using subsamples. For example, the proportion of female first and last authors of retractions was 24.3% (93/383) and 13.8% (53/383), respectively, when the authors' gender was determined with an accuracy of >60%. It was 24.6% (90/366) and 14% (53/379), respectively, when accuracy was >70%. The sum of the results of each discipline exceeds the total results because 5 journals were classified into 2 different disciplines.
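As a quick arithmetic check, the percentages reported above can be recomputed from the counts given in the text. The sketch below is illustrative only; the helper name `pct` and the dictionary labels are assumptions introduced here.

```python
def pct(numerator: int, denominator: int) -> float:
    """Percentage rounded to one decimal place."""
    return round(100 * numerator / denominator, 1)


# Counts of female authors and totals, as reported in the text.
reported = {
    "first authors, full sample": (100, 398),
    "last authors, full sample": (55, 395),
    "first authors, accuracy >60%": (93, 383),
    "last authors, accuracy >60%": (53, 383),
    "first authors, accuracy >70%": (90, 366),
    "last authors, accuracy >70%": (53, 379),
}

percentages = {label: pct(n, d) for label, (n, d) in reported.items()}

for label, value in percentages.items():
    print(f"{label}: {value}%")
```

The recomputed values agree with the reported figures to one decimal place, and the share of women among last authors stays close to 14% across all three samples.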

Discussion
We found that women were underrepresented among authors of retracted articles, and, in particular, of articles retracted for misconduct.
Compared with the study by Pinho-Gomes et al [5], the proportion of retractions authored by women was similar for first authorship (25% vs 27%) but not for last authorship (14% vs 24%), although those authors included all biomedical journals. Another study showed that women were especially underrepresented among authors of articles retracted for misconduct [4].
Retractions for misconduct can be seen as proxies for scientific integrity, and our results suggest that scientific integrity varies with gender. Identifying the underlying reasons for these gender disparities is challenging. No studies have directly addressed this topic, making it difficult to draw firm conclusions. Biological, social, and cultural factors can interact in a complex way and contribute to the more pronounced competitive tendencies of men versus women, which may be a risk factor for misconduct [9]. Alternatively, women may be less often targeted by investigations than men [10].
Our study has two main limitations. First, gender was determined using Gender API and a manual search instead of self-identification. Second, we dichotomized gender into female and male, which did not allow us to assess nonbinary identity.
License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.

Figure 1. Graph of the proportion of women as first and last authors of retracted articles (2003-2022) and of all publications (2008 and 2017). Data shown by medical specialty.

Table 1. Proportion of women as first and last authors of retracted articles (2003-2022) and of all publications (2008 and 2017). Data shown by medical specialty.