Published June 26, 2020 | Version 1.0.0
Dataset Open

Data from: International authorship and collaboration across bioRxiv preprints

  • 1. University of Minnesota

Description

Data and supplementary tables for "International authorship and collaboration across bioRxiv preprints," a paper first posted to bioRxiv and now published in eLife.

  • "reproduce.md" includes all R code used to generate figures and perform analyses described in the paper.
  • "biorxiv_countries.postgres.backup" is a database snapshot that can be loaded into a PostgreSQL database to access all data collected and used in the study.
  • "schema.pdf" describes each field in each table of the database.
  • "manual_edits.sql" describes all corrections made to the automated inference of the country-level affiliations inferred for all authors.
  • "affiliation_corrections.csv" lists every unique affiliation string that was re-categorized after institutional corrections. The consequences of the corrections described in "manual_edits.sql."
  • "institution_corrections_summary.csv" summarizes "affiliation_corrections.csv" by listing each "before" and "after" correction one time. It is important to note that each before/after pair does not necessarily indicate that every affiliation string from the "before" institution was reassigned to the "after" institution, just that at least one affiliation string was switched from one to the other.
    • Note that the final two "corrections" files describe steps taken to correct the institution-level associations between authors and countries. The final set of corrections assigned authors to countries using heuristics that did not take institution-level accuracy into account.

Version history:

  • 1.0.0: New files uploaded reflecting substantial corrections to the data, mostly linked to classification of authors and preprints previously without a country classification. (26 Jun 2020)
  • 0.2.1: Added "schema.pdf" file, previously only in the manuscript.
  • 0.2.0: Added new files "affiliation_corrections.csv" and "institution_corrections_summary.csv"
  • 0.1.1: Database snapshot added.
  • 0.1.0: First version with supplementary tables added.

Files

affiliation_corrections.csv

Files (49.1 MB)

Name Size Download all
md5:cf2a1836c0624ea58fab430094a43c66
1.6 MB Preview Download
md5:2d6dca85aa49c6d7e42141da4ee80b67
1.5 MB Preview Download
md5:7ce48584a603c7b96ad807e796168f5a
39.7 MB Download
md5:6c1a4c27824c6965fb9a9f999788fbf1
1.5 MB Preview Download
md5:2a1e7ceef517556e0eb755ba2e20eca7
315.5 kB Preview Download
md5:58843ecd05f8cdfdb7f548753345e406
1.9 MB Preview Download
md5:2b11512b52cf70837fdc6788061ab270
79.1 kB Preview Download
md5:ed3e857eb63c7a76c11a8ec1a042aa31
71.9 kB Download
md5:957f06df32ddcc401648e9baa0daef99
91.6 kB Preview Download
md5:e904c2faf249bf6181ff28b1c52c69ab
25.5 kB Preview Download
md5:341f206484f00c94a263cdae23afd118
69.9 kB Preview Download
md5:aaa88320e0303bcbcc4883c9dde28ed2
99.3 kB Preview Download
md5:83a66c239c8d5603141188a8e1d0d459
2.1 MB Preview Download
md5:c7b0c05aaa802f7f1bb7354472f6a3f1
5.1 kB Preview Download
md5:083de77a262853fc4a7277274cea9751
11.4 kB Preview Download
md5:089199f46cf8ffd303312c81d8b35e76
9.7 kB Preview Download
md5:35bb45dec0740a51a8a26aa0d07045ab
2.6 kB Preview Download
md5:350f6a60c5a0f9501033f44dbbf4ee99
2.8 kB Preview Download
md5:bedd30acb6e16b7807afa0a3ef0fbba2
6.9 kB Preview Download
md5:b0a5f9633507acd421e2d18075610bc2
7.9 kB Preview Download
md5:ff11599358dc1c227d0191ea7f9421f8
16.2 kB Preview Download
md5:044ca44d21261d0fb3e768072bc71cfd
1.1 kB Preview Download

Additional details

Related works

Is supplement to
Preprint: 10.1101/2020.04.25.060756 (DOI)