There is a newer version of the record available.

Published July 9, 2020 | Version 1.0.0
Dataset Open

Associated Data: RASPD+: Fast protein-ligand binding free energy prediction using simplified physicochemical features

  • 1. Molecular and Cellular Modelling Group, Heidelberg Institute of Theoretical Studies, Schloss-Wolfsbrunnenweg 35, 69118 Heidelberg, Germany; Institute of Pharmacy and Molecular Biotechnology (IPMB), Heidelberg University, Im Neuenheimer Feld 364, 69120 Heidelberg, Germany
  • 2. Supercomputing Facility for Bioinformatics & Computational Biology, Department of Chemistry, Kusuma School of Biological Sciences, Indian Institute of Technology Delhi, Hauz Khas, New Delhi, 110016, India
  • 3. Molecular and Cellular Modelling Group, Heidelberg Institute of Theoretical Studies, Schloss-Wolfsbrunnenweg 35, 69118 Heidelberg, Germany; Center for Molecular Biology (ZMBH), DKFZ-ZMBH Alliance, Heidelberg University, Im Neuenheimer Feld 282, 69120 Heidelberg, Germany; Interdisciplinary Center for Scientific Computing (IWR), Heidelberg University, Im Neuenheimer Feld 205, 69120 Heidelberg, Germany
  • 4. Molecular and Cellular Modelling Group, Heidelberg Institute of Theoretical Studies, Schloss-Wolfsbrunnenweg 35, 69118 Heidelberg, Germany; Center for Molecular Biology (ZMBH), DKFZ-ZMBH Alliance, Heidelberg University, Im Neuenheimer Feld 282, 69120 Heidelberg, Germany

Description

Additional digital data for "RASPD+: Fast protein-ligand binding free energy prediction using simplified physicochemical features" (ChemRxiv preprint: https://doi.org/10.26434/chemrxiv.12636704.v1).

Associated code can be found at: https://github.com/HITS-MCM/RASPDplus

Files:

  • weights.tar.gz: contains the model weights of one random dataset split and its associated cross-validation folds. Used for standard RASPD+ evaluation.
  • additional_model_replicates.tar.gz: contains the remaining models trained on the full set of descriptors.
  • external_test_sets.tar.gz: contains the descriptor tables for all external test sets used.
  • DUD_descriptors.tar.gz: contains the descriptor tables for 30 proteins from the Directory of Useful Decoys (DUD) dataset.
  • run_outputs.tar.gz: contains the performance metric data and predicted values created during the model training and evaluation runs. These are the basis for the figures and metrics in the manuscript.
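Since each entry above is a gzip-compressed tar archive, it can help to look at an archive's contents before unpacking it, especially for the larger ones. The following is a minimal sketch using only the Python standard library. The function name `list_archive_members` and the `limit` parameter are illustrative assumptions and are not part of the RASPD+ code base; the internal layout of the archives is not described in this record.

```python
import tarfile


def list_archive_members(archive_path, limit=20):
    """Return up to `limit` member names from a gzip-compressed tar archive.

    Hypothetical helper for illustration; it only reads the archive index
    and does not extract anything to disk.
    """
    names = []
    with tarfile.open(archive_path, mode="r:gz") as archive:
        for member in archive:
            names.append(member.name)
            if len(names) >= limit:
                break
    return names
```

For example, `list_archive_members("weights.tar.gz")` would return the first few paths stored in the weights archive, assuming the file has been downloaded to the current directory.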

Files (2.4 GB)

  • md5:88956bc187e70d9fa99cc9f6742ebe4a (1.7 GB)
  • md5:fe97dee57536eeb77104e26d85531b68 (4.1 MB)
  • md5:92d6edbc69525ae713b911ecad74ceaf (81.4 kB)
  • md5:dc867dc93781642a3c3f94e27eba0fde (492.4 MB)
  • md5:2f062eaae3be6219e1b477f5102f6b74 (214.9 MB)
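
The MD5 checksums listed above can be used to confirm that a downloaded archive is complete and uncorrupted. The sketch below shows one way to do this with the Python standard library. The function names are illustrative assumptions, and because this listing does not show which checksum belongs to which file name, both the path and the expected checksum are passed in by the caller.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that multi-gigabyte archives are not loaded into memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a file's MD5 with a listed value such as 'md5:88956b...'.

    Hypothetical helper: accepts the value with or without the 'md5:' prefix.
    """
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

A call such as `matches_listed_checksum(path_to_download, "md5:...")` returns `True` only when the file on disk has the listed digest; a `False` result usually means the download should be repeated.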

Additional details

Related works

Is supplement to
Preprint: 10.26434/chemrxiv.12636704.v1 (DOI)