There is a newer version of the record available.

Published November 29, 2022 | Version 05.6
Dataset Open

Dataset - Papyrus - A large scale curated dataset aimed at bioactivity predictions

Description

This repository contains version 05.6 of the Papyrus dataset, an aggregated dataset of small molecule bioactivities, as described in the preprint "Papyrus - A large scale curated dataset aimed at bioactivity predictions".

Changes compared to version 05.5

- applied a small-molecule filter that removes compounds with a molecular weight (MW) below 200 or above 800, compounds containing heavy metals, and mixtures

- added a TID column, which holds information on the original protein identifier
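
The small-molecule filter listed above can be illustrated with a minimal sketch. This is not the code used to build Papyrus. The function name, the HEAVY_METALS set, and the use of a "." in the SMILES string as a proxy for mixtures are all assumptions made for illustration, and the molecular weight is expected to be supplied by the caller (for example, computed with a cheminformatics toolkit).

```python
import re

# Hypothetical, illustrative list of heavy-metal symbols; the list actually
# used for Papyrus is not given in this record.
HEAVY_METALS = {"Hg", "Pb", "Cd", "Pt", "Au", "Ag", "Cu", "Ni", "Co", "Cr"}

# In SMILES, any atom outside the organic subset must be written in brackets,
# optionally preceded by an isotope number, e.g. "[Pt]" or "[195Pt+2]".
_BRACKET_ATOM = re.compile(r"\[\d*([A-Z][a-z]?)")


def passes_small_molecule_filter(smiles: str, mol_weight: float) -> bool:
    """Return True if a compound would be kept by the sketched filter."""
    # Rule 1: drop compounds with MW < 200 or MW > 800.
    if mol_weight < 200 or mol_weight > 800:
        return False
    # Rule 2 (assumption): a "." separates disconnected components in SMILES,
    # which is treated here as a sign of a mixture or salt.
    if "." in smiles:
        return False
    # Rule 3: drop compounds with any bracket atom from the heavy-metal list.
    for symbol in _BRACKET_ATOM.findall(smiles):
        if symbol in HEAVY_METALS:
            return False
    return True
```

The MW bounds are treated as inclusive (a compound with MW exactly 200 or 800 is kept), which follows the "< 200 or > 800" wording above.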

Files

05.6_additional_files.zip

Files (13.3 GB)

Checksum Size
md5:b517f795d1168435fad1d0adfe33f6aa 104.2 MB
md5:b22c0b063c2502088389ff9658cc9397 51.3 kB
md5:fcedda2800c6c9d8fd606b0e1d4525c4 2.1 GB
md5:4d647397b89b3da1690283b508e5e002 96.6 MB
md5:2c0c6062377ff53f9bbb7f5c23d8fc9f 1.5 GB
md5:18c63cf83eab57c85228851c931a74b5 3.1 GB
md5:102908a632d132c2ee28da2a16bf67d9 439.8 MB
md5:f4bdd3889a31fb9b31915f2f14dc59a7 117.1 MB
md5:43a5ae43bbea9fa53f61bddf14afcc45 3.3 GB
md5:7edb808331eda39be4cfff73675e8d28 500.1 MB
md5:27e4ffbb999e1d20c5cb852c97b1f0a1 447.8 MB
md5:0d4c234b0b4955fb7ff665611341b366 207.1 MB
md5:b0f62a23923a4741d651811818fc94ac 1.9 MB
md5:31bc7c86adc7ff3c35a18a0b79397440 711.5 MB
md5:536983256a3ad5ac45822eacc63acc6f 744.4 MB
md5:b75d37e2c813fb8b3df73e33feb31097 12.8 kB
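
The MD5 checksums listed above can be used to check that a download completed intact. The following is a minimal sketch using only the Python standard library. The function names are hypothetical, and the caller must pair each local file with the checksum shown for it on the record page.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Chunked reads keep memory use flat even for multi-GB files.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file against a checksum given as 'md5:<hex>' or '<hex>'."""
    expected_hex = expected.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected_hex
```

For example, matches_checksum(local_path, "md5:...") returns True only if the file at local_path hashes to the given value. Both arguments here are placeholders to be filled in by the reader.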

Additional details

Related works

Is cited by
Preprint: 10.26434/chemrxiv-2021-1rxhk (DOI)
Is described by
Presentation: 10.5281/zenodo.6771177 (DOI)
Is new version of
Dataset: 10.5281/zenodo.7019874 (DOI)
Dataset: 10.4121/16896406.v3 (DOI)

Funding

European Commission
eTRANSAFE - Enhancing TRANslational SAFEty Assessment through Integrative Knowledge Management 777365