Published January 31, 2023 | Version 1.2
Dataset Open

Raw Data for IntoValue Dataset

  • 1. QUEST Center for Responsible Research, BIH @ Charité Universitätsmedizin Berlin

Description

This data deposit includes the large raw data files used for the "IntoValue" dataset, which underlies several projects at the QUEST Center for Responsible Research in the Berlin Institute of Health (BIH) @ Charité. An initial version of the IntoValue dataset is available on Zenodo: https://doi.org/10.5281/zenodo.5141342. Based on this initial version, the dataset is actively developed and maintained on GitHub: https://github.com/maia-sh/intovalue-data. This Zenodo deposit stores the large raw data files for individual trials that are used in that GitHub repository. These data are deposited for computational reproducibility and documentation; they are not intended to be used for additional projects and do not reflect the most current/accurate data available from each source.

 

This deposit contains raw data from the following sources:

PubMed (pubmed.zip): PubMed XML files are provided courtesy of the U.S. National Library of Medicine (NLM) and were accessed via the Entrez Programming Utilities (E-utilities) API. The files were downloaded on 2021-08-15 and do not reflect the most current/accurate data available from NLM. The following scripts were used to download and create these files: get-pubmed.R; download-pubmed.R.

German Clinical Trials Registry (DRKS) (drks.zip): DRKS does not provide an API, so the registry was web-scraped on 2022-11-01. The following scripts were used to download and create these XML files: get-drks.R; drks-functions.R.

ClinicalTrials.gov (ctgov.zip): ClinicalTrials.gov data were accessed through the PostgreSQL database API of the Clinical Trials Transformation Initiative (CTTI) Aggregate Analysis of ClinicalTrials.gov (AACT). The API was queried and CSV files were generated on 2022-11-01. The following script was used to download and create these files: get-process-aact.R.

ClinicalTrials.gov 2018 (ctgov_2018.zip): Additional trial data for 2018. ClinicalTrials.gov data were accessed through the PostgreSQL database API of the Clinical Trials Transformation Initiative (CTTI) Aggregate Analysis of ClinicalTrials.gov (AACT). The API was queried and CSV files were generated on 2022-11-01. The following script was used to download and create these files: get-process-aact.R.
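
The query-then-export step described for the two ClinicalTrials.gov archives can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the code in get-process-aact.R: the function names, the `studies` table, and its columns and rows are hypothetical, and an in-memory SQLite database stands in for the AACT PostgreSQL server so that the example runs without credentials.

```python
import csv
import os
import sqlite3
import tempfile


def export_query_to_csv(connection, sql, params, csv_path):
    """Run a query on a DB-API connection and write the result to a CSV file.

    Returns the number of data rows written (the header row is not counted).
    """
    cursor = connection.cursor()
    cursor.execute(sql, params)
    # cursor.description holds one tuple per result column; item 0 is its name.
    header = [column[0] for column in cursor.description]
    rows = cursor.fetchall()
    with open(csv_path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(header)
        writer.writerows(rows)
    return len(rows)


def build_demo_database():
    """Create a stand-in database (hypothetical table and rows, not AACT data)."""
    connection = sqlite3.connect(":memory:")
    connection.execute("CREATE TABLE studies (trial_id TEXT, start_year INTEGER)")
    connection.executemany(
        "INSERT INTO studies VALUES (?, ?)",
        [("TRIAL-A", 2017), ("TRIAL-B", 2018), ("TRIAL-C", 2018)],
    )
    return connection


if __name__ == "__main__":
    demo = build_demo_database()
    with tempfile.TemporaryDirectory() as folder:
        target = os.path.join(folder, "studies_2018.csv")
        # SQLite uses "?" placeholders; a PostgreSQL driver would use its own style.
        written = export_query_to_csv(
            demo,
            "SELECT trial_id, start_year FROM studies WHERE start_year = ?",
            (2018,),
            target,
        )
        print(f"wrote {written} rows to {target}")
```

Passing the connection in as an argument keeps the export logic independent of the database driver, so the same function could in principle be pointed at a PostgreSQL connection once the placeholder style in the SQL is adjusted.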

Notes

This work was funded under a grant from the Federal Ministry of Education and Research of Germany (Bundesministerium für Bildung und Forschung - BMBF) [01PW18012].

Files

Files (17.9 MB)

Checksum Size
md5:9de0d82fbcb02156920b55896db8b869 2.3 MB
md5:a235e4d260ebe5c15310c0e5cab8b623 2.2 MB
md5:a0e1a2c9488b412b665c24760a4d6a80 5.0 MB
md5:1fed997b0e6e33bba34fd1cf05ead8c6 8.5 MB
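
Because the deposit exists for computational reproducibility, it can be useful to confirm that a downloaded archive matches its listed MD5 checksum before using it. The following is a minimal sketch; the function names and the example file path are hypothetical and are not part of the deposit or its scripts.

```python
import hashlib


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a file against a checksum written as 'md5:<hex digest>'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

For example, `matches_listed_checksum("downloads/archive.zip", "md5:<digest from the listing>")` returns `True` only when the local file is byte-for-byte identical to the deposited one; the path here is a placeholder, and each archive should be paired with the checksum shown for it on the record page.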

Additional details

Related works

Is compiled by
Software: https://github.com/maia-sh/intovalue-data/releases/tag/v1.1 (URL)
Is derived from
Dataset: 10.5281/zenodo.5141342 (DOI)
Is source of
Dataset: 10.17605/OSF.IO/26DGX (DOI)