Published September 13, 2021 | Version 3.0
Dataset Open

Fighting COVID-19 with computational tools: an AI guided review of 17,000 studies - The CSCoV database.

  • 1. KAUST

Description

CSCoV (Computational Studies about COVID-19) is a dataset containing COVID-19 related studies extracted from PubMed, bioRxiv, medRxiv, and arXiv, together with article and author related metrics obtained from Semantic Scholar (plus page views from bioRxiv and medRxiv). Using machine learning, the articles are categorized in six topics (Pharmacology, Genomics, Epidemiology, Healthcare, Clinical Medicine, Clinical Imaging) and prioritized. The database is periodically updated.

  • Publication: TBA
  • Files included in this release:
    • cscov_09_2021.png: dataset statistics for the current CSCoV release.
    • cscov_09_2021.tsv: CSCoV database.
    • schema.json: metadata.
    • cscov_09_2021.tar.gz: Doc2Vec and DeepWalk features used for the DL model
  • Source code: https://github.com/SFB-KAUST/covid-review

Files

cscov_09_2021.png

Files (181.0 MB)

Name Size Download all
md5:7eba0bdea9d223375901a99000f7433d
167.1 kB Preview Download
md5:5b6c8de2ceef2a35a45409655eea7d3a
137.6 MB Download
md5:d525640ff385f448f8878e735d2ac7f0
43.2 MB Download
md5:37026e6b34923954f766f40ef7eb829d
3.3 kB Preview Download