There is a newer version of the record available.

Published June 15, 2021 | Version v3
Journal article Open

Supplementary Material: COVID-19 document type screening

  • 1. Pontificia Universidad Católica de Chile

Description

This record holds the datasets and fine-tuned models for COVID-19 document type screening.

It is composed of: 

- Epistemonikos training dataset

- CORD-19 test dataset adapted for the evidence-based medicine domain

- XLNet model fine-tuned on the Epistemonikos dataset.

- BioBERT model fine-tuned on the Epistemonikos dataset.

Epistemonikos XLNet models (Episte-XLNET) further fine-tuned on CORD-19:

- Episte-XLNET fine-tuned with random sampling strategy. 

- Episte-XLNET fine-tuned with data augmentation strategy.

- Episte-XLNET fine-tuned with uncertainty sampling strategy (iterations 1 and 2).

Scripts to run experiments can be found at: https://github.com/afcarvallo/covid_19_document_type_screening
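
For readers unfamiliar with the term, the following is a minimal sketch of one common form of uncertainty sampling (least-confidence selection). It is a generic illustration and not the authors' code, which lives in the repository above. The function names, the toy probabilities, and the choice of the least-confidence criterion are all assumptions made for this example; the record does not state which uncertainty measure was used.

```python
# Generic least-confidence uncertainty sampling (illustrative only).
# All names and numbers here are hypothetical, not taken from the record.

def least_confidence(probs):
    """Uncertainty of one prediction: 1 minus the top class probability."""
    return 1.0 - max(probs)


def select_uncertain(pool_probs, k):
    """Return indices of the k pool items the model is least sure about."""
    scores = [least_confidence(p) for p in pool_probs]
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return ranked[:k]


# Toy class-probability outputs for four unlabeled documents.
pool_probs = [
    [0.98, 0.01, 0.01],  # confident prediction
    [0.40, 0.35, 0.25],  # uncertain
    [0.70, 0.20, 0.10],  # fairly confident
    [0.34, 0.33, 0.33],  # most uncertain
]

# Indices of the two documents that would be sent for labeling next.
picked = select_uncertain(pool_probs, k=2)
print(picked)
```

In an iterative setup, the selected documents would be labeled, added to the training set, and the model fine-tuned again before the next selection round.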

Files

biobert_model.zip

Files (4.4 GB)

MD5 checksum | Size
md5:dd7a8615c6977b17f80e97b777cae005 | 433.3 MB
md5:bc3f6b9b648b8b3a063d5b5954eb213b | 804.8 MB
md5:f6fc88612a8d371852606c77c7bd1aa9 | 36.2 MB
md5:26747b89d80a66053149328442b8fe79 | 760.8 MB
md5:959830a44a0a2fc391e6a47aeaa986b9 | 466.9 MB
md5:c033d1ad9433f2dfb391251f85f98274 | 466.9 MB
md5:e81650db157889bc73f3f4868a9afcb7 | 466.9 MB
md5:0dfc66e0bef5c6766d72481abfcbd7e5 | 466.9 MB
md5:d5a990144e726b56e9cd0f8c938ebb74 | 466.9 MB
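
Because several of the archives are hundreds of megabytes, it may be worth checking a download against the MD5 value listed above. The sketch below uses only Python's standard `hashlib` module. The helper names and the local file path in the usage comment are hypothetical, and this listing does not say which file name belongs to which checksum, so that mapping has to be taken from the record page itself.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a file against a listed value such as 'md5:dd7a86...'."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected


# Usage (hypothetical local path; pair it with the checksum shown for
# that file on the record page):
#   ok = matches_listing("downloaded_file.zip",
#                        "md5:dd7a8615c6977b17f80e97b777cae005")
#   print("checksum OK" if ok else "checksum mismatch, re-download")
```

Reading in chunks keeps memory use flat regardless of archive size.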