Supplementary Material: COVID-19 document type screening
Description
This record contains the datasets and models for COVID-19 document type screening.
It is composed of:
- Epistemonikos training dataset.
- CORD-19 test dataset adapted to the evidence-based medicine domain.
- XLNet model fine-tuned on the Epistemonikos dataset.
- BioBERT model fine-tuned on the Epistemonikos dataset.
Epistemonikos XLNet models fine-tuned on CORD-19:
- Episte-XLNET fine-tuned with a random sampling strategy.
- Episte-XLNET fine-tuned with a data augmentation strategy.
- Episte-XLNET fine-tuned with an uncertainty sampling strategy (iterations 1 and 2).
Scripts to run experiments can be found at: https://github.com/afcarvallo/covid_19_document_type_screening
Files
biobert_model.zip
Total size: 4.4 GB
| MD5 checksum | Size |
|---|---|
| dd7a8615c6977b17f80e97b777cae005 | 433.3 MB |
| bc3f6b9b648b8b3a063d5b5954eb213b | 804.8 MB |
| f6fc88612a8d371852606c77c7bd1aa9 | 36.2 MB |
| 26747b89d80a66053149328442b8fe79 | 760.8 MB |
| 959830a44a0a2fc391e6a47aeaa986b9 | 466.9 MB |
| c033d1ad9433f2dfb391251f85f98274 | 466.9 MB |
| e81650db157889bc73f3f4868a9afcb7 | 466.9 MB |
| 0dfc66e0bef5c6766d72481abfcbd7e5 | 466.9 MB |
| d5a990144e726b56e9cd0f8c938ebb74 | 466.9 MB |
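
Because the archives are large, it may be worth checking a download against its MD5 checksum from the listing above before unpacking it. The following is a minimal sketch using only the Python standard library. The function names `md5_of_file` and `verify_download` are illustrative and are not part of the project's scripts.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so multi-hundred-megabyte archives are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5):
    """Return True if the file's MD5 matches the expected checksum.

    Accepts the checksum with or without a leading "md5:" prefix.
    """
    expected = expected_md5.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

For example, `verify_download("biobert_model.zip", "md5:<checksum from the listing>")` should return `True` for an intact download, assuming the archive was saved under that name in the current directory and the checksum is the one listed for that file.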