Published September 1, 2020 | Version v1
Dataset Open

MIDAS hand-annotated news articles

Authors/Creators

  • 1. Institute Jozef Stefan

Description

This dataset was produced in 2020 from the data collected throughout 2019 for the development of the MIDAS project (http://www.midasproject.eu/)

The data is distributed throughout 5 topics:

- EUS: Childhood Obesity (UC Basque Country)
- FIN: Mental Health (UC Finland)
- IRE: Diabetes (UC Ireland)
- NIR: Children in Care (UC Northern Ireland)
- INF: Infectious Diseases including Coronavirus (UC Influenzanet)

The available data comes in 3 kinds and file formats:

TXT - the source of news including ID, title and body of text
CSV - the hand annotation of the news articles in TXT with 5 to 10 MeSH headings
JSON - the input file for the evaluation of the classifier, including the title, news article body and MeSH heading IDs (available from https://www.ncbi.nlm.nih.gov/mesh/)

The CSV files with name starting in "f1_", "pr_", "re_" are the results of the F1/Precision/Recall evaluation for each of the cases.

## AUTHORS

Joao Pita Costa, Anthony Staines, Jarmo Pääkkönen, Jenni Konttila, Joseba Bidaurrazaga, Oihana Belar, Christine Henderson

## ACKNOWLEDGMENTS

This work was supported by the European Commission H2020 project MIDAS (G.A. nr. 727721). 


## LICENSE

This dataset is licensed over Creative Commons.

Notes

This work was supported by the European Commission H2020 project MIDAS (G.A. nr. 727721).

Files

EUS.csv

Files (479.8 kB)

Name Size Download all
md5:739903e516fbecdc41207c83a0c61ea2
5.8 kB Preview Download
md5:308cef7bf45284ab54759697f6ad5818
60.5 kB Preview Download
md5:c60aed4637743171a1150dc4dbd26f9e
7.2 kB Preview Download
md5:0b66d97b26149cfa3c103d0c2a4c4fc4
6.7 kB Preview Download
md5:1bafb3def59ea4b35b9e2bf6e0fc7dc1
6.3 kB Preview Download
md5:5c02cc91c5d5f377a9c44f22f5f2e844
7.0 kB Preview Download
md5:bd3ed455d1178a0f9c9f34f5d02a9ccb
7.1 kB Preview Download
md5:d11408e0fd57350fa02beba9e0b073ff
6.9 kB Preview Download
md5:e1a43853ef76fb946203e03f21e81f68
72.4 kB Preview Download
md5:6ba0d68c734c3fbe4c4b004cbde631b9
6.6 kB Preview Download
md5:c90307e2be9a2d72ad9a7c55cc7f5d66
52.7 kB Preview Download
md5:788814fd5ff80198e74ad7d4ff150328
6.2 kB Preview Download
md5:4d199f036d106b0b5ea1e111e42a93f2
75.2 kB Preview Download
md5:73d0411829d35f094cfca66d7191a7d9
1.3 kB Preview Download
md5:fd6e20464f0442ba457a419b0286c980
5.5 kB Preview Download
md5:9939e55d6851dce2d964461c3188ae46
86.5 kB Preview Download
md5:2ab7d3490e613ab0793df5703100aeae
7.0 kB Preview Download
md5:a0e6c409095a2ebf09fa5e09b8dc1685
6.6 kB Preview Download
md5:cfcd10022ceffab0fd91f3c4e98be777
6.7 kB Preview Download
md5:7a9ef14a25cc4c99761ad171d45aa30b
6.7 kB Preview Download
md5:4a3c190878d8830391d76d39cb267249
6.5 kB Preview Download
md5:8018bc44a24ccf3470d64c15e4946658
7.0 kB Preview Download
md5:d39c6d684bef952375dee25ad6bb46de
6.7 kB Preview Download
md5:527c8745385480b7b983aae24da04984
6.3 kB Preview Download
md5:7a9ef14a25cc4c99761ad171d45aa30b
6.7 kB Preview Download
md5:64b4bda502cd493d8096a8a37e360578
6.0 kB Preview Download