Published December 27, 2023 | Version v1
Dataset Open

Rare Diseases hand-annotated news articles: Angelman, De Lange, Fragile X, Kleefstra

  • 1. Jožef Stefan Institute

Description

This dataset was produced in 2023 from the data collected throughout 2023 from Event Registry (news) for the development of the Rare Diseases Mining project (https://idefine-europe.org/medline)

The data is distributed across 4 specific diseases supporting the research paper "Automatic text classification and interactive data visualization of published scientific and news articles on Rare Diseases"

The available data comes in the file formats:
CSV - the hand annotation of the news articles in TXT with 5 to 10 MeSH headings

This work was prepared by Joao Pita Costa (researcher) and curated by Tanja Zdolšek Draksler (domain expert)  

Files

Angelman.csv

Files (92.1 kB)

Name Size Download all
md5:0d50eed90bd726a6c5102c92d7f9e4ce
23.6 kB Preview Download
md5:5caeec8e65cd6ee632c65350affd9f9a
23.1 kB Preview Download
md5:7616cd903e00652c8cd594b1ba13c993
20.8 kB Preview Download
md5:0d1e5fd7564b51e09088cb07425882c6
24.5 kB Preview Download