FONA corpus: Food & Nutrition Abstracts Multilingual corpus

doi:10.5281/zenodo.5518895

Medical NLP (maintained by the NLP4BIA unit at BSC): language technology resources for clinical and biomedical documents in multiple languages

Published September 10, 2021 | Version 1.1

Dataset Open

1. Barcelona Supercomputing Center

The FONA corpus is a collection of case reports specifically selected to foster the development of Language Technologies, Text Mining and NLP for applications in the domain of food & nutrition.

It contains a large collection of documents (titles and abstracts) with metadata information on their MeSH terms. In addition, a subset of the collection contains automatically recognized entities of the following categories:

medical procedures
symptoms
diseases
medications
occupational and demographic information
species (pathogens)
cancer morphology

Notes

Funded by the Plan de Impulso de las Tecnologías del Lenguaje (Plan TL).

Files

Name	Size
iberhelt.zip (md5:d0d4c00d84c1536f652f2c158e86d0e7)	12.1 MB
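The MD5 checksum listed for iberhelt.zip can be used to confirm that a download is intact. The following is a minimal sketch, assuming the archive has been saved as `iberhelt.zip` in the current working directory; the function names `md5_of_file` and `verify_download` are illustrative and not part of the dataset.

```python
import hashlib
from pathlib import Path

# Checksum as listed on the record page for iberhelt.zip.
EXPECTED_MD5 = "d0d4c00d84c1536f652f2c158e86d0e7"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file at `path` matches the expected MD5 checksum."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    archive = Path("iberhelt.zip")  # assumed local download location
    if archive.exists():
        print("checksum OK" if verify_download(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found")
```

A mismatch usually means the download was truncated or corrupted, and fetching the archive again is the simplest remedy.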

	All versions	This version
Views	495	237
Downloads	28	20
Data volume	254.1 MB	254.1 MB


Resource type

Dataset

Publisher

Zenodo

Languages

Spanish

Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: September 21, 2021
Modified: September 21, 2021
