Biotea-2-Bioschemas test data
- 1. EMBL-EBI, ELIXIR Hub
- 2. Universidad Politécnica de Madrid
- 3. BASF
- 4. ZB MED
Description
Biotea-2-Bioschemas mapps Biotea model to schema.org following the approach proposed by Bioschemas. Here we present the test data used in Biotea GitHub pages, corresponding to 2596 PubMed Open Access (PMC-OA) subset publications together with the software used to render schema.org markup.
Date deposited includes (i) publications retrieved from PMC-OA API, i.e., full text in JATS/XML, (ii) ontology terms recognized in the abstracts and obtained from the NCBO Annotator, i.e., semantic annotations, and (iii) the same annotations following the PubAnnotation format.
Software deposited includes (i) biotea-bioschemas-metadata which parses JATS/XML files and creates Bioschemas markup including metadata, abstract and references, (ii) biotea-bioschemas-annotations which parses PubAnnotation annotations and creates Bioschemas markup, and (iii) biotea-bioschemas-showcase which uses the other two in order to display markup in a graphical basic way and render it as a script element in the HTML following the JSON-LD format. The corresonding GitHub repositories are: (i) https://github.com/biotea/biotea-bioschemas-metadata, (ii) https://github.com/biotea/biotea-bioschemas-annotations, and (iii) https://github.com/biotea/biotea-bioschemas-showcase.
Biotea-2-bioschemas can be seen in action at http://biotea.github.io/bioschemas/
Files
biotea-bioschemas-annotations.zip
Files
(114.9 MB)
Name | Size | Download all |
---|---|---|
md5:c34f1f84a88028552e7c3e97e73ba2d7
|
302.9 kB | Preview Download |
md5:26b922758381dd1ddfa721573d56c26b
|
374.3 kB | Preview Download |
md5:6ac6e567eafc4d49c7ef273b04fb8adf
|
779.7 kB | Preview Download |
md5:73d13456195ba596a7a764432c23a30d
|
41.1 MB | Preview Download |
md5:6c21ac3b985c4164085e99725517d0f9
|
62.8 MB | Preview Download |
md5:567cf1ac315e9be5c34e718fa79d1938
|
9.5 MB | Preview Download |
Additional details
References
- Gray AJG, Goble CA, Jiménez R. Bioschemas: From Potato Salad to Protein Annotation. International Semantic Web Conference. 2017; Available: https://pdfs.semanticscholar.org/74ec/a9c89622bff731b21b03acb4f2400a0f00fa.pdf
- Garcia Castro LJ, McLaughlin C, Garcia A. Biotea: RDFizing PubMed Central in support for the paper as an interface to the Web of Data. J Biomed Semantics. BioMed Central; 2013;4: S5.
- Garcia A, Lopez F, Garcia L, Giraldo O, Bucheli V, Dumontier M. Biotea: semantics for Pubmed Central. PeerJ. PeerJ Inc.; 2018;6: e4201.
- Jonquet C, Shah NH, Youn CH, Musen MA, Storey M-A. NCBO Annotator: Semantic Annotation of Biomedical Data. Proceedings of the 2009 International Semantic Web Conference. 2009. Available:
- Kim J-D, Wang Y. PubAnnotation: a persistent and shareable corpus and annotation repository. Proceedings of the 2012 Workshop on Biomedical Natural Language Processing. Association for Computational Linguistics; 2012. pp. 202–205.