There is a newer version of the record available.

Published April 14, 2025 | Version v2
Dataset Open

SeMRA Raw Semantic Mapping Database

  • 1. ROR icon RWTH Aachen University
  • 2. ROR icon Northeastern University

Description

An automatically assembled dataset of raw semantic mappings produced by python -m semra.database. This incorporates mappings from the following places:

  1. Ontologies indexed in the Bioregistry (primary)
  2. Databases integrated in PyOBO (primary)
  3. Biomappings (secondary)
  4. Wikidata (primary/secondary)
  5. Custom resources integrated in SeMRA (primary)

This is a database of raw mapping without further processing. For processed mapping datasets, we suggest smaller domain-specific processing rules (see https://github.com/biopragmatics/semra/tree/main/notebooks/landscape for examples).

How to Run

  1. Download all artifacts from this Record
  2. Make sure that you have Docker running locally
  3. Run sh run_on_docker.sh from the command line
  4. Navigate to http://localhost:8773 to see the SeMRA dashboard or to http://localhost:7474 for direct access to the Neo4j graph database

Licensing

Mappings are licensed according to their primary resources. These are explicitly annotated in the SSSOM file on each row (when available) and on the mapping set level in the Neo4j graph database artifacts. Inferred mappings are distributed under the public domain CC0-1.0 license.

Files

Files (5.7 GB)

Name Size Download all
md5:985af1a8a37e52603db80d3b6dd8b057
285.2 MB Download
md5:8477cb8d651ee494adcaf915dbdf4817
1.9 kB Download
md5:8cdf76ba3985d026480a1851d6b99490
2.8 GB Download
md5:5ba6e61381bf05c6f1529d93fc2091ea
913.0 MB Download
md5:c5a97d1cae6ca9255ad1e9f37129f7c2
412.5 MB Download
md5:ae95b8db3eab118327392296998f01f6
913.0 MB Download
md5:06794890e1a34351e2c67010f1da67dc
7.5 kB Download
md5:9e73641316307a65601b46523050989c
389.7 MB Download
md5:42931847b01fa699b49e9b381d501f60
290 Bytes Download
md5:5d9e5eafac264498cc911ae9a3864cec
271 Bytes Download

Additional details

Related works

Requires
Software: 10.5281/zenodo.8192828 (DOI)

Software

Repository URL
https://github.com/biopragmatics/semra
Programming language
Python
Development Status
Active