Graph Neural Network and Sentence Transformer Embeddings for SNOMED CT concepts
Creators
Description
Embeddings for SNOMED CT concepts produced by Graph Neural Networks (GNNs) or sentence transformers. Each file is a JSON dictionary that maps the ID of a SNOMED CT concept to its corresponding embedding.
The files base_mini_lm_dict.json and fine_tuned_mini_lm_dict.json contain the sentence transformer embeddings: the former were produced with the base MiniLM model and the latter with a MiniLM model fine-tuned on the concept similarity task. The files gnn_mul_sct_dict.json and gnn_sim_sct_dict.json contain the embeddings produced by a GNN trained on a dataset obtained by transforming the SNOMED CT ontology and on the concept similarity task.
These embeddings were generated and studied in the paper "Assessing the Effectiveness of Embedding Methods in Capturing Clinical Information from SNOMED CT". More information can be found in the following repository: https://github.com/JavierCastellD/AssessingSNOMEDEmbeddings.
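As an illustration of how such a file might be used, the following minimal sketch loads a concept-to-embedding dictionary and compares two concepts with cosine similarity. It assumes that each JSON value is a flat list of numbers, which is not confirmed by this record. The helper names load_embeddings and cosine_similarity, the toy file toy_dict.json, and the concept IDs "111", "222" and "333" are hypothetical placeholders, not real SNOMED CT identifiers or part of the published code.

```python
import json
import math
import tempfile
from pathlib import Path


def load_embeddings(path):
    """Load a JSON file that maps concept IDs to embedding vectors."""
    with open(path, encoding="utf-8") as handle:
        raw = json.load(handle)
    # JSON object keys are always strings. Values are ASSUMED to be flat
    # lists of numbers; adapt this line if the real files are laid out differently.
    return {str(cid): [float(x) for x in vec] for cid, vec in raw.items()}


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


# Toy stand-in for one of the published files: placeholder IDs, 3-d vectors.
toy = {"111": [1.0, 0.0, 0.0], "222": [0.0, 1.0, 0.0], "333": [2.0, 0.0, 0.0]}

with tempfile.TemporaryDirectory() as tmp:
    toy_path = Path(tmp) / "toy_dict.json"
    toy_path.write_text(json.dumps(toy), encoding="utf-8")
    embeddings = load_embeddings(toy_path)

# Vectors pointing in the same direction versus orthogonal vectors.
sim_same_direction = cosine_similarity(embeddings["111"], embeddings["333"])
sim_orthogonal = cosine_similarity(embeddings["111"], embeddings["222"])
```

To try this on the actual data, pass the path of a downloaded file (for example gnn_sim_sct_dict.json) to load_embeddings and look up the concept IDs of interest as string keys.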
Files
base_mini_lm_sct_dict.json
Additional details
Software
- Repository URL: https://github.com/JavierCastellD/AssessingSNOMEDEmbeddings