Published December 14, 2023 | Version v1
Presentation Open

Question answering about Research Data Management

  • 1. GEOMAR
  • 2. Institute for Geoinformatics, Münster

Description

Presentation and code of team TLDR, which participated in Challenge 3: Question answering about Research Data Management of the Dataxplorers Hackathon 2023. The challenge was provided by members of the NFDI4Earth consortium for Earth System Sciences. The software attached employs Large Language Models in conjunction with custom embeddings to answer the questions of users correctly and in a concise and understandable way. The application also allows for the creation of new embeddings based on existing websites, as well as some basic functionality to perform preliminary checks on .csv-files against data curation guidelines. 

It must be noted that the Large Language Models employed are made available through the blablador API provided by Helmholtz AI workgroup at FZ Jülich. Thus, the application requires a valid access token to the blablador API which is as of now only available for members of the Helmholtz community. Since the API is closely related to the API openAI employs, customizing the application to work with openAI and/or other providers should be possible. 

Files

DataXplorers_Hackathon_Challenge_3_GEOMAR.pdf

Files (841.5 kB)

Name Size Download all
md5:4287488ba2795837e49a0649153a7194
700.2 kB Preview Download
md5:a91bca2f3b6293706033342eeec5f96b
141.3 kB Preview Download

Additional details

Additional titles

Subtitle
Hackathon contribution of the GEOMAR, Kiel