Published October 10, 2023 | Version v1
Presentation
Open
New Ways of Creating Research Data: Conversion of Unstructured Text to TEI XML using GPT on the Correspondence of Hugo Schuchardt with a Web Prototype for Prompt Engineering. FORGE 2023. Tübingen
Authors/Creators
- 1. Zentrum für Informationsmodellierung, Universität Graz
- 2. Digital Humanities Craft OG
- 3. Independent Software Developer
Description
This paper explores the use of prompt engineering to streamline the creation of humanities research data by converting unstructured correspondence texts into the TEI XML format. The approach tailors prompts for language models such as GPT so that they produce accurate structured data while preserving context. The paper discusses the iterative refinement of the conversion process, its challenges and potential solutions, and presents a user-friendly web prototype. Overall, prompt engineering shows potential for improving the efficiency of research data creation in the humanities.
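To make the conversion step described above more concrete, the following is a minimal Python sketch of two pieces such a workflow might contain: assembling a prompt around a letter text, and checking that a model's reply is well-formed XML with a TEI root element. The prompt wording and the function names `build_prompt` and `check_tei_response` are hypothetical and are not taken from the presentation or its prototype. The call to the language model is deliberately left out.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the TEI Guidelines for TEI XML documents.
TEI_NS = "http://www.tei-c.org/ns/1.0"


def build_prompt(letter_text: str) -> str:
    """Wrap a letter in conversion instructions.

    The wording is an illustrative assumption, not the authors' prompt.
    """
    return (
        "Convert the following letter into TEI XML. "
        f"Return only the XML, with a <TEI> root element in the namespace {TEI_NS}. "
        "Keep the wording of the letter unchanged.\n\n"
        f"Letter:\n{letter_text.strip()}"
    )


def check_tei_response(xml_text: str) -> bool:
    """Return True if the reply parses as XML and its root is a namespaced TEI element.

    This checks well-formedness only; it does not validate against a TEI schema.
    """
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        return False
    # ElementTree writes namespaced tags as "{namespace}localname".
    return root.tag == f"{{{TEI_NS}}}TEI"
```

A check like this could serve as one signal during iterative prompt refinement: replies that fail it would point to a prompt that needs adjusting. Full validation against a TEI schema would need a separate tool.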
Files
Abstract-FORGE23.pdf
Total size: 2.1 MB

| Checksum | Size |
|---|---|
| md5:c4cfe721381c380ac774d88bc898652b | 195.3 kB |
| md5:cc3b6cdc40d94b19fbcb9b70a40fe90c | 1.9 MB |