{ "access": { "embargo": { "active": false, "reason": null }, "files": "public", "record": "public", "status": "open" }, "created": "2022-09-21T14:51:40.370257+00:00", "custom_fields": {}, "deletion_status": { "is_deleted": false, "status": "P" }, "files": { "count": 1, "enabled": true, "entries": { "Back_to_Analog-Printing_TEI_Editions_Kupreyev.pdf": { "checksum": "md5:186a81f8cc709703e3148312b7c7f29e", "ext": "pdf", "id": "00143c7d-1a8d-46d4-a2aa-60cf1e383b85", "key": "Back_to_Analog-Printing_TEI_Editions_Kupreyev.pdf", "metadata": null, "mimetype": "application/pdf", "size": 3213333 } }, "order": [], "total_bytes": 3213333 }, "id": "7101456", "is_draft": false, "is_published": true, "links": { "access": "https://zenodo.org/api/records/7101456/access", "access_links": "https://zenodo.org/api/records/7101456/access/links", "access_request": "https://zenodo.org/api/records/7101456/access/request", "access_users": "https://zenodo.org/api/records/7101456/access/users", "archive": "https://zenodo.org/api/records/7101456/files-archive", "archive_media": "https://zenodo.org/api/records/7101456/media-files-archive", "communities": "https://zenodo.org/api/records/7101456/communities", "communities-suggestions": "https://zenodo.org/api/records/7101456/communities-suggestions", "doi": "https://doi.org/10.5281/zenodo.7101456", "draft": "https://zenodo.org/api/records/7101456/draft", "files": "https://zenodo.org/api/records/7101456/files", "latest": "https://zenodo.org/api/records/7101456/versions/latest", "latest_html": "https://zenodo.org/records/7101456/latest", "media_files": "https://zenodo.org/api/records/7101456/media-files", "parent": "https://zenodo.org/api/records/7101455", "parent_doi": "https://zenodo.org/doi/10.5281/zenodo.7101455", "parent_html": "https://zenodo.org/records/7101455", "requests": "https://zenodo.org/api/records/7101456/requests", "reserve_doi": "https://zenodo.org/api/records/7101456/draft/pids/doi", "self": 
"https://zenodo.org/api/records/7101456", "self_doi": "https://zenodo.org/doi/10.5281/zenodo.7101456", "self_html": "https://zenodo.org/records/7101456", "self_iiif_manifest": "https://zenodo.org/api/iiif/record:7101456/manifest", "self_iiif_sequence": "https://zenodo.org/api/iiif/record:7101456/sequence/default", "versions": "https://zenodo.org/api/records/7101456/versions" }, "media_files": { "count": 0, "enabled": false, "entries": {}, "order": [], "total_bytes": 0 }, "metadata": { "creators": [ { "person_or_org": { "family_name": "Kupreyev", "given_name": "Maxim N.", "identifiers": [ { "identifier": "0000-0002-8472-6954", "scheme": "orcid" } ], "name": "Kupreyev, Maxim N.", "type": "personal" } } ], "description": "
According to Sahle (2017) [1], digital editions are guided by a digital paradigm in their theory, method, and practice, and thus “cannot be given in print without significant loss of content and functionality”. This talk touches upon the challenges of printing TEI XML datasets, but also highlights the diagnostic value of PDF export for data quality. PDF output admittedly represents only part of the encoded information, but it can play an essential role in data curation and quality assurance.
\n\nThe “School of Salamanca” [2] project, jointly sponsored by the AdWL Mainz [3], MPI-LHLT [4] and Goethe-University Frankfurt [5], publishes the works of the jurists and theologians associated with the University of Salamanca, the intellectual center of the Spanish monarchy during the 16th and 17th centuries. Based on a selected set of print editions, we are creating a digital text corpus, which will include 116 works encoded in TEI XML. In addition, we are compiling a historical dictionary of circa 300 essential terms, reflecting the fundamental importance of the School of Salamanca for the early modern discourse about law, politics, religion, and ethics.
\n\nOur TEI XML data is controlled by the RNG schema and is exported to HTML and JSON IIIF for web display [6]. Recently, a PDF printout option was added. Considering the complexity and depth of the annotation, we decided to use the established XSL-FO technology, supported by the free Apache FOP processor integrated in the Oxygen Author workflow. Similar results can be achieved with the CSS Paged Media Module or TEI Publisher. The PDF export highlighted issues which pertain to two ontologically different areas:
\n\nThe issues of the first type concern, for example, the representation of marginal notes and their anchors, and the correspondence of pagination between XML, IIIF and PDF. The problems of the second type include, for instance, divergent XML encodings of semantically identical chunks of information, which escaped the Schematron checks but became visible in the print layout.
\n\nPDF generation in the School of Salamanca project was initially intended to be one of the export methods for the TEI data. It is now implemented early in the TEI production pipeline as a diagnostic tool that exposes semantic and structural inconsistencies in the data, which can then be corrected before the final XML release. PDF production thus adheres to one of the principles of agile software testing, which states that capturing and eliminating defects in the early stages of the research data life cycle is less time-consuming, less resource-intensive and less prone to collateral bugs (Crispin 2008) [7].
\n\nREFERENCES:
\n\n[1] Sahle, Patrick. 2017. "What is a Scholarly Digital Edition?" in Digital Scholarly Editing, edited by Matthew James Driscoll and Elena Pierazzo, 19-39. Cambridge: Open Book Publishers.
\n\n[2] https://www.salamanca.school/en/index.html , accessed on 20.06.2022.
\n\n[3] Akademie der Wissenschaften und der Literatur Mainz, https://www.adwmainz.de/startseite.html.
\n\n[4] Max Planck Institute for Legal History and Legal Theory, https://www.lhlt.mpg.de/en.
\n\n[5] Goethe Universität Frankfurt am Main, https://www.goethe-university-frankfurt.de/en?locale=en.
\n\n[6] https://blog.salamanca.school/de/2022/04/27/the-school-of-salamanca-text-workflow-from-the-early-modern-print-to-tei-all/,
\nhttps://blog.salamanca.school/de/2020/03/17/deutsch-entwicklung-der-webanwendung-v2-0/ , accessed on 20.06.2022.
\n\n[7] Crispin, Lisa. 2008. Agile Testing: A Practical Guide for Testers and Agile Teams. Addison-Wesley.
", "languages": [ { "id": "eng", "title": { "en": "English" } } ], "publication_date": "2022-09-21", "publisher": "Zenodo", "resource_type": { "id": "publication-conferencepaper", "title": { "de": "Konferenzbeitrag", "en": "Conference paper" } }, "rights": [ { "description": { "en": "The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited." }, "icon": "cc-by-icon", "id": "cc-by-4.0", "props": { "scheme": "spdx", "url": "https://creativecommons.org/licenses/by/4.0/legalcode" }, "title": { "en": "Creative Commons Attribution 4.0 International" } } ], "subjects": [ { "subject": "digital humanities" }, { "subject": "digital editions" }, { "subject": "XSL-FO" }, { "subject": "XML" }, { "subject": "TEI" }, { "subject": "PDF export" }, { "subject": "Agile" }, { "subject": "data quality assurance" }, { "subject": "software testing" } ], "title": "Back to analog: the added value of printing digital editions", "version": "1" }, "parent": { "access": { "owned_by": { "user": 411728 } }, "communities": { "default": "90047b15-eaa5-49fc-9564-25af6b3bcda1", "entries": [ { "access": { "member_policy": "open", "members_visibility": "public", "record_policy": "open", "review_policy": "open", "visibility": "public" }, "children": { "allow": false }, "created": "2022-08-22T00:05:26.849520+00:00", "custom_fields": {}, "deletion_status": { "is_deleted": false, "status": "P" }, "id": "90047b15-eaa5-49fc-9564-25af6b3bcda1", "links": {}, "metadata": { "curation_policy": "", "page": "The TEI2022 22th annual Conference and Members’ Meeting of the Text Encoding Initiative Consortium (TEI), was held 13-16 September 2022 (Tue-Fri) at Newcastle University, Newcastle Upon Tyne, United Kingdom with pre-conference workshops 12-13 September 2022 (Mon-Tue).
", "title": "TEI2022 Conference_Text as data" }, "revision_id": 0, "slug": "tei2022", "updated": "2022-08-22T00:07:08.598967+00:00" } ], "ids": [ "90047b15-eaa5-49fc-9564-25af6b3bcda1" ] }, "id": "7101455", "pids": { "doi": { "client": "datacite", "identifier": "10.5281/zenodo.7101455", "provider": "datacite" } } }, "pids": { "doi": { "client": "datacite", "identifier": "10.5281/zenodo.7101456", "provider": "datacite" }, "oai": { "identifier": "oai:zenodo.org:7101456", "provider": "oai" } }, "revision_id": 14, "stats": { "all_versions": { "data_volume": 899918732.0, "downloads": 282, "unique_downloads": 253, "unique_views": 312, "views": 438 }, "this_version": { "data_volume": 469146618.0, "downloads": 146, "unique_downloads": 129, "unique_views": 182, "views": 296 } }, "status": "published", "updated": "2023-02-01T10:13:14.104473+00:00", "versions": { "index": 1, "is_latest": false } }