Published April 12, 2023 | Version 3.0
Software Open

SlavConvert: Unicode Font Converter for Cyrillic

  • 1. Institute for Slavic Studies, Heidelberg University

Description

The programme SlavConvert processes Word OpenXML (*.docx) documents with outdated ASCII font encodings for the modern and historical Cyrillic and Greek scripts, and converts them to Unicode while preserving the original formatting. It is a cross-platform implementation of the core functionality of the conversion tool developed as part of the Freiburg University/BMBF project SlaVaComp (https://core.ac.uk/display/27019832).

Two previous versions based on shell scripts were published on the Web site of the Heidelberg University pilot project "OCR-Technologien im Vergleich" and described in: "New Cyrillic Text-Processing and Transcoding Tools for Unix(-like) Systems". Newsletter of the Early Slavic Studies Association 35, No. 2 (Fall 2022), pp. 3–6.

This updated version comprises a single Perl script with a Tk GUI enabling basic file and font selection, as well as an expanded font table in CSV format based on the Excel "profile" spreadsheet from the original SlaVaComp converter (data used with permission from Simon Škilevič). Please note that several dependencies are required to run the script.  Bug reports and suggestions for improvement are welcome and can be sent to: epolyglott@slav.uni-heidelberg.de.

Special thanks go to Achim Rabus and Aleksei Pentkovskii for testing the programme and providing ideas and suggestions for its development.

Files

chartable.csv

Files (135.7 kB)

Name Size Download all
md5:fe9721d6f4cc4f609a145a773579aeab
125.0 kB Preview Download
md5:4c645b9c56811700ed921d722e37854f
10.8 kB Download