Published December 20, 2018 | Version 1.4.1
Software Open

Tokeniser for the Alsatian Dialects

  • 1. Université de Strasbourg

Description

A Python module for tokenising texts written in the Alsatian dialects. See the module header for usage instructions.

The module requires Python 2.7.

This tool was developed as part of the RESTAURE project, funded by the French ANR. The tokeniser is also described in the following article: https://hal.archives-ouvertes.fr/hal-01539160.

Version 1.4.1 fixes a bug that occurred when the space after a comma was missing.
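To illustrate the kind of input involved in the comma fix, here is a minimal sketch of a regular-expression tokeniser that separates a comma from the following word even when no space is present. It is not the module's actual implementation: the pattern, the function name `simple_tokenise`, and the sample string are all assumptions made for illustration, and the real tokeniser handles dialect-specific cases that this sketch ignores.

```python
# -*- coding: utf-8 -*-
import re

# Hypothetical pattern (not taken from the module): either a run of word
# characters, or a single character that is neither word nor whitespace.
TOKEN_RE = re.compile(r"\w+|[^\w\s]", re.UNICODE)


def simple_tokenise(text):
    """Return the word and punctuation tokens of `text`, dropping whitespace."""
    return TOKEN_RE.findall(text)


# Illustrative input with no space after the comma.
sample = u"Hopla,jetzt geht es los"

# Splitting on whitespace alone keeps "Hopla,jetzt" as one token.
print(sample.split())

# The regular expression yields "Hopla", "," and "jetzt" as separate tokens.
print(simple_tokenise(sample))
```

Note that such a simple pattern also splits decimal numbers written with a comma and elided forms written with an apostrophe, which a dialect-aware tokeniser would likely need to treat separately.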

Files

Files (23.3 kB)

md5:c46e7bae366dd24e78ba450244e688d2 (23.3 kB)
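The checksum listed above can be used to check a downloaded copy of the archive. The following is a minimal sketch using Python's standard `hashlib` module; the helper names `md5_of_file` and `matches_record` are hypothetical, and the path to pass in is whatever local name the downloaded file was saved under, since the file name is not given here.

```python
import hashlib

# Checksum copied from the file listing of this record.
EXPECTED_MD5 = "c46e7bae366dd24e78ba450244e688d2"


def md5_of_file(path, chunk_size=65536):
    """Return the hexadecimal MD5 digest of the file at `path`."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so the whole file need not be held in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path):
    """Return True if the file's MD5 digest equals the listed checksum."""
    return md5_of_file(path) == EXPECTED_MD5
```

Calling `matches_record` with the path of the downloaded file returns `True` only if the file is byte-for-byte identical to the one the checksum was computed from.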

Additional details

Related works