among/fusus: Delivered
Description
This release marks the handover of this repository from Dirk to Cornelis as main contributor.
So far, Dirk has written most of the code, although all of the work is the result of a close cooperation between Cornelis and Dirk.
Cornelis provided the seminal ideas, organized the project and procured the funding. Cornelis and Dirk discussed every problem and issue along the way on Slack.
The main results are listed below, with their location in this repo in parentheses:
- the fusus code: OCR pipeline and PDF text extraction (fusus)
- example data (examples) (attached as example.zip)
- output data: Lakhnawi TF, TSV, HTML, PDF; Affifi TF, TSV, HTML, PDF (ur) (attached as Lakhnawi.zip and Affifi.zip)
- documentation: Readme, doc-strings in the fusus code, extra markdown files (fusus/docs) (the built site is attached to this release as site.zip)
- notebooks (notebooks), which can be viewed on nbviewer
Files
| Name | Size |
|---|---|
| among/fusus-v0.5.zip (md5:ab470c3e46c0591ae38b860a78ac9235) | 217.5 MB |
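The MD5 checksum listed for the archive can be used to check that a download arrived intact. The following is a minimal sketch using only the Python standard library. The local file name `fusus-v0.5.zip` and the helper names `md5_of_file` and `matches_expected` are assumptions made for illustration; they are not part of the fusus code.

```python
import hashlib
import sys
from pathlib import Path

# Checksum copied from the file listing of this release.
EXPECTED_MD5 = "ab470c3e46c0591ae38b860a78ac9235"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of the file at `path`, read in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Reading in chunks avoids loading a 200+ MB archive into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_MD5):
    """Compare the file's MD5 digest with `expected`, ignoring letter case."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Hypothetical local path; pass the real download location as an argument.
    archive = Path(sys.argv[1] if len(sys.argv) > 1 else "fusus-v0.5.zip")
    if not archive.is_file():
        print(f"File not found: {archive}")
    elif matches_expected(archive):
        print("Checksum OK")
    else:
        print("Checksum MISMATCH")
```

Saved under a name of your choice, the script takes the path to the downloaded archive as its only argument. MD5 is adequate here for detecting corrupted downloads, but it is not a safeguard against deliberate tampering.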
Additional details
Related works
- Is supplement to: https://github.com/among/fusus/tree/v0.5 (URL)