There is a newer version of the record available.

Published October 7, 2022 | Version 1.0.0
Software Open

PLIX - Pipeline for Information Extraction

  • 1. German Aerospace Center

Description

PLIX (Pipeline for Information Extraction) is a Python package and command line tool for information extraction from (PDF) documents.
As of now, it provides functionality to extract all raw data from PDFs and then extract key-value-unit tuples from it.

Files

plix.zip

Files (3.8 MB)

Name Size Download all
md5:0c90f12cd5b6cb427333124beb3299e0
3.8 MB Preview Download