Published April 5, 2024 | Version 1.0
Dataset Open

Construction Industry Steel Ordering Lists (CISOL) Dataset

  • 1. Bauhaus-Universität Weimar

Description

The Construction Industry Steel Ordering Lists (CISOL) dataset comprises table-centric, real-world documents from the construction industry, annotated to facilitate the testing and training of deep learning models for table detection (TD) and table structure recognition (TSR). 

CISOL Key Features:

  • Steel ordering lists from 24 construction projects carried out between 2015 and 2023, contributed by 10 distinct German structural engineering firms.
  • Images anonymized so that project-specific and creator-specific information cannot be recognized.
  • A total of 3280 images, of which 844 are annotated following the CISOL annotation guideline.

CISOL is structured into two tracks:

  • Track A: TD-TSR version for end-to-end table detection and table structure recognition tasks.
  • Track B: TSR-only version for table structure recognition tasks, featuring images cropped to the actual table areas with accordingly adjusted annotations.
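The relationship between the two tracks can be illustrated with a minimal Python sketch that moves an annotation box from full-image coordinates into the coordinate frame of a cropped table. This is not the CISOL tooling: the box layout `[x, y, width, height]` in pixels, the `Box` alias, and the function name `shift_box_to_crop` are assumptions made for illustration, and the actual annotation format should be checked against the annotation guideline.

```python
from typing import List, Optional

# Assumed layout (not confirmed by the dataset page): [x, y, width, height] in pixels.
Box = List[float]


def shift_box_to_crop(box: Box, table: Box) -> Optional[Box]:
    """Express `box` relative to the top-left corner of `table`, clipped to the crop.

    Returns None when the box does not overlap the table region at all.
    """
    bx, by, bw, bh = box
    tx, ty, tw, th = table

    # Intersect the box with the table region in full-image coordinates.
    x0, y0 = max(bx, tx), max(by, ty)
    x1, y1 = min(bx + bw, tx + tw), min(by + bh, ty + th)
    if x1 <= x0 or y1 <= y0:
        return None  # box lies entirely outside the cropped table

    # Translate so that the table's top-left corner becomes the origin.
    return [x0 - tx, y0 - ty, x1 - x0, y1 - y0]
```

For example, with a hypothetical table region `[100, 200, 400, 300]`, a cell box `[120, 250, 60, 20]` in the full image becomes `[20, 50, 60, 20]` in the cropped image.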

The dataset is developed in accordance with the FAIR Principles, ensuring that it is Findable, Accessible, Interoperable, and Reusable. It can be extended by following the established annotation guideline.

Access to the CISOL Leaderboard will be provided at EvalAI.

 

Files

cisol_annotation_guideline.pdf

Files (772.5 MB)

  • md5:5c3e3f29fcc1211074aa9ca0486f5aa4, 5.4 MB
  • md5:e4476967a6eb08c6837785bc8d42ca16, 384.9 kB
  • md5:b0b04825a99cd8ae8860b80fafad00b1, 170.1 MB
  • md5:1fdfb897aa1b5764630cfb8fb2c870fa, 119.0 MB
  • md5:e8753f84030d426800f3d036987940ac, 477.7 MB
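
The md5 checksums listed above can be used to confirm that a download completed without corruption. The following is a minimal sketch using Python's standard `hashlib` module. The helper names `md5_of_file` and `matches_listed_md5` are hypothetical, and because the page does not show which file name belongs to which checksum, the path in the usage note is a placeholder.

```python
import hashlib
from pathlib import Path


def md5_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.md5()
    with path.open("rb") as handle:
        # Read until an empty bytes object signals end of file.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path: Path, listed: str) -> bool:
    """Compare a file against a checksum written as 'md5:<hex>' or plain '<hex>'."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

A call such as `matches_listed_md5(Path("downloaded_file"), "md5:5c3e3f29fcc1211074aa9ca0486f5aa4")` returns `True` only if the local file (placeholder name) has exactly that checksum.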

Additional details

Dates

Created
2023-02