Published April 5, 2024 | Version 1.0
Dataset
Open
Construction Industry Steel Ordering Lists (CISOL) Dataset
Description
The Construction Industry Steel Ordering Lists (CISOL) dataset comprises table-centric, real-world documents from the construction industry, annotated to facilitate the testing and training of deep learning models for table detection (TD) and table structure recognition (TSR).
CISOL Key Features:
- Steel ordering lists from 24 construction projects carried out between 2015 and 2023, contributed by 10 distinct German structural engineering firms.
- Images anonymized so that project-specific and creator-specific information cannot be recognized.
- A total of 3280 images, with 844 annotated following the CISOL annotation guidelines.
CISOL is structured into two tracks:
- Track A: TD-TSR version for end-to-end table detection and table structure recognition tasks.
- Track B: TSR-only version for table structure recognition tasks, featuring images cropped to the actual table areas with accordingly adjusted annotations.
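To illustrate what "accordingly adjusted annotations" in Track B could mean in practice, the following is a minimal sketch of translating bounding boxes from page coordinates into the coordinate frame of a cropped table. It assumes axis-aligned pixel boxes given as `(x_min, y_min, x_max, y_max)`; the function name `shift_boxes_to_crop` and this box layout are hypothetical and are not taken from the CISOL annotation guidelines, which may use a different format.

```python
from typing import List, Tuple

# Assumed box layout (not from the CISOL guidelines): (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


def shift_boxes_to_crop(table_box: Box, cell_boxes: List[Box]) -> List[Box]:
    """Translate boxes from page coordinates into the cropped table's coordinates."""
    tx0, ty0, tx1, ty1 = table_box
    width, height = tx1 - tx0, ty1 - ty0
    shifted = []
    for x0, y0, x1, y1 in cell_boxes:
        # Subtract the crop origin, then clamp each coordinate to the crop bounds.
        nx0 = min(max(x0 - tx0, 0.0), width)
        ny0 = min(max(y0 - ty0, 0.0), height)
        nx1 = min(max(x1 - tx0, 0.0), width)
        ny1 = min(max(y1 - ty0, 0.0), height)
        shifted.append((nx0, ny0, nx1, ny1))
    return shifted


# Hypothetical example values: one table region and two boxes on the full page.
table = (100.0, 200.0, 900.0, 1000.0)
cells = [(120.0, 220.0, 300.0, 260.0), (90.0, 990.0, 400.0, 1010.0)]
print(shift_boxes_to_crop(table, cells))
# → [(20.0, 20.0, 200.0, 60.0), (0.0, 790.0, 300.0, 800.0)]
```

The second box extends past the table region, so it is clamped to the crop; whether clamping or discarding such boxes is appropriate depends on the annotation format actually used.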
The dataset was developed in accordance with the FAIR Principles (Findable, Accessible, Interoperable, and Reusable). It can be extended by following the established annotation guidelines.
Access to the CISOL Leaderboard will be provided at EvalAI.
Files
cisol_annotation_guideline.pdf
Total size: 772.5 MB
Checksum | Size |
---|---|
md5:5c3e3f29fcc1211074aa9ca0486f5aa4 | 5.4 MB |
md5:e4476967a6eb08c6837785bc8d42ca16 | 384.9 kB |
md5:b0b04825a99cd8ae8860b80fafad00b1 | 170.1 MB |
md5:1fdfb897aa1b5764630cfb8fb2c870fa | 119.0 MB |
md5:e8753f84030d426800f3d036987940ac | 477.7 MB |
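The MD5 values listed for the files can be used to check a download for corruption. The following is a minimal sketch using Python's standard `hashlib` module; the helper names `md5_of_file` and `matches_listing` are hypothetical, and the path in the usage comment is a placeholder, since this listing does not show which checksum belongs to which file name.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a file's MD5 with a listed value such as 'md5:5c3e...' (prefix optional)."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected


# Usage (placeholder path; pair each download with its own listed checksum):
# ok = matches_listing("path/to/downloaded_file", "md5:5c3e3f29fcc1211074aa9ca0486f5aa4")
```

A mismatch suggests an incomplete or corrupted download, in which case the file should be fetched again.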
Additional details
Dates
- Created: 2023-02