Replication package for the research article entitled "From Sketches to Canonical Models: Benchmarking Vision–Language Models for UML Generation"
Authors/Creators
Description
This repository contains the dataset and the output used for the paper entitled:
"From Sketches to Canonical Models: Benchmarking Vision–Language Models for UML Generation"
Submitted to the International Conference on Evaluation and Assessment in Software Engineering (EASE), 2026 edition
Repository Structure
- informal_diagrams/
Contains the original informal diagrams extracted from academic papers. These were used as input for all tested language models.
- generated_diagrams/
Contains UML diagrams generated by four multimodal Large Language Models (LLMs):
-
- ChatGPT 4o (OpenAI)
- Claude 3.7 Sonnet (Anthropic)
- Gemini 2.5 Pro (Google)
- Llama 4 Maverick (Meta)
Each file was generated from an informal diagram using a unified prompting approach and validated for PlantUML syntax.
- error_outputs/
Contains any failed or invalid output files that did not pass the PlantUML syntax validation step. These are useful for analyzing failure cases in UML generation.
- samples/
Contains some samples mentioned in the article, for a quick analysis.
Purpose
This dataset supports our evaluation of:
- Syntactic correctness of UML outputs
- Semantic alignment with the original informal input
- Model performance variation across five UML diagram types (Class, Activity, Use Case, State Machine, Sequence)
Files
ChatGPT_paper_100_Figure2_activity_diagram_response.png
Files
(263.3 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:90131a49f83d532a9cdbcef0659f24e8
|
40.3 kB | Preview Download |
|
md5:5e3ba2fe31dad9142678a3c1f462824a
|
61.9 kB | Preview Download |
|
md5:b546d294eb663b7100a2613464c067d7
|
65.4 kB | Preview Download |
|
md5:42364a252682e19ccb0f8da44f1f7842
|
38.0 kB | Preview Download |
|
md5:53f662fe179251816d83505ef01aabec
|
19.1 kB | Preview Download |
|
md5:c90e0f1d2356a53c294d9b63cce14669
|
74.9 kB | Preview Download |
|
md5:0bc5d000c3aff5f68416ba46ac40f595
|
85.1 kB | Preview Download |
|
md5:2e27b51baf071fcb9585614cee4d5ad5
|
74.6 kB | Preview Download |
|
md5:961c14fe9ff0751714450dbf7829246b
|
92.2 kB | Preview Download |
|
md5:6918e9670683cc6fbde1ec0bc5ea2f60
|
46.1 kB | Preview Download |
|
md5:13040baaa8e00ed8bed50c31cc6ef548
|
36.1 kB | Preview Download |
|
md5:74fcc146e6d5bd958d75503c48c44079
|
71.3 kB | Preview Download |
|
md5:6fb5c349afcc84d2b02443e8ddd9f9f2
|
60.8 kB | Preview Download |
|
md5:f55622ebdfe9a74bbbdb2407045c36e0
|
75.6 kB | Preview Download |
|
md5:ad9e95f319cc00a606074c2ed4b27278
|
72.6 kB | Preview Download |
|
md5:bb285ddbcb087f5ac1324e4df21d9c3c
|
27.3 MB | Preview Download |
|
md5:398410c37039d54b1b2b60c48e02e86f
|
165.1 kB | Preview Download |
|
md5:e00fbb710204ea694278d323f714f15e
|
136.2 kB | Preview Download |
|
md5:338e831c91eb5f36cd5fdf0310f74dc8
|
44.7 kB | Preview Download |
|
md5:e6af743fd64294dfcf023f8bb12de39c
|
244.3 kB | Preview Download |
|
md5:92599da6c8d7c2b696fcccc524b67d0d
|
137.6 kB | Preview Download |
|
md5:65a43ffc42391917ef96837fa93185ba
|
162.1 MB | Preview Download |
|
md5:a5246fc5804b78438f9b82464c02a649
|
71.6 MB | Preview Download |
|
md5:142850e7b9d74bd930a6b7fd8678144c
|
33.3 kB | Preview Download |
|
md5:10445247bd4de8c524e7b1b67cc19151
|
27.9 kB | Preview Download |
|
md5:b530c64b68c4b4df6b45b504da001200
|
18.5 kB | Preview Download |
|
md5:737768e54ad2d58c7587869b0e7b6367
|
24.9 kB | Preview Download |
|
md5:fa500fc6bc506cd5f471be17bb42417a
|
96.7 kB | Preview Download |
|
md5:380feb96a8850ad19f5ef308b1ffc962
|
202.9 kB | Preview Download |
|
md5:e5af619b88fbff82d8fd2e9afa9a869b
|
317.3 kB | Preview Download |
|
md5:a58cea167cd4e21a8b848564cd84756d
|
1.4 kB | Preview Download |