Published March 1, 2026 | Version v1
Diagram Open

Replication package for the research article entitled "From Sketches to Canonical Models: Benchmarking Vision–Language Models for UML Generation"

Authors/Creators

Description

This repository contains the dataset and the output used for the paper entitled:  
"From Sketches to Canonical Models: Benchmarking Vision–Language Models for UML Generation"
Submitted to the International Conference on Evaluation and Assessment in Software Engineering (EASE), 2026 edition

Repository Structure

- informal_diagrams/ 

Contains the original informal diagrams extracted from academic papers. These were used as input for all tested language models.

- generated_diagrams/

Contains UML diagrams generated by four multimodal Large Language Models (LLMs):

    • ChatGPT 4o (OpenAI)
    • Claude 3.7 Sonnet (Anthropic)
    • Gemini 2.5 Pro (Google)
    • Llama 4 Maverick (Meta)

  Each file was generated from an informal diagram using a unified prompting approach and validated for PlantUML syntax.

- error_outputs/

Contains any failed or invalid output files that did not pass the PlantUML syntax validation step. These are useful for analyzing failure cases in UML generation.

- samples/ 

Contains some samples mentioned in the article, for a quick analysis.

 

Purpose

This dataset supports our evaluation of:
- Syntactic correctness of UML outputs
- Semantic alignment with the original informal input
- Model performance variation across five UML diagram types (Class, Activity, Use Case, State Machine, Sequence)

Files

ChatGPT_paper_100_Figure2_activity_diagram_response.png

Files (263.3 MB)

Name Size Download all
md5:90131a49f83d532a9cdbcef0659f24e8
40.3 kB Preview Download
md5:5e3ba2fe31dad9142678a3c1f462824a
61.9 kB Preview Download
md5:b546d294eb663b7100a2613464c067d7
65.4 kB Preview Download
md5:42364a252682e19ccb0f8da44f1f7842
38.0 kB Preview Download
md5:53f662fe179251816d83505ef01aabec
19.1 kB Preview Download
md5:c90e0f1d2356a53c294d9b63cce14669
74.9 kB Preview Download
md5:0bc5d000c3aff5f68416ba46ac40f595
85.1 kB Preview Download
md5:2e27b51baf071fcb9585614cee4d5ad5
74.6 kB Preview Download
md5:961c14fe9ff0751714450dbf7829246b
92.2 kB Preview Download
md5:6918e9670683cc6fbde1ec0bc5ea2f60
46.1 kB Preview Download
md5:13040baaa8e00ed8bed50c31cc6ef548
36.1 kB Preview Download
md5:74fcc146e6d5bd958d75503c48c44079
71.3 kB Preview Download
md5:6fb5c349afcc84d2b02443e8ddd9f9f2
60.8 kB Preview Download
md5:f55622ebdfe9a74bbbdb2407045c36e0
75.6 kB Preview Download
md5:ad9e95f319cc00a606074c2ed4b27278
72.6 kB Preview Download
md5:bb285ddbcb087f5ac1324e4df21d9c3c
27.3 MB Preview Download
md5:398410c37039d54b1b2b60c48e02e86f
165.1 kB Preview Download
md5:e00fbb710204ea694278d323f714f15e
136.2 kB Preview Download
md5:338e831c91eb5f36cd5fdf0310f74dc8
44.7 kB Preview Download
md5:e6af743fd64294dfcf023f8bb12de39c
244.3 kB Preview Download
md5:92599da6c8d7c2b696fcccc524b67d0d
137.6 kB Preview Download
md5:65a43ffc42391917ef96837fa93185ba
162.1 MB Preview Download
md5:a5246fc5804b78438f9b82464c02a649
71.6 MB Preview Download
md5:142850e7b9d74bd930a6b7fd8678144c
33.3 kB Preview Download
md5:10445247bd4de8c524e7b1b67cc19151
27.9 kB Preview Download
md5:b530c64b68c4b4df6b45b504da001200
18.5 kB Preview Download
md5:737768e54ad2d58c7587869b0e7b6367
24.9 kB Preview Download
md5:fa500fc6bc506cd5f471be17bb42417a
96.7 kB Preview Download
md5:380feb96a8850ad19f5ef308b1ffc962
202.9 kB Preview Download
md5:e5af619b88fbff82d8fd2e9afa9a869b
317.3 kB Preview Download
md5:a58cea167cd4e21a8b848564cd84756d
1.4 kB Preview Download