Dataset Open Access

A labeled Ecore metamodel dataset for domain clustering

Önder Babur

555 manually labeled Ecore metamodels, mined from GitHub in April 2017.

Domains: (1) bibliography, (2) conference management, (3) bug/issue tracker, (4) build systems, (5) document/office products, (6) requirement/use case, (7) database/sql, (8) state machines, (9) petri nets

Procedure for constructing the dataset: fully manual, by searching for certain keywords and regexes (e.g. "state" and "transition" for state machines) in the metamodels and inspecting the results for inclusion. 
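The keyword search step described above could be approximated with a small prefilter like the one below. This is only a sketch: the function name `is_candidate` and the case-insensitive matching are assumptions, and the only keywords taken from the description are "state" and "transition". The actual dataset was built by manually inspecting the search results, which this code does not replace.

```python
import re
from typing import Iterable


def is_candidate(ecore_text: str, patterns: Iterable[str]) -> bool:
    """Return True if every pattern occurs somewhere in the metamodel text.

    Hypothetical helper: it only flags candidates for later manual
    inspection and does not decide inclusion in a domain.
    """
    return all(
        re.search(pattern, ecore_text, flags=re.IGNORECASE)
        for pattern in patterns
    )


# Example keywords for the state machine domain, as named in the description.
STATE_MACHINE_PATTERNS = ["state", "transition"]
```

A metamodel whose text contains both "State" and "Transition" would be flagged as a candidate; one that contains only "State" would not.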

Format for the file names: ABSINDEX_CLUSTER_ITEMINDEX_name_hash.ecore
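A file name in this format can be split into its fields with a short helper such as the one below. This is a sketch under assumptions not stated in the description: the first three fields contain no underscores, the hash contains no underscores, and the name part may contain underscores. The names `MetamodelName` and `parse_metamodel_filename` are hypothetical, and all fields are kept as strings because their exact types are not documented here.

```python
from typing import NamedTuple


class MetamodelName(NamedTuple):
    abs_index: str
    cluster: str
    item_index: str
    name: str
    file_hash: str


def parse_metamodel_filename(filename: str) -> MetamodelName:
    """Split ABSINDEX_CLUSTER_ITEMINDEX_name_hash.ecore into its fields."""
    suffix = ".ecore"
    if not filename.endswith(suffix):
        raise ValueError(f"not an .ecore file: {filename!r}")
    stem = filename[: -len(suffix)]

    # The first three underscore-separated fields are the indices and cluster.
    parts = stem.split("_", 3)
    if len(parts) != 4:
        raise ValueError(f"unexpected file name layout: {filename!r}")
    abs_index, cluster, item_index, rest = parts

    # Assumption: the hash is the last underscore-separated field,
    # so the name keeps any underscores of its own.
    name, sep, file_hash = rest.rpartition("_")
    if not sep:
        raise ValueError(f"missing name or hash: {filename!r}")
    return MetamodelName(abs_index, cluster, item_index, name, file_hash)
```

For instance, a made-up file name such as `12_8_3_state_machine_abc123.ecore` would yield the name `state_machine` and the hash `abc123`.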

Files (1.0 MB)
Name: manualDomains.zip
Size: 1.0 MB
Checksum: md5:130e6599fef4e5e183a9837ee8660a46