Journal article Open Access

Contract Understanding Atticus Dataset (CUAD) v1

The Atticus Project

Contract Understanding Atticus Dataset (CUAD) v1 is a corpus of 510 commercial legal contracts carrying more than 13,000 labels. The Atticus Project labeled the contracts manually to identify 41 categories of important clauses that lawyers look for when reviewing contracts.

We evaluated ten pretrained AI models on CUAD v1 and published the results on arXiv here.

Code for replicating the results, together with the model trained on CUAD, is published on GitHub here.

Files (105.9 MB)

Name         Size      MD5
CUAD_v1.zip  105.9 MB  c38f490a984420b8a62600db401fafd5
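
The MD5 checksum listed for CUAD_v1.zip can be used to confirm that a download arrived intact. The following is a minimal sketch, assuming the archive has been saved locally as `CUAD_v1.zip` in the working directory. The helper names `md5_of_file` and `verify_archive` are hypothetical and chosen for illustration only; they are not part of any CUAD tooling.

```python
import hashlib
from pathlib import Path

# Checksum as listed for CUAD_v1.zip on this record page.
EXPECTED_MD5 = "c38f490a984420b8a62600db401fafd5"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks.

    Chunked reading keeps memory use flat for an archive of ~100 MB.
    """
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local download location; adjust to wherever the file was saved.
    archive = Path("CUAD_v1.zip")
    if not archive.exists():
        print(f"{archive} not found; download it first")
    elif verify_archive(archive):
        print("checksum OK")
    else:
        print("checksum MISMATCH; the download may be corrupted")
```

MD5 is suitable here only as an integrity check against transfer errors. It is not a defense against deliberate tampering.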
Statistic         All versions  This version
Views             3,064         3,064
Downloads         466           466
Data volume       49.3 GB       49.3 GB
Unique views      2,740         2,740
Unique downloads  435           435
