Journal article Open Access

Contract Understanding Atticus Dataset (CUAD) v1

The Atticus Project

Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="" xmlns:oai_dc="" xmlns:xsi="" xsi:schemaLocation="">
  <dc:creator>The Atticus Project</dc:creator>
  <dc:description>Contract Understanding Atticus Dataset (CUAD) v1 is a corpus of more than 13,000 labels in 510 commercial legal contracts that have been manually labeled by The Atticus Project to identify 41 categories of important clauses that lawyers look for when reviewing contracts.

We tested CUAD v1 against ten pretrained AI models and published the results on arXiv here.

ReadMe and Datasheet are published here. Code for replicating the results, together with the model trained on CUAD, is published on GitHub here.</dc:description>
  <dc:title>Contract Understanding Atticus Dataset (CUAD) v1</dc:title>
</oai_dc:dc>