Journal article Open Access

Contract Understanding Atticus Dataset (CUAD) v1

The Atticus Project

MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="">
  <controlfield tag="005">20210729221304.0</controlfield>
  <controlfield tag="001">4595826</controlfield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">105883672</subfield>
    <subfield code="z">md5:c38f490a984420b8a62600db401fafd5</subfield>
    <subfield code="u"></subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2021-03-11</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire</subfield>
    <subfield code="o"></subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">The Atticus Project</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Contract Understanding Atticus Dataset (CUAD) v1</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u"></subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2"></subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;Contract Understanding Atticus Dataset (CUAD) v1 is a corpus of more than 13,000 labels in 510 commercial legal contracts that have been manually labeled by &lt;a href=""&gt;The Atticus Project&lt;/a&gt;&amp;nbsp;to identify 41 categories of important clauses that lawyers look for when reviewing contracts.&lt;/p&gt;

&lt;p&gt;We tested CUAD v1 against ten&amp;nbsp;pretrained AI models and published the results on arXiv&amp;nbsp;&lt;a href=""&gt;here&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;ReadMe and Datasheet are published &lt;a href=""&gt;here&lt;/a&gt;. Code for replicating the results, together with the model trained on CUAD, is published on Github &lt;a href=""&gt;here&lt;/a&gt;.&amp;nbsp;&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.4595825</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.4595826</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">publication</subfield>
    <subfield code="b">article</subfield>
  </datafield>
</record>
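
The 856 datafield in the record lists a size (subfield "s") and an MD5 digest (subfield "z") for the attached file. A minimal sketch of how a downloaded copy might be checked against those two values follows. It assumes that "s" is the file size in bytes and that "z" is the MD5 of the file as distributed; the function names are illustrative and are not part of the record or of any repository API.

```python
import hashlib
import os

# Values copied from the 856 datafield of the record.
EXPECTED_SIZE = 105883672  # subfield "s", assumed to be the size in bytes
EXPECTED_MD5 = "c38f490a984420b8a62600db401fafd5"  # subfield "z" without the "md5:" prefix


def file_md5(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path, expected_size=EXPECTED_SIZE, expected_md5=EXPECTED_MD5):
    """Check a local file against the size and checksum listed in the record."""
    # Compare sizes first: it is cheap and avoids hashing an obviously wrong file.
    if os.path.getsize(path) != expected_size:
        return False
    return file_md5(path) == expected_md5
```

Calling `matches_record("path/to/downloaded_file")` with the path of a local copy (the path here is a placeholder) would return `True` only when both the size and the digest agree with the record.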
                  All versions   This version
Views             13,463         13,459
Downloads         4,503          4,503
Data volume       476.8 GB       476.8 GB
Unique views      12,038         12,034
Unique downloads  4,112          4,112


Cite as