Published October 5, 2020 | Version beta
Dataset Open

Atticus Open Contract Dataset (AOK) (beta)

  • The Atticus Project

Description

Atticus Open Contract Dataset (AOK) (beta) is a corpus of 200 commercial legal contracts containing 5,000+ labels. Legal experts manually labeled the contracts to identify 40 types of clauses that are important during contract review in connection with corporate transactions, such as mergers and acquisitions, IPOs, and corporate financings.

The AOK dataset is curated and maintained by The Atticus Project, Inc., a non-profit organization, to support NLP research and development in legal contract review.

If you download this dataset, we'd love to know more about you and your project! Please fill out this short form: https://forms.gle/h47GUENTTbBqH39m7.

Check out our website at atticusprojectai.org.

Update: The expanded 1.0 version of the dataset is available at https://zenodo.org/record/4595826

Files

aok_beta.zip (39.2 MB)
md5:8edd59277d0d5643a13ccfd754752426
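
The MD5 checksum listed above can be used to confirm that a download completed without corruption. The following is a minimal sketch, not an official tool of The Atticus Project. It assumes the archive was saved locally as aok_beta.zip in the current directory and that the listed checksum refers to that file. The helper names md5_of_file and verify_download are hypothetical.

```python
import hashlib
from pathlib import Path

# Checksum as listed in the Files section of this record.
EXPECTED_MD5 = "8edd59277d0d5643a13ccfd754752426"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops when read() returns empty bytes (EOF).
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    archive = Path("aok_beta.zip")  # assumed local file name
    if not archive.exists():
        print(f"{archive} not found; download it first")
    elif verify_download(archive):
        print("checksum OK")
    else:
        print("checksum mismatch; try downloading again")
```

Reading in chunks keeps memory use small regardless of archive size. An MD5 match indicates an intact transfer only; it is not a security guarantee.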