A dataset of legal contracts with rich expert annotations
Contract Understanding Atticus Dataset (CUAD) v1 is a corpus of 13,000+ labels in 510 commercial legal contracts. The contracts were manually labeled under the supervision of experienced lawyers to identify 41 types of legal clauses considered important in contract review in connection with corporate transactions such as mergers and acquisitions.
CUAD is curated and maintained by The Atticus Project, Inc. to support NLP research and development in legal contract review.
Read the full CUAD v1 announcement here!
• 13,000+ labels
• 510 contracts
• 41 categories of clauses
Check out the performance results publication on arXiv here.
Check out the code for replicating the results and the trained model here.
Spencer P. Goodson
Alexander Kwonji Rosenberg
William R. Sawyers
Law Student Leaders
Sheetal Sharma Saran
Law Student Contributors
Technical Advisors & Contributors