Open-Source Datasets

Explore Our Datasets

Three landmark legal AI datasets — free, open-source, and built by attorneys. Choose the one that fits your work.

What Are These Datasets?

The Atticus Project creates free, open-source datasets that teach AI systems to understand contracts. Each dataset contains real contracts and expert-labeled examples, allowing researchers and companies to build better legal AI tools. All datasets are released under CC BY 4.0 — completely free to download, use, and share.

Which dataset is right for you?

  • CUAD: Working with commercial contracts? CUAD teaches AI to identify 41 different clause types across purchase agreements, NDAs, employment contracts, and more.
  • MAUD: Focused on M&A? MAUD covers 92 deal points across 152 real merger agreements — helping AI understand acquisition documents.
  • ACORD: Building clause search or extraction tools? ACORD has 126,000+ expert-rated query-clause pairs for training semantic search models.

Filter by Category

Showing all 58 clause types

CUAD

Contract Title

Document Name · Contract Basics

The official name or title of the contract document itself — used for identification and filing.

Software License Agreement

View CUAD Dataset →
CUADHigh Priority

Who's Signing This?

Parties · Contract Basics

The names and roles of each signing party — clearly identifies who the contract is between.

This Agreement is entered into between Acme Corp ('Company') and Dev Studios Inc ('Contractor').

View CUAD Dataset →
CUADHigh Priority

When Did They Sign?

Agreement Date · Contract Basics

The date the contract was signed or officially executed — important for legal validity and enforceability.

This Agreement is dated as of January 15, 2024.

View CUAD Dataset →
CUADHigh Priority

When Does It Start?

Effective Date · Contract Basics

The date when rights and obligations under the contract actually begin — may be different from the signature date.

This Agreement shall be effective as of the date first written above.

View CUAD Dataset →
CUADHigh Priority

When Does It End?

Expiration Date · Contract Basics

The date when the contract's initial term ends — after which it may expire, renew, or require renegotiation.

Unless earlier terminated, this Agreement shall continue for a period of one (1) year from the Effective Date.

View CUAD Dataset →
CUADHigh Priority

Either Side Can End the Contract

Termination for Convenience · Termination

Lets one or both parties end the contract at any time, without needing a specific reason — usually with advance notice.

Either party may terminate this Agreement upon thirty (30) days' written notice to the other party.

View CUAD Dataset →
CUADHigh Priority

Auto-Renewal Clause

Renewal Term · Payments & Term

The contract automatically extends for another period unless someone takes action to cancel it before the deadline.

This Agreement shall automatically renew for successive one-year terms unless either party provides sixty (60) days' written notice of non-renewal.

View CUAD Dataset →
CUAD

Notice Requirements

Notice Period to Terminate Renewal · Termination

Specifies how far in advance you must notify the other party if you don't want the contract to renew.

Either party wishing to terminate must provide written notice at least ninety (90) days prior to the end of the then-current term.

View CUAD Dataset →
CUAD

Services Continue After Exit

Post-Termination Services · Termination

Any services or obligations that must continue even after the contract officially ends — such as returning property or transitional support.

Following termination, Vendor shall provide transition services for thirty (30) days at no additional charge.

View CUAD Dataset →

Showing 9 of 58 results

Download the Data

All datasets are free and open-source under CC BY 4.0. Download directly from GitHub or Hugging Face.

CUAD

NeurIPS 2021

510 commercial contracts labeled across 41 clause types.

MAUD

EMNLP 2023

152 merger agreements annotated across 92 M&A deal points.

ACORD

ACL 2025

126,000+ expert-rated query-clause pairs for semantic retrieval.

Get in Touch

Questions about the data?

Whether you're a researcher looking to collaborate, a developer building on our datasets, or an attorney curious about how this work applies to your practice — we'd love to hear from you.

Contact Us