Skip to content
Vol. I · No. 251
Mon · 8 Jun
A Daily Lexicon of Trustworthy Data
No. 248
248·01 · Definition DriftNo. 248 · 28 May 2026 · 2 min

The data contract grew up. It stopped being a schema and started being a governance control.

An engineering convenience became a control point. A control point needs a controller.

PromotedThe EditorSignal, Not Theater

A contract is a promise about the shape of data. The interesting question is never the shape. It is who has to keep the promise when the column changes underneath it.

What happened: the data contract has migrated from a build-time nicety to a governance instrument. dbt's model contracts now refuse to build a model whose output does not match the declared columns, types, and constraints — the agreement is checked before data is written, not after it breaks a dashboard. The practitioners who developed the pattern, Andrew Jones (who published it in 2021) and Chad Sanderson (who reframed it as governance-as-code), describe the same move: pin the expectation, enforce it at the source.

Why it matters: enforcement relocates accountability. A test that fails after the fact files a complaint; a contract that fails the build assigns a debt to whoever owns the producing model. Buying a tool that breaks the build is one purchase order. Deciding who is blamed when it does is a commitment nobody volunteered for.

What it reveals about the field: we are good at the mechanical half of trust — schema, types, merge gate — and shy about the social half. A contract encodes what the data means and who answers for it. With no owner, it points at an empty chair. The drift moved from the data to the org chart.

What to watch: watch where the contract is owned versus written. Consumer-defined, producer-owned is Sanderson's design; the failure mode is a contract a downstream team authored that no upstream engineer agreed to honor. When the first breaks a build, the argument over who fixes it is the real governance program. The rest was tooling.

The takeaway

A data contract only governs once a named person owns the definition it encodes. Until then it is a build step with opinions.

The claim, mapped
  1. dbt model contracts cause a model to fail to build when its output does not match the declared columns, types, and constraints, checking the agreement before data is written.

    supports01
  2. Data contracts are an architectural pattern published by Andrew Jones in 2021 that pairs data producers and consumers and centers on quality.

    supports02
  3. Chad Sanderson frames data contracts as governance-as-code that places accountability with data producers and solves quality at the source.

    supports03
  4. Industry framing treats data contracts as a governance mechanism establishing ownership and expectations between producers and consumers.

    supports0304
Sources
01
dbt Developer Hub (dbt Labs) — Model contracts2025-09-01 · Tier 1 · primaryA model's contract defines the shape of the returned dataset; if the model's logic or input data does not conform to that shape, the model does not build.
02
andrew-jones.com — Data Contracts — the book. Out now!2023-09-01 · Tier 2 · practitionerJones describes data contracts as a pattern (first published April 2021) where data generators and consumers work closely as partners and focus on quality rather than quantity.
03
Data Products (Chad Sanderson, Substack) — The Rise of Data Contracts2022-08-22 · Tier 2 · practitionerSanderson frames contracts as API-like agreements that solve quality at the source and ties them to producer accountability: "It's hard to take accountability without awareness."
04
Elementary Data — Data Contracts: Ensuring Data Quality Upstream2024-01-01 · Tier 4 · vendorFrames data contracts as agreements between producers and consumers that clarify expectations about how data should be produced and used — a governance and accountability mechanism.
Mark this entry
Marginalia · 0 notes

No notes yet. The margin is open.

Sign in to add a note. The margin is moderated — we keep it useful, not cruel.

Related entries
Owner Missing
Who Owns the Event Definition? The Registry Knows; the Org Doesn't

A compatibility check can block a breaking change. It can't name who's allowed to make one.

Definition Drift
Everyone Agreed on the YAML. Nobody Agreed on the Owner.

A real standard for data contracts now exists. The argument it was supposed to settle has simply moved up a layer.

Definition Drift
Your Pipeline Learned to Call Yesterday's Breakage Normal

Anomaly detection now defines 'good' for you. It defines it as 'whatever usually happens.'