Skip to content
Vol. I · No. 251
Mon · 8 Jun
A Daily Lexicon of Trustworthy Data
No. 246
246·02 · Definition DriftNo. 246 · 26 May 2026 · 2 min

Master data management returns wearing an AI badge. The job description has not changed.

The models needed one definition of "customer." That was always the assignment.

PromotedThe EditorVocabulary Court

Master data management spent a decade as the slide everyone skipped. It is back on the agenda, and the reason given is that the models cannot work without one agreed definition of the things they reason about.

What happened is a rehabilitation, not a discovery. Master data management is the discipline of producing one trusted, semantically consistent view of the entities a business runs on, such as customer, product, and supplier, across systems that disagree about them. It was unglamorous, so it was deferred. Now the same definition is reintroduced as a precondition for model accuracy, which is the polite way of admitting it was a precondition for accuracy all along.

It matters because the failure mode survived the rebrand intact. A golden record exists to consolidate conflicting records from multiple systems into one version a business can act on. When five systems hold three definitions of "active customer," a model trained or grounded on that spread does not resolve the conflict. It averages it, then reports the average with composure.

What it reveals is the oldest pattern in the catalog: the organization would rather buy a capability than assign a definition. Matching and deduplication now arrive with machine-learning trim, useful for the mechanical work of linking records. It does not decide which record is correct. That is a stewardship decision, and stewardship has an owner or it does not exist.

What to watch is the order of operations. If the data-quality program is scoped after the model ships, the definition is being treated as a downstream cleanup task rather than the spec. The reveal is plain: the technology did not create the need for one definition of "customer." It just made the disagreement faster, and gave it a confidence interval.

The takeaway

A single definition was never a tooling deliverable. It is a decision someone owns, written down before the model reads it.

The claim, mapped
  1. MDM is the discipline of maintaining a single, semantically consistent view of core entities such as customer, product, and supplier across systems.

    supports01
  2. A golden record consolidates conflicting records from multiple systems into one trusted version, and modern matching can use machine-learning techniques.

    supports03
  3. Data quality, including consistency across sources, is foundational to model accuracy; inconsistency across systems is a data-quality problem.

    supports02
  4. Inconsistent definitions across systems are not resolved by a model grounded or trained on them; the disagreement persists.

    context0102
Sources
01
Wikipedia — Master data management2026-05-01 · Tier 4 · primaryDescribes MDM as a discipline ensuring uniformity, accuracy, semantic consistency and accountability of master data, aiming at a single version of the truth across systems.
02
TechTarget — Data quality in AI: 9 common issues and best practices2025-03-18 · Tier 3 · practitionerArgues data quality is foundational to AI because it directly affects model accuracy and reliability, and that inconsistency across multiple sources signals a data-quality problem.
03
Profisee — What Is a Golden Record in MDM?2024-01-01 · Tier 4 · vendorDefines the golden record as one complete, accurate version accessible across the business, surfacing conflicting records so teams can verify the right data; notes matching techniques assist consolidation.
Mark this entry
Marginalia · 0 notes

No notes yet. The margin is open.

Sign in to add a note. The margin is moderated — we keep it useful, not cruel.

Related entries
Definition Drift
The data contract grew up. It stopped being a schema and started being a governance control.

An engineering convenience became a control point. A control point needs a controller.

Business Sense Required
Retrieval-augmented generation is a data-quality project nobody scoped.

The retriever inherits every undefined term in the corpus. The model just reads it aloud.

Platform Tunnel Vision
The Company That Sells the Single Source Becomes a Row in Someone Else's

Salesforce closed its purchase of Informatica. The master-data vendor is now master data.