Skip to content
Vol. I · No. 251
Mon · 8 Jun
A Daily Lexicon of Trustworthy Data
The Lexicon

005·76

data cleansing

/ˈdeɪ.tə ˈklenz.ɪŋ/ - n.

1 [colloq.] Repairing the data instead of the system that keeps breaking it, monthly, in perpetuity.Keep. Punchy.This is the problem.

Working definition

2. The correction or removal of inaccurate, incomplete, or improperly formatted records to bring a dataset within its quality requirements.

Evidence
See also
  • data profilingThe one-time exercise of learning what the data actually contains, scheduled for after launch.
  • remediationFixing the rows now and scheduling the cause for a quarter that does not arrive.
  • root cause analysisThe search for the underlying cause, which terminates the moment it reaches a team that can defend itself.