Skip to content
Vol. I · No. 251
Mon · 8 Jun
A Daily Lexicon of Trustworthy Data
The Lexicon

005·75

late-arriving data

/leɪt ə.ˈraɪ.vɪŋ ˈdeɪ.tə/ - n.

1 [colloq.] Data that shows up after the meeting that needed it, with a perfectly good explanation no one waited to hear.Keep. Punchy.This is the problem.

Working definition

2. Records that reach the pipeline after the time window they belong to has already been processed, requiring late updates or reprocessing.

Filed
See also
  • backfillRe-running history to match the present, and discovering history had a different opinion about what 'customer' meant.
  • data freshness slaA promise about how recent the data is, signed by no one and measured by whoever is annoyed first.
  • replayLiving the same afternoon again, this time with the join written correctly.
  • watermarkA single stored number that is the difference between an incremental load and a four-hour incident.