006·324
model evaluation
/ˈmɒdəl ɪˌvæljuˈeɪʃən/ - n.
1 [colloq.] The benchmark the model passes, chosen after seeing which benchmark it passes.Keep. Punchy.This is the problem.
Working definition
2. The structured measurement of a model's accuracy, calibration, and failure modes against a held-out reference.