Reading a blog post by David Loshin called Data Governance and Quality: Data Reuse vs. Data Repurposing I was, perhaps a bit off topic, inspired to pose the question about if data are of high quality if they are:
- Fit for the purpose of use
- Fit for repurposing
The first definition has been around for many years and has been adapted by many data quality practitioners. I have however often encountered situations where the reuse of data for other purposes than the original purpose has raised data quality issues with else cleared data. One of my first pieces on my own blog discussed that challenge in a post called Fit for what purpose?
Not at least within master data management where data are maintained for multiple uses, this problem is very common.
Data in a master data hub may either:
- Be entered directly into the hub where multiple uses is handled
- Be loaded from other sources where data capture was done
In the latter case the data governance necessary to ensure fitness for multiple uses must stretch to the ingestion in these sources.
The only answer is probably real world alignment as discussed here on a page called Data Quality 3.0. Make sure your data are reflecting the real world as close as we can when captured and make sure data can be maintained in order to keep that alignment. And make sure this is done and facilitated where data are entered.