One of my specialties in Data Quality is Data Matching.
Data Matching is about linking – and eventually also splitting before linking – entities in or between databases, where these entities are not already linked with unique keys.
Matching party data – names and addresses – is the most common area where Data Matching is practiced and a lot of tools are available. Matching includes:
- Match with external reference data and
- Identity Resolution.
Please find more detailed information here:
The sub disciplines of data matching are further described in posts about:
Product directory matching and deduplication is also widespread. An example is described in the post Matching Light Bulbs.
I was a guest on OCDQ Radio explaining the art of data matching.
Data Matching may be deployed in some different ways, where I have been involved in the ones described in the post Deploying Data Matching.
If you are interested in Data Matching you may join the LinkedIn group on this matter – as the group manager I will be happy to approve your membership. Join here.
Also, I am co-author of the eLearningCurve course about Data Parsing, Matching and De-duplication.
The future of data matching is explained in my Informatica Perspectives blog post called Five Future Data Matching Trends.
These trends are further examined in the posts: