The new face of Data Matching

28th August 200919th June 2010Henrik Gabs Liliendahl

When matching database records holding data about a person we traditionally use string attributes as Citizen/Tax ID, Name, Address, Phone, Email.

PolarRose Today I stumbled over a company called Polar Rose that specialize in recognition of peoples faces on pictures. Current use is tagging people on Facebook pictures, but really, this technology could make Data Matching, Identity Resolution and Deduplication better.

We already know fuzzy matching with names and addresses have plenty of challenges with false positives and false negatives. Surely I also do imaging same issues with facial recognition. But we also know from comparing with strings that the more different information we may gather, the better we are at avoiding false matching. So combining fuzzy string matching and facial recognition (where picture is available) could add more human mimic to matching technology reliability.

Right now I am considering whether to add this feature to Data Quality 2.0 or leave it for Data Quality 3.0.

ronald holtkamp 5th September 2009 / 08:51

Indeed this software exits and causes a true privacy and personal security issue. Also for this reason, face recognition, we recommand our clients to stay away from face book etc.
Regards, Ronald

Reply
Peter Went 6th September 2009 / 13:03

The more properties you have to match on, the better that match can be (in terms of false positives, but also in terms of false negatives). We, at WCC, refer to that concept as ‘multi modal’.

Multi modal not ncessarily implies the inclusion of biometrics into the mix, although it is commonly used in the industry to indicate a match on different biometrics.

We achieve very good results with applying multi modal fusion, in terms of quality improvement, but also in performance improvement. The latter is relevant in genuinely very large scale applications.

It is certainly true what Ronald writes, that it may be seen as privacy invasion. However, several companies work very hard, as we speak, on solving that issue, including priv-ID, Genkey and Anonymous Recognition.

As a last note on multi modal fusion, in the ideal world all properties are captured and are of good enough quality. In practice this typically is not the case. Multi modal fusion allows to take the available data of whatever quality and intelligently combine that to come up with the best possible matches.

Hope this is informative, Peter

Reply

	Henrik Gabs Lilienda… on Balancing the Business Partner…
	Jeppe Thing Sørensen on Balancing the Business Partner…
	peolsolutions on MDM, Cloud, SaaS, PaaS, IaaS a…
	Henrik Gabs Lilienda… on Is the Holiday Season called C…
	Michael D. on Is the Holiday Season called C…
	Jay Ram on The Disruptive MDM List is…
	Henrik Gabs Lilienda… on The Intersection of Data Obser…
	Shanker on The Intersection of Data Obser…
	Bhavani Shanker on Data Matching Efficiency
	Henrik Gabs Lilienda… on Data Matching Efficiency
	Bhavani Shanker on Data Matching Efficiency
	Henrik Gabs Lilienda… on From Platforms to Ecosyst…
	Michael Fieg on From Platforms to Ecosyst…
	From Platforms to Ec… on What is Collaborative Product…
	From Platforms to Ec… on MDM and Knowledge Graph

Liliendahl on Data Quality

A blog about Master Data Management, Product Information Management, Data Quality Management and more

The new face of Data Matching

Related

2 thoughts on “The new face of Data Matching”

Leave a comment Cancel reply