Skip to content
  • Home
  • Data Quality 3.0
    • Product Data Lake
      • Product Data Lake Documentation and Data Governance
      • Business Benefits
      • Become a Product Data Lake Commissioner
    • Big Data Quality
  • Master Data Share
    • Data Matching
      • Human interaction
      • Match Destinations
      • Match Techniques
      • Types of Party Master Data
    • MDM 101
  • About Henrik Gabs Liliendahl
    • My Been Done List
    • The Emperor’s new clothes
  • What I Do
    • Popular Offerings
      • Resources
    • MDM / PIM / DQM Tool Selection Consultancy

Liliendahl on Data Quality

A blog about Master Data Management, Product Information Management, Data Quality Management and more

Metadata

2021 Data Management Mind Map

12th February 2021Henrik Gabs LiliendahlLeave a comment

Disciplines come and go in the data management world. Here is a mind map of the disciplines on top of my mind today. Some of the disciplines goes back to the emerge of IT in the previous millennium and some have risen during the latest years.

Like Loading...
Data Architecture, Data Governance, Data Matching, Information Quality, Metadata, Multi-Domain MDM, Multienterprise MDM, Product Data Syndication, Product Information ManagementData masking, Data model, Data subsetting, DQM, MDM, Metadata, PIM, TDM

A Data Quality Mind Map

20th April 2019Henrik Gabs Liliendahl2 Comments

What is data quality anyway? This question has been touched many times on this blog.

Data Quality Mind Map

Data quality can be assessed using a range of data quality dimensions – the ones coloured green in the above mind map. These dimensions relate in different ways to various data domains as examined in the post Multi-Domain MDM and Data Quality Dimensions.

Data quality can be managed using a toolbox of sub disciplines – as the ones coloured turquoise in the above mind map. The reasons for data cleansing was discussed in the blog post Top 5 Reasons for Downstream Cleansing. Data profiling was visited in the post Data Quality Tools Revealed along with data matching. The relationship between data matching and identity resolution was recently described in the post Data Matching and Real-World Alignment.

The data quality discipline is closely related to – the yellow coloured – other disciplines as data modelling, Reference Data Management (RDM), Master Data Management (MDM), metadata management and – if not a sub discipline of – data governance as also shown in the post A Data Management Mind Map.

Like Loading...
Data Matching, Information QualityData Governance, Data model, MDM, Metadata, RDM

A Data Management Mind Map

6th April 201918th April 2019Henrik Gabs LiliendahlLeave a comment

This blog is about Data Quality 3.0, Product Data Syndication Freedom, Multienterprise MDM – and many more data management topics.

These topics and the many more data management topics I have been around looks like the mind map below:

Data Management

If I can be of any help to you in the data management realm, here are some Popular Offerings.

Like Loading...
Data Architecture, Data Matching, Data Quality Tools, Information Quality, Master Data, Product Data SyndicationData Governance, Data masking, Data model, Data subsetting, MDM, Metadata, PIM, Privacy

What are they doing?

19th August 201019th August 2010Henrik Gabs Liliendahl12 Comments

A core attribute in customer master data when dealing with business entities is assigning values for your customers/prospects industry vertical (or Line-of-Business or market segment or whatever metadata name you like).

When handling this particular data element you will come across many of the classic different options in data and information management.

Unstructured versus structured

Many early CRM (Customer Relationship Management) implementations offered a free text field for the industry vertical. While this approach may have been good for the free flow in data entry it of course has created havoc when business intelligence was applied to the CRM data. Countless cleansing projects have been done (and is going on) around in order to fix this basic mistake.

Most data entry forms today having an industry vertical value has a value list to choose from.

Your list versus an external standard

When having a value list it may be a list of your own creation or be based on an external standard list, for example SIC or NACE codes.

Having a list of your own tends to fulfill the data quality principle of fit for purpose of use while an external standard tends to fulfill the data quality principle of reflecting the real world construct.  

The main weaknesses of a list of your own are that it requires continuous manual based maintenance and may cause conflicts.  Deep down into a discussion on the Initiate MDM blog Julian Schwarzenbach offered a good example saying:

“I have also come across ‘flip-flop’ data – which is typically subjective data where two users cannot agree what the correct value is and it keeps getting changed between two values. This could be the classification of a customer by market sector where two different territories are reflecting different capabilities in their territories.” – Link here.

The main weaknesses of an external standard are that they seldom offer the granularity you need and for global data the different standards (SIC versions and different national NACE implementations and others) are a pain in the…

One versus several values

Many companies have more than one distinct activity. Catching only one (the primary) value for each company is keeping it simple, stupid. Having more than one value in relevant cases is adding complexity but may lead to better decisions.

Bookmark and Share

55.580294
12.282991
Like Loading...
Big Reference Data, Data Architecture, Data Governance, Master DataB2B, CRM, Data model, Fit for purpose, Metadata, Real world alignment, Standardization

Football is FIFA

11th May 20102nd September 2010Henrik Gabs LiliendahlLeave a comment

Today we are only one month from the start of the biggest single-sport event in the world this year: The 2010 FIFA World Cup taking place in South Africa.

Now, shouldn’t the name be The Football World Cup?

Well, the problem is that football is a different game in some parts of the world like football is considered what is American Football in Northern America and Australian Football down under. The football we now in most other parts of the world is known as soccer in these areas. Association Football is the technically correct name, which is also why the acronym FIFA is an abbreviation of Fédération Internationale de Football Association which is French for International Federation of Association Football.

So, to avoid confusion the FIFA World Cup is the common – and official – name of the event.

Such naming difficulties are a very common source of information quality issues. In my work with global party master data I meet the naming issue daily – or on a daily basis as some might put it. Examples:

  • The first name is the family name in some parts of the world – so given name is a better term
  • ZIP code is technically only the US system – so postal code is a better term
  • SSN (Social Security Number) is only used in some countries. National identification number is used on Wikipedia, but I also like Citizen ID as a national identification number also might apply to companies.

The discipline concerning with unique naming of data is called Metadata Management – or Meta Data Management by some.

Bookmark and Share

55.580294
12.282991
Like Loading...
Metadata, SportDiversity, Metadata

Meterencedata

26th April 201026th April 2010Henrik Gabs Liliendahl2 Comments

Today I will like to invent a new word.

The word ”Meterencedata” is a combination of the two terms:

  • Metadata and
  • Reference Data

Metadata is data about data. Roughly spoken; in relation to databases and spreadsheets metadata describes what is in the columns.

Reference Data are high level value lists that categorize the data. Roughly spoken; in relation to databases and spreadsheets reference data explains what is in the rows.

Data Management activities – like Data Quality improvement, Master Data Management and Data Migration – will be (and have I seen are) like working in the dark if you don’t know the Metadata – and the Reference Data.

Data Models may look different. Some information may be understood through metadata in a model but through reference data in another model.

Example:

  • In one data model there are three columns in a customer table with corresponding describing metadata for:
    • Fixed line telephone number
    • Cell phone number
    • Fax number
  • In another data model there are a phone type reference table explaining the values in a separate phone table under (as a child to) the customer table having the columns:
    • Phone type
    • Phone number

In the latter case the original phone types may have been the classic fixed line, cell and fax but the entries may have been extended over time as the real world changes. This model also reflects the reality of several same type numbers attached to a single party.

Conclusion: One man’s Metadata is another man’s Reference Data as you don’t meet and mete out the data equal ways.

55.580294
12.282991
Like Loading...
Data Architecture, Data Governance, MetadataData model, MDM, Metadata, Migration

Perfect Wrong Answer

9th January 20109th May 2010Henrik Gabs Liliendahl4 Comments

If you ask me the question ”How many people live in your town?” I could give you a correct answer being 5,000 % besides what you are looking for.

I live in Greve Municipality in Denmark. Population close to 48,000. Greve is a suburb south of Copenhagen. According to Wikipedia Copenhagen urban area has a population of 1.2 million and Copenhagen metro area has a population of 1.9 million people.

The Copenhagen metro area stretches from 40 km (20 miles) south of the city to 40 km (20 miles) north at Elsinore and Kronborg Castle (immortalized in Shakespeare’s Hamlet – always remember to include Shakespeare in a blog).

Further more: From Copenhagen you can look across the water to the east seeing Sweden and the city Malmoe. The Copenhagen-Malmoe bi-national urban agglomeration has a total population of 2.5 million people.

The real data quality issue in my initial question is not the precision, validity and timeliness in the number given in the answer but the shared understanding of the label attached to the number.

I noticed that Wikipedia has developed a good metadata habit when stating town populations giving 3 distinct labels: City, Urban and Metro.

55.580294
12.282991
Like Loading...
Data Governance, MetadataCopenhagen, Fit for purpose, Metadata, Shakespeare, Single version of the truth, The world

The Disruptive MDM / PIM / DQM List

Register your innovative solution on the disruptive list of MDM/PIM/DQM/DAM solutions here

Select your MDM / PIM / DQM solution

Get a free solution ranking based on your context, scope and requirements here

Enter your email address to follow this blog and receive notifications of new posts by email.

Join 704 other subscribers

Let product data flow easily between trading partners

Product Data Lake: Put and Take

Product Data Lake: Put and Take

Categories

  • Big Reference Data (158)
  • Business Process Management (36)
  • Data Architecture (256)
  • Data Governance (157)
  • Data Matching (221)
    • Business Directories (24)
    • Identity Resolution (61)
    • Survivorship (10)
  • Data Profiling (16)
  • Data Quality Tools (180)
  • Data Quality World Tour (19)
  • Direct Marketing (19)
  • Environmental Data Management (2)
  • Information Quality (156)
  • Master Data (594)
    • Hierarchy Management (39)
    • Multi-Domain MDM (127)
    • Multienterprise MDM (27)
    • National ID (22)
    • Social MDM (60)
  • Metadata (25)
  • Product Data Syndication (51)
  • Product Information Management (27)
  • Search and navigation (15)
  • Service Oriented Architecture (12)
  • Social Media (78)
  • Sport (16)
  • Supposed to be a joke (66)
  • X-mas (20)

Data Governance Training

Getting Started in Data Governance online training from Nicola Askham is available here.

Recent Posts

  • Balancing the Business Partner / Party Concept
  • The Intersection of Data Observability, MDM and Data Quality
  • Product vs Material vs Article vs Item
  • Data Matching Efficiency
  • The Intersection Between MDM, PIM and ESG
  • The 4 Best Emerging Modern Data Quality Tools for 2024
  • Modern Data Quality at Scale using Digna
  • Three Essential Trends in Data Management for 2024
  • Modern Data Quality: Navigating the Landscape
  • From Platforms to Ecosystems

eLearningCurve

Have a look at the eLearningCurve course on data parsing, data matching and de-duplication here.

Recent Comments

Henrik Gabs Liliendahl's avatarHenrik Gabs Lilienda… on Balancing the Business Partner…
Jeppe Thing Sørensen's avatarJeppe Thing Sørensen on Balancing the Business Partner…
peolsolutions's avatarpeolsolutions on MDM, Cloud, SaaS, PaaS, IaaS a…
Henrik Gabs Liliendahl's avatarHenrik Gabs Lilienda… on Is the Holiday Season called C…
Michael D.'s avatarMichael D. on Is the Holiday Season called C…
Jay Ram's avatarJay Ram on The Disruptive MDM List is…
Henrik Gabs Liliendahl's avatarHenrik Gabs Lilienda… on The Intersection of Data Obser…
Shanker's avatarShanker on The Intersection of Data Obser…
Bhavani Shanker's avatarBhavani Shanker on Data Matching Efficiency
Henrik Gabs Liliendahl's avatarHenrik Gabs Lilienda… on Data Matching Efficiency
Bhavani Shanker's avatarBhavani Shanker on Data Matching Efficiency
Henrik Gabs Liliendahl's avatarHenrik Gabs Lilienda… on From Platforms to Ecosyst…
Michael Fieg's avatarMichael Fieg on From Platforms to Ecosyst…
Unknown's avatarFrom Platforms to Ec… on What is Collaborative Product…
Unknown's avatarFrom Platforms to Ec… on MDM and Knowledge Graph

Doing IT The Toyota Way

Addresses AI B2B B2C Big data Business ecosystems Business intelligence Business processes Business rules CDI CDP Cleansing Compliance Copenhagen CRM Cuisine D&B DAM Data Governance Data model DataQualityPro Digitalization Diversity DQM Duplicates ecommerce ERP Evolution Facebook Fit for purpose Fraud Fuzzy logic Gartner GDPR Geocode Graph Hans Christian Andersen Happy databases History Household Instant Data Quality IoT Laissez faire LEI LinkedIn London Mashup MDM MDMDG Metadata Migration Multi-Channel National identification number OCDQ Online Offline Open data People PIM Prevention Privacy RDM Real world alignment ROI Shakespeare Single version of the truth SOA components Standardisation Standardization Syndication Technology The cloud The world Twitter User involvement Who what where when

Blog stats

  • 628,744 visits

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Join 704 other subscribers
Blog at WordPress.com.
  • Subscribe Subscribed
    • Liliendahl on Data Quality
    • Join 704 other subscribers
    • Already have a WordPress.com account? Log in now.
    • Liliendahl on Data Quality
    • Subscribe Subscribed
    • Sign up
    • Log in
    • Report this content
    • View site in Reader
    • Manage subscriptions
    • Collapse this bar
 

Loading Comments...
 

    Privacy & Cookies: This site uses cookies. By continuing to use this website, you agree to their use.
    To find out more, including how to control cookies, see here: Our Cookie Policy
    %d