Skip to content
  • Home
  • Data Quality 3.0
    • Product Data Lake
      • Product Data Lake Documentation and Data Governance
      • Business Benefits
      • Become a Product Data Lake Commissioner
    • Big Data Quality
  • Master Data Share
    • Data Matching
      • Human interaction
      • Match Destinations
      • Match Techniques
      • Types of Party Master Data
    • MDM 101
  • About Henrik Gabs Liliendahl
    • My Been Done List
    • The Emperor’s new clothes
  • What I Do
    • Popular Offerings
      • Resources
    • MDM / PIM / DQM Tool Selection Consultancy

Liliendahl on Data Quality

A blog about Master Data Management, Product Information Management, Data Quality Management and more

Metadata

2021 Data Management Mind Map

12th February 2021Henrik Gabs LiliendahlLeave a comment

Disciplines come and go in the data management world. Here is a mind map of the disciplines on top of my mind today. Some of the disciplines goes back to the emerge of IT in the previous millennium and some have risen during the latest years.

Like this:

Like Loading...
Data Architecture, Data Governance, Data Matching, Information Quality, Metadata, Multi-Domain MDM, Multienterprise MDM, Product Data Syndication, Product Information ManagementData masking, Data model, Data subsetting, DQM, MDM, Metadata, PIM, TDM

A Data Quality Mind Map

20th April 2019Henrik Gabs Liliendahl2 Comments

What is data quality anyway? This question has been touched many times on this blog.

Data Quality Mind Map

Data quality can be assessed using a range of data quality dimensions – the ones coloured green in the above mind map. These dimensions relate in different ways to various data domains as examined in the post Multi-Domain MDM and Data Quality Dimensions.

Data quality can be managed using a toolbox of sub disciplines – as the ones coloured turquoise in the above mind map. The reasons for data cleansing was discussed in the blog post Top 5 Reasons for Downstream Cleansing. Data profiling was visited in the post Data Quality Tools Revealed along with data matching. The relationship between data matching and identity resolution was recently described in the post Data Matching and Real-World Alignment.

The data quality discipline is closely related to – the yellow coloured – other disciplines as data modelling, Reference Data Management (RDM), Master Data Management (MDM), metadata management and – if not a sub discipline of – data governance as also shown in the post A Data Management Mind Map.

Like this:

Like Loading...
Data Matching, Information QualityData Governance, Data model, MDM, Metadata, RDM

A Data Management Mind Map

6th April 201918th April 2019Henrik Gabs LiliendahlLeave a comment

This blog is about Data Quality 3.0, Product Data Syndication Freedom, Multienterprise MDM – and many more data management topics.

These topics and the many more data management topics I have been around looks like the mind map below:

Data Management

If I can be of any help to you in the data management realm, here are some Popular Offerings.

Like this:

Like Loading...
Data Architecture, Data Matching, Data Quality Tools, Information Quality, Master Data, Product Data SyndicationData Governance, Data masking, Data model, Data subsetting, MDM, Metadata, PIM, Privacy

What are they doing?

19th August 201019th August 2010Henrik Gabs Liliendahl12 Comments

A core attribute in customer master data when dealing with business entities is assigning values for your customers/prospects industry vertical (or Line-of-Business or market segment or whatever metadata name you like).

When handling this particular data element you will come across many of the classic different options in data and information management.

Unstructured versus structured

Many early CRM (Customer Relationship Management) implementations offered a free text field for the industry vertical. While this approach may have been good for the free flow in data entry it of course has created havoc when business intelligence was applied to the CRM data. Countless cleansing projects have been done (and is going on) around in order to fix this basic mistake.

Most data entry forms today having an industry vertical value has a value list to choose from.

Your list versus an external standard

When having a value list it may be a list of your own creation or be based on an external standard list, for example SIC or NACE codes.

Having a list of your own tends to fulfill the data quality principle of fit for purpose of use while an external standard tends to fulfill the data quality principle of reflecting the real world construct.  

The main weaknesses of a list of your own are that it requires continuous manual based maintenance and may cause conflicts.  Deep down into a discussion on the Initiate MDM blog Julian Schwarzenbach offered a good example saying:

“I have also come across ‘flip-flop’ data – which is typically subjective data where two users cannot agree what the correct value is and it keeps getting changed between two values. This could be the classification of a customer by market sector where two different territories are reflecting different capabilities in their territories.” – Link here.

The main weaknesses of an external standard are that they seldom offer the granularity you need and for global data the different standards (SIC versions and different national NACE implementations and others) are a pain in the…

One versus several values

Many companies have more than one distinct activity. Catching only one (the primary) value for each company is keeping it simple, stupid. Having more than one value in relevant cases is adding complexity but may lead to better decisions.

Bookmark and Share

55.580294
12.282991

Like this:

Like Loading...
Big Reference Data, Data Architecture, Data Governance, Master DataB2B, CRM, Data model, Fit for purpose, Metadata, Real world alignment, Standardization

Football is FIFA

11th May 20102nd September 2010Henrik Gabs LiliendahlLeave a comment

Today we are only one month from the start of the biggest single-sport event in the world this year: The 2010 FIFA World Cup taking place in South Africa.

Now, shouldn’t the name be The Football World Cup?

Well, the problem is that football is a different game in some parts of the world like football is considered what is American Football in Northern America and Australian Football down under. The football we now in most other parts of the world is known as soccer in these areas. Association Football is the technically correct name, which is also why the acronym FIFA is an abbreviation of Fédération Internationale de Football Association which is French for International Federation of Association Football.

So, to avoid confusion the FIFA World Cup is the common – and official – name of the event.

Such naming difficulties are a very common source of information quality issues. In my work with global party master data I meet the naming issue daily – or on a daily basis as some might put it. Examples:

  • The first name is the family name in some parts of the world – so given name is a better term
  • ZIP code is technically only the US system – so postal code is a better term
  • SSN (Social Security Number) is only used in some countries. National identification number is used on Wikipedia, but I also like Citizen ID as a national identification number also might apply to companies.

The discipline concerning with unique naming of data is called Metadata Management – or Meta Data Management by some.

Bookmark and Share

55.580294
12.282991

Like this:

Like Loading...
Metadata, SportDiversity, Metadata

Meterencedata

26th April 201026th April 2010Henrik Gabs Liliendahl2 Comments

Today I will like to invent a new word.

The word ”Meterencedata” is a combination of the two terms:

  • Metadata and
  • Reference Data

Metadata is data about data. Roughly spoken; in relation to databases and spreadsheets metadata describes what is in the columns.

Reference Data are high level value lists that categorize the data. Roughly spoken; in relation to databases and spreadsheets reference data explains what is in the rows.

Data Management activities – like Data Quality improvement, Master Data Management and Data Migration – will be (and have I seen are) like working in the dark if you don’t know the Metadata – and the Reference Data.

Data Models may look different. Some information may be understood through metadata in a model but through reference data in another model.

Example:

  • In one data model there are three columns in a customer table with corresponding describing metadata for:
    • Fixed line telephone number
    • Cell phone number
    • Fax number
  • In another data model there are a phone type reference table explaining the values in a separate phone table under (as a child to) the customer table having the columns:
    • Phone type
    • Phone number

In the latter case the original phone types may have been the classic fixed line, cell and fax but the entries may have been extended over time as the real world changes. This model also reflects the reality of several same type numbers attached to a single party.

Conclusion: One man’s Metadata is another man’s Reference Data as you don’t meet and mete out the data equal ways.

55.580294
12.282991

Like this:

Like Loading...
Data Architecture, Data Governance, MetadataData model, MDM, Metadata, Migration

Perfect Wrong Answer

9th January 20109th May 2010Henrik Gabs Liliendahl4 Comments

If you ask me the question ”How many people live in your town?” I could give you a correct answer being 5,000 % besides what you are looking for.

I live in Greve Municipality in Denmark. Population close to 48,000. Greve is a suburb south of Copenhagen. According to Wikipedia Copenhagen urban area has a population of 1.2 million and Copenhagen metro area has a population of 1.9 million people.

The Copenhagen metro area stretches from 40 km (20 miles) south of the city to 40 km (20 miles) north at Elsinore and Kronborg Castle (immortalized in Shakespeare’s Hamlet – always remember to include Shakespeare in a blog).

Further more: From Copenhagen you can look across the water to the east seeing Sweden and the city Malmoe. The Copenhagen-Malmoe bi-national urban agglomeration has a total population of 2.5 million people.

The real data quality issue in my initial question is not the precision, validity and timeliness in the number given in the answer but the shared understanding of the label attached to the number.

I noticed that Wikipedia has developed a good metadata habit when stating town populations giving 3 distinct labels: City, Urban and Metro.

55.580294
12.282991

Like this:

Like Loading...
Data Governance, MetadataCopenhagen, Fit for purpose, Metadata, Shakespeare, Single version of the truth, The world

The Disruptive MDM / PIM / DQM List

Register your innovative solution on the disruptive list of MDM/PIM/DQM/DAM solutions here

Select your MDM / PIM / DQM solution

Get a free solution ranking based on your context, scope and requirements here

Follow the blog on social media

  • LinkedIn
  • Twitter

Enter your email address to follow this blog and receive notifications of new posts by email.

Join 11,969 other subscribers

Let product data flow easily between trading partners

Categories

  • Big Reference Data (158)
  • Business Process Management (36)
  • Data Architecture (255)
  • Data Governance (156)
  • Data Matching (220)
    • Business Directories (24)
    • Identity Resolution (61)
    • Survivorship (10)
  • Data Profiling (16)
  • Data Quality Tools (176)
  • Data Quality World Tour (19)
  • Direct Marketing (19)
  • Information Quality (154)
  • Master Data (588)
    • Hierarchy Management (39)
    • Multi-Domain MDM (125)
    • Multienterprise MDM (26)
    • National ID (22)
    • Social MDM (60)
  • Metadata (24)
  • Product Data Syndication (49)
  • Product Information Management (26)
  • Search and navigation (15)
  • Service Oriented Architecture (12)
  • Social Media (78)
  • Sport (16)
  • Supposed to be a joke (66)
  • X-mas (20)

Data Governance Training

Getting Started in Data Governance online training from Nicola Askham is available here.

Recent Posts

  • Master Data Management in Financial Services
  • Which Data Management KPIs Should You Measure?
  • A Guide to Data Quality
  • Extended MDM Revisited
  • Data Governance Necessities
  • Three Augmented Data Management Flavors
  • 4 Key Aspects of Master Data Management in Manufacturing
  • The Business Value Behind Top 3 MDM Trends
  • 4 Concepts in the Gartner Hype Cycle for Digital Business Capabilities that will Shape MDM
  • A Guide to Master Data Management

eLearningCurve

Have a look at the eLearningCurve course on data parsing, data matching and de-duplication here.

Recent Comments

Asifa on Data Fabric and Master Data…
Henrik Gabs Lilienda… on Data Fabric and Master Data…
Asifa on Data Fabric and Master Data…
Direct And Indirect… on Direct Customers and Indirect…
Boeing’s Path… on Direct Customers and Indirect…
Hillary Boyle on MDM Tools Revealed
Henrik Gabs Lilienda… on Which Data Management KPIs Sho…
Gino Fortunato on Which Data Management KPIs Sho…
Henrik Gabs Lilienda… on A Guide to Data Quality
Jeppe Thing Sørensen on A Guide to Data Quality
Henrik Gabs Lilienda… on A Guide to Data Quality
Matthias on A Guide to Data Quality
Michael Fieg on A Guide to Master Data Ma…
Henrik Gabs Lilienda… on 2022 Data Management Pred…
Gino Fortunato on 2022 Data Management Pred…

Doing IT The Toyota Way

Addresses AI B2B B2C Big data Business ecosystems Business intelligence Business processes Business rules CDI CDP Cleansing Compliance Copenhagen CRM Cuisine D&B DAM Data Governance Data model DataQualityPro Digitalization Diversity DQM Duplicates ecommerce ERP Evolution Facebook Fit for purpose Fraud Fuzzy logic Gartner GDPR Geocode Graph Hans Christian Andersen Happy databases History Household Instant Data Quality IoT Laissez faire LEI LinkedIn London Mashup MDM MDMDG Metadata Migration Multi-Channel National identification number OCDQ Online Offline Open data People PIM Prevention Privacy RDM Real world alignment ROI Shakespeare Single version of the truth SOA components Standardisation Standardization Syndication Technology The cloud The world Twitter User involvement Who what where when

Blog stats

  • 537,782 visits

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Join 11,969 other subscribers
Blog at WordPress.com.
  • Follow Following
    • Liliendahl on Data Quality
    • Join 706 other followers
    • Already have a WordPress.com account? Log in now.
    • Liliendahl on Data Quality
    • Customize
    • Follow Following
    • Sign up
    • Log in
    • Report this content
    • View site in Reader
    • Manage subscriptions
    • Collapse this bar
 

Loading Comments...
 

    Privacy & Cookies: This site uses cookies. By continuing to use this website, you agree to their use.
    To find out more, including how to control cookies, see here: Our Cookie Policy
    %d bloggers like this: