Skip to content
  • Home
  • Data Quality 3.0
    • Product Data Lake
      • Product Data Lake Documentation and Data Governance
      • Business Benefits
      • Become a Product Data Lake Commissioner
    • Big Data Quality
  • Master Data Share
    • Data Matching
      • Human interaction
      • Match Destinations
      • Match Techniques
      • Types of Party Master Data
    • MDM 101
  • About Henrik Gabs Liliendahl
    • My Been Done List
    • The Emperor’s new clothes
  • What I Do
    • Popular Offerings
      • Resources
    • MDM / PIM / DQM Tool Selection Consultancy

Liliendahl on Data Quality

A blog about Master Data Management, Product Information Management, Data Quality Management and more

Metadata

Metadata Meatballs

21st February 201116th November 2013Henrik Gabs Liliendahl5 Comments

I can’t help making analogies between data quality and food and drink even that I am actually not on any kind of diet these days.

Today’s subject is the similarities between metadata and meatballs.

Metadata is loosely defined as data about data. Some data describing what is meant to be in a dataset and a data element, what the purpose is and what standards are used.

The problem with metadata is if everybody understands the same when you use a certain term when creating metadata. Despite best intensions there will probably always be someone, somewhere getting something different from your wordings.

Frikadeller

That’s where meatballs come into the context.

If you read the article about meatballs on Wikipedia you’ll get the picture. Yes, meatballs have some common characteristics around the world. Some minced meat (or fish (if not vegetarian style)) mixed with some additional ingredients exposed to heat in some way and served with something different depending on where on earth you are.

Having a metadata repository is good for data and information quality.

The challenge in filling out a metadata repository is the balancing between describing how meatballs should be (your mom’s recipe) and how meatballs could be.

Bookmark and Share

55.646340 12.549496

Like this:

Like Loading...
MetadataCuisine, Diversity

Football is FIFA

11th May 20102nd September 2010Henrik Gabs LiliendahlLeave a comment

Today we are only one month from the start of the biggest single-sport event in the world this year: The 2010 FIFA World Cup taking place in South Africa.

Now, shouldn’t the name be The Football World Cup?

Well, the problem is that football is a different game in some parts of the world like football is considered what is American Football in Northern America and Australian Football down under. The football we now in most other parts of the world is known as soccer in these areas. Association Football is the technically correct name, which is also why the acronym FIFA is an abbreviation of Fédération Internationale de Football Association which is French for International Federation of Association Football.

So, to avoid confusion the FIFA World Cup is the common – and official – name of the event.

Such naming difficulties are a very common source of information quality issues. In my work with global party master data I meet the naming issue daily – or on a daily basis as some might put it. Examples:

  • The first name is the family name in some parts of the world – so given name is a better term
  • ZIP code is technically only the US system – so postal code is a better term
  • SSN (Social Security Number) is only used in some countries. National identification number is used on Wikipedia, but I also like Citizen ID as a national identification number also might apply to companies.

The discipline concerning with unique naming of data is called Metadata Management – or Meta Data Management by some.

Bookmark and Share

55.580294 12.282991

Like this:

Like Loading...
Metadata, SportDiversity, Metadata

Meterencedata

26th April 201026th April 2010Henrik Gabs Liliendahl2 Comments

Today I will like to invent a new word.

The word ”Meterencedata” is a combination of the two terms:

  • Metadata and
  • Reference Data

Metadata is data about data. Roughly spoken; in relation to databases and spreadsheets metadata describes what is in the columns.

Reference Data are high level value lists that categorize the data. Roughly spoken; in relation to databases and spreadsheets reference data explains what is in the rows.

Data Management activities – like Data Quality improvement, Master Data Management and Data Migration – will be (and have I seen are) like working in the dark if you don’t know the Metadata – and the Reference Data.

Data Models may look different. Some information may be understood through metadata in a model but through reference data in another model.

Example:

  • In one data model there are three columns in a customer table with corresponding describing metadata for:
    • Fixed line telephone number
    • Cell phone number
    • Fax number
  • In another data model there are a phone type reference table explaining the values in a separate phone table under (as a child to) the customer table having the columns:
    • Phone type
    • Phone number

In the latter case the original phone types may have been the classic fixed line, cell and fax but the entries may have been extended over time as the real world changes. This model also reflects the reality of several same type numbers attached to a single party.

Conclusion: One man’s Metadata is another man’s Reference Data as you don’t meet and mete out the data equal ways.

55.580294 12.282991

Like this:

Like Loading...
Data Architecture, Data Governance, MetadataData model, MDM, Metadata, Migration

Perfect Wrong Answer

9th January 20109th May 2010Henrik Gabs Liliendahl4 Comments

If you ask me the question ”How many people live in your town?” I could give you a correct answer being 5,000 % besides what you are looking for.

I live in Greve Municipality in Denmark. Population close to 48,000. Greve is a suburb south of Copenhagen. According to Wikipedia Copenhagen urban area has a population of 1.2 million and Copenhagen metro area has a population of 1.9 million people.

The Copenhagen metro area stretches from 40 km (20 miles) south of the city to 40 km (20 miles) north at Elsinore and Kronborg Castle (immortalized in Shakespeare’s Hamlet – always remember to include Shakespeare in a blog).

Further more: From Copenhagen you can look across the water to the east seeing Sweden and the city Malmoe. The Copenhagen-Malmoe bi-national urban agglomeration has a total population of 2.5 million people.

The real data quality issue in my initial question is not the precision, validity and timeliness in the number given in the answer but the shared understanding of the label attached to the number.

I noticed that Wikipedia has developed a good metadata habit when stating town populations giving 3 distinct labels: City, Urban and Metro.

55.580294 12.282991

Like this:

Like Loading...
Data Governance, MetadataCopenhagen, Fit for purpose, Metadata, Shakespeare, Single version of the truth, The world
Next Articles

The Disruptive MDM / PIM / DQM List

Register your innovative solution on the disruptive list of MDM/PIM/DQM/DAM solutions here

Select your MDM / PIM / DQM solution

Get a free solution ranking based on your context, scope and requirements here

Follow the blog on social media

  • LinkedIn
  • Twitter

Enter your email address to follow this blog and receive notifications of new posts by email.

Join 11,970 other subscribers

Let product data flow easily between trading partners

Categories

  • Big Reference Data (158)
  • Business Process Management (36)
  • Data Architecture (255)
  • Data Governance (156)
  • Data Matching (220)
    • Business Directories (24)
    • Identity Resolution (61)
    • Survivorship (10)
  • Data Profiling (16)
  • Data Quality Tools (176)
  • Data Quality World Tour (19)
  • Direct Marketing (19)
  • Information Quality (154)
  • Master Data (588)
    • Hierarchy Management (39)
    • Multi-Domain MDM (125)
    • Multienterprise MDM (26)
    • National ID (22)
    • Social MDM (60)
  • Metadata (24)
  • Product Data Syndication (49)
  • Product Information Management (26)
  • Search and navigation (15)
  • Service Oriented Architecture (12)
  • Social Media (78)
  • Sport (16)
  • Supposed to be a joke (66)
  • X-mas (20)

Data Governance Training

Getting Started in Data Governance online training from Nicola Askham is available here.

Recent Posts

  • Master Data Management in Financial Services
  • Which Data Management KPIs Should You Measure?
  • A Guide to Data Quality
  • Extended MDM Revisited
  • Data Governance Necessities
  • Three Augmented Data Management Flavors
  • 4 Key Aspects of Master Data Management in Manufacturing
  • The Business Value Behind Top 3 MDM Trends
  • 4 Concepts in the Gartner Hype Cycle for Digital Business Capabilities that will Shape MDM
  • A Guide to Master Data Management

eLearningCurve

Have a look at the eLearningCurve course on data parsing, data matching and de-duplication here.

Recent Comments

Asifa on Data Fabric and Master Data…
Henrik Gabs Lilienda… on Data Fabric and Master Data…
Asifa on Data Fabric and Master Data…
Direct And Indirect… on Direct Customers and Indirect…
Boeing’s Path… on Direct Customers and Indirect…
Hillary Boyle on MDM Tools Revealed
Henrik Gabs Lilienda… on Which Data Management KPIs Sho…
Gino Fortunato on Which Data Management KPIs Sho…
Henrik Gabs Lilienda… on A Guide to Data Quality
Jeppe Thing Sørensen on A Guide to Data Quality
Henrik Gabs Lilienda… on A Guide to Data Quality
Matthias on A Guide to Data Quality
Michael Fieg on A Guide to Master Data Ma…
Henrik Gabs Lilienda… on 2022 Data Management Pred…
Gino Fortunato on 2022 Data Management Pred…

Doing IT The Toyota Way

Addresses AI B2B B2C Big data Business ecosystems Business intelligence Business processes Business rules CDI CDP Cleansing Compliance Copenhagen CRM Cuisine D&B DAM Data Governance Data model DataQualityPro Digitalization Diversity DQM Duplicates ecommerce ERP Evolution Facebook Fit for purpose Fraud Fuzzy logic Gartner GDPR Geocode Graph Hans Christian Andersen Happy databases History Household Instant Data Quality IoT Laissez faire LEI LinkedIn London Mashup MDM MDMDG Metadata Migration Multi-Channel National identification number OCDQ Online Offline Open data People PIM Prevention Privacy RDM Real world alignment ROI Shakespeare Single version of the truth SOA components Standardisation Standardization Syndication Technology The cloud The world Twitter User involvement Who what where when

Blog stats

  • 552,491 visits

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Join 11,970 other subscribers
Blog at WordPress.com.
  • Follow Following
    • Liliendahl on Data Quality
    • Join 707 other followers
    • Already have a WordPress.com account? Log in now.
    • Liliendahl on Data Quality
    • Customize
    • Follow Following
    • Sign up
    • Log in
    • Report this content
    • View site in Reader
    • Manage subscriptions
    • Collapse this bar
 

Loading Comments...
 

    Privacy & Cookies: This site uses cookies. By continuing to use this website, you agree to their use.
    To find out more, including how to control cookies, see here: Our Cookie Policy
    %d bloggers like this: