Data Quality vs Big Data

If you go to Google Insight and ask for how it goes with search interest for “data quality” versus how it is with “big data” you’ll get this graph:

“Data quality” (blue line) is a bear market. The interest is slowly but steadily decreasing. “Big data” (red line) is a bull market with a steep rising curve of interest starting in early 2011 and exploding in 2012.

So, what can you do if your blog is about data quality? For my part I’m writing a blog post on my data quality blog mentioning the term “big data” as many times as possible 🙂

I’m not saying “big data” is uninteresting. Not at all. I even use the term “big reference data” when describing how to exploit big directories and social network profiles in the quest for improving party master data quality.

In the short period of the “big data” hype it has often been said, that why should we start working with “big data” when we can’t manage small data yet?

While this makes some sense, it will in my eyes be a mistake not to try exploring what data quality techniques we can apply to “big data” and what data quality advantages we can harvest within “big data”.

We have known for years that the amount of data being available is drastically increasing. Now we just have a term to be used when searching for and talking about it. Like it or not; that term is “big data”.

Bookmark and Share

4 thoughts on “Data Quality vs Big Data

  1. scvconsultants 4th May 2012 / 11:55

    Maybe it’s just me but is there an element of the “emperors new clothes” about big data. It is just data isn’t it? But lots of it.

    • Henrik Liliendahl Sørensen 6th May 2012 / 19:36

      It is 🙂

  2. Marcelo V Silva 16th May 2012 / 17:42

    Henrik,

    As you are talking about Big Data I don´t know if you have seen that
    OBAMA ADMINISTRATION UNVEILS “BIG DATA” INITIATIVE: ANNOUNCES $200 MILLION IN NEW R&D INVESTMENTS:

    http://www.whitehouse.gov/sites/default/files/microsites/ostp/big_data_press_release_final_2.pdf?goback=%2Egde_4334727_member_115361512

    Good point regarding the comparison between Data Quality and Big Data… We really have to get into this “new ship that everybody is getting inside”….

    Good job,
    Marcelo

    • Henrik Liliendahl Sørensen 17th May 2012 / 00:47

      Marcelo, thanks a lot for sharing and kind words.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s