Why would you want to visualize 5 billion rows of data?

A few years ago our new media team made this quick video to give you sense of just how big a billion is:

 

That’s big.

But now that you can visualize a billion dollars, can you imagine what a billion rows of data looks like? Or find value in those billions of variables? Not really. And that’s just one reason why data visualization is important for “big data.” Most of us can’t understand what we can’t see.

On the other hand, if you can visualize billions and billions of values by quickly distilling them into subsets, graphs and box plots, you can start to understand the data better. Essentially, visualization gives analysts a jump start on the modeling processes by making it easy to explore the relationships between variables and attributes. When you can actually see and query that much data, you can start to determine which fields mean something and which you should maybe discard.

Before high-performance analytics, you could only do these types of explorations on subsets of large data sets. Now, high-performance analytics make it possible to visualize all 5 billion (plus) rows of data.

What’s the significance? Whether it’s 1 billion or 5 billion rows of data, in-memory analytics is changing the way organizations look at and model data. This becomes even more important when viewed as part of a larger process. Discovery from the visualizations can be fed into more in-memory processes where advanced analytics, like marketing optimization, are applied to improve the business.

With all technology applications, the real benefit will come when high-performance processes are put in to context for business solutions, such as merchandise planning, assortment planning, fraud detection, and value-at-risk calculations. The visualizations are exciting and can provide the early insights, but don't forget: improving business processes is the essential next step.

tags: big data, data visualization, high-performance analytics

2 Trackbacks

  1. [...] Why would you want to visualize 5 billion rows of data? [...]

  2. By What is the cost of not now? on July 11, 2012 at 1:00 am

    [...] to hours, from minutes to seconds, without sampling or using less-than-ideal analytic techniques, even on billions of rows of data. This helps organisations to uncover potential opportunities – new lines of business, untapped [...]

Post a Comment

Your email is never published nor shared. Required fields are marked *

*
*

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <p> <pre lang="" line="" escaped=""> <q cite=""> <strike> <strong>