Who owns big data?

Data ownership has always been a thorny issue, but the era of big data is sprouting bigger thorns. Last century, ownership was the data equivalent of “you break it, you buy it.” If you own data, you are responsible for it, and can be held accountable if something goes wrong […]

SAS MDM new release brings harmony to big data discord

I've been in many bands over the years, from rock to jazz to orchestra, and each brings with it a different maturity, skill level, attitude, and challenge. Rock is arguably the easiest (and the most fun!) to play, as it involves the fewest members, lowest skill level, a goodly amount of drama, and the […]

Finding the signal in the analytics noise

.@philsimon looks under the hood of 'analytics.'

Provisioning data for advanced analytics in Hadoop

The data lake is a great place to take a swim, but is the water clean? My colleague, Matthew Magne, compared big data to the Fire Swamp from The Princess Bride, and it can seem that foreboding. The questions we need to ask are: How was the data transformed and […]

Using Hadoop: Emerging options for improved query performance

In my last two posts, we concluded two things. First, because of the need for broadcasting data across the internal network to enable the complete execution of a JOIN query in Hadoop, there is a potential for performance degradation for JOINs on top of files distributed using HDFS. Second, there are […]

SAS Data Loader for Hadoop helps your data heroes navigate the fire swamp of big data

In The Princess Bride, one of my favorite movies, our hero Westley – in an attempt to save his love, Buttercup – has to navigate the Fire Swamp. There, Westley and Buttercup encounter fire spouts, quicksand and the dreaded rodents of unusual size (ROUSes). Each time, he has a response to the […]

Using Hadoop: Query optimization

In my last post, I pointed out that an uninformed approach to running queries on top of data stored in Hadoop HDFS may lead to unexpected performance degradation for reporting and analysis. The key issue had to do with JOINs in which all the records in one data set needed […]

Showing the ugly face of bad data: Part 1

Financial institutions are mired in large pools of historic data across multiple lines of business and systems. However, much of the recent data is being produced externally and is isolated from the decision-making and operational banking processes. The limitations of existing banking systems combined with inward-looking and confined data practices […]

What skills will be required to make sense of big data?

Small data is akin to algebra; big data is like calculus.

Big wishes for data management

In the movie Big, a 12-year-old boy, after being embarrassed in front of an older girl he was trying to impress by being told he was too short for a carnival ride, puts a coin into an antique arcade fortune teller machine called Zoltar Speaks, makes a wish to be big, […]
