The Hadoop experiment: To model or not to model

I recently discovered this technical white paper on SAS’ customer support site called Data Modeling Considerations in Hadoop and Hive, written by one of SAS’ R&D teams. I was intrigued by the team’s findings, so in this post, I want to share its highlights – without getting into the technical […]

Post a Comment

We need Hadoop to keep our data costs down

You’ve read the research reports and seen the statistics. You’ve attended the conferences and heard the case studies. You’ve read the online articles and kept up with expert opinions. Your organization has even done a few big data sandbox projects – some successful, some not. Yet the jury is still […]

Post a Comment

Managing big data expectations

In this era of big data hype, it's easy to understand the hesitation of many organizations to take the plunge. Finding a signal in noisy petabytes of unstructured data isn't easy. Companies like Netflix, Amazon, Facebook, Twitter and Google that "do" big data well have spent hundreds of millions of dollars (or […]

Post a Comment

Hadoop and the future of ETL

I've lived in Las Vegas for two and a half years now. When I decided to move here, I was oblivious to the downtown revitalization taking place as well as the burgeoning tech ecosystem. It turns out that Vegas is an increasingly attractive place for start-ups. Thank 24-7 gambling, low […]

Post a Comment

The traditional data warehouse and Hadoop

Data warehouse (DWH) environments have typically been the standard when it comes to supporting analytical environments. There can be many systems supporting a particular modeling or analytical group, and because these groups have varying requirements for data, the replicated data is maintained because the transition to new storage and computing […]

Post a Comment

Big data management precision, pace and partnerships: Cloudera Impala, Hadoop and Teradata

The "Internet of Things" is the latest buzzword characterizing the machine-generated big data that has outstripped our ability to derive value from it. Think of UPS delivering 16 million packages every day through various hubs and all the logistics and decisioning that goes into that. But how does an organization […]

Post a Comment

Hadoop is ... fill in the blank

My new favorite game  is quite addictive, simple and, believe it or not, free. I call it "Stumped by Google Autocomplete." Here's how it works: type any word or phrase into Google, and see what Google suggests with it's Autocomplete function. Like all things Google, this feature learns from searches […]

Post a Comment

How Hadoop can help... even if you don’t have big data

“Our corporate data is growing at a rate of 27% each year and we expect that to increase. It’s just getting too expensive to extend and maintain our data warehouse.” “Don’t talk to us about our ‘big’ data. We’re having enough trouble getting our ‘small’ data processed and analyzed in […]

Post a Comment

Ready to be disillusioned? Big data and Hadoop enters the next phase

If you work in the software industry, seeing technologies emerge and catch fire is a great spectator sport. Whether it's a programming language, a platform or something like e-commerce, each new wave ripples throughout the industry. Currently, Hadoop is having its time in the sun, and we are all trying […]

Post a Comment

Big data management survey: Hadoop is the word

Recently, TDWI released "Managing Big Data," a report that explored trends in big data management (BDM). The author, Philip Russom, is an expert in the fields of data warehousing and data management, and for this report he surveyed more than 400 practitioners about their big data efforts. One thing  immediately evident is […]

Post a Comment