Uncategorized

Paul Kent 1
Share your cluster – How Apache Hadoop YARN helps SAS

Even though it sounds like something you hear on a Montessori school playground, this theme “Share your cluster” echoes across many modern Apache Hadoop deployments. Data architects are plotting to assemble all their big data in one system – something that is now achievable thanks to the economics of modern

Jim Harris 0
Data science versus narrative psychology

My previous post explained how confirmation bias can prevent you from behaving like the natural data scientist you like to imagine you are by driving your decision making toward data that confirms your existing beliefs. This post tells the story of another cognitive bias that works against data science. Consider the following scenario: Company-wide

David Loshin 0
Soliciting information about enterprise reference data

The first step is establishing governance for reference data is assessing the existing reference data landscape: understanding what reference data sets are used, who is using them, and how they are being employed to support business processes. That suggests a three-pronged approach to identifying organizational business process and application dependencies

Phil Simon 1
A foxier way to search

What are all of the companies in San Francisco trying to make the Internet of Things happen? Google it if you like, but you're only like to get a simple list of companies, no doubt in an SEO-friendly order. What if you could see those companies in a more comprehensive

Jim Harris 5
Can data change an already made up mind?

Nowadays we hear a lot about how important it is that we are data-driven in our decision-making. We also hear a lot of criticism aimed at those that are driven more by intuition than data. Like most things in life, however, there’s a big difference between theory and practice. It’s

David Loshin 0
The value of reference data governance

In my last post, I shared some thoughts about challenges associated with the lack of management for reference data, such as reinterpretation of semantics and the inconsistencies that crop up when multiple copies are used. All of the challenges I mentioned are indications of a need for improving the enterprisewide

Phil Simon 1
On pronouns, online dating and data laziness

Working from home confers significant benefits. Two of my favorites are a two-second commute and the ability to take afternoon naps without offending judgmental coworkers. Among the drawbacks, though: I'm not going to randomly meet someone at the office. Like many single professionals, I have dabbled in the world of

1 53 54 55 56 57 105