Tag: big data

Jim Harris 0
The Chicken Man versus the Data Scientist

In my previous post Sisyphus didn’t need a fitness tracker, I recommended that you only collect, measure and analyze big data if it helps you make a better decision or change your actions. Unfortunately, it’s difficult to know ahead of time which data will meet that criteria. We often, therefore, collect, measure and analyze

Jim Harris 0
Sisyphus didn’t need a fitness tracker

In his pithy style, Seth Godin’s recent blog post Analytics without action said more in 32 words than most posts say in 320 words or most white papers say in 3200 words. (For those counting along, my opening sentence alone used 32 words). Godin’s blog post, in its entirety, stated: “Don’t measure

Arun C. Murthy 0
SAS high-performance capabilities with Hadoop YARN

For Hadoop to be successful as part of the modern data architecture, it needs to integrate with existing tools. This integration allows you to reuse existing resources (licenses and personnel) and is typically 60% of the evaluation criteria for integration of Hadoop into the data center. One of the most

Paul Kent 0
Share your cluster – How Apache Hadoop YARN helps SAS

Even though it sounds like something you hear on a Montessori school playground, this theme “Share your cluster” echoes across many modern Apache Hadoop deployments. Data architects are plotting to assemble all their big data in one system – something that is now achievable thanks to the economics of modern

Jim Harris 0
Data science versus narrative psychology

My previous post explained how confirmation bias can prevent you from behaving like the natural data scientist you like to imagine you are by driving your decision making toward data that confirms your existing beliefs. This post tells the story of another cognitive bias that works against data science. Consider the following scenario: Company-wide

Jim Harris 0
Can data change an already made up mind?

Nowadays we hear a lot about how important it is that we are data-driven in our decision-making. We also hear a lot of criticism aimed at those that are driven more by intuition than data. Like most things in life, however, there’s a big difference between theory and practice. It’s

Jim Harris 0
Bring the noise, boost the signal

Many people, myself included, occasionally complain about how noisy big data has made our world. While it is true that big data does broadcast more signal, not just more noise, we are not always able to tell the difference. Sometimes what sounds like meaningless background static is actually a big insight. Other times

Jim Harris 0
The ethics of algorithmic regulation

In my last three posts on data ethics, I explored a few of the ethical dilemmas in our data-driven world. From examining the ethical practices of free internet service providers to the problem of high-frequency trading, I’ve come to realize the depth and complexity of these issues. Anyone who's aware of these

Jim Harris 0
The low ethics of high-frequency trading

Imagine if your ability to feed your family depended upon how fast you could run. Imagine the aisles of your grocery store as lanes on a running track. If you can outrun your fellow shoppers, grab food off the shelves and race through the checkout at the finish line, then

Analytics
Nele Coghe 0
Hot hot heat map

Although I’m not particularly excited about football (I admit, I don’t completely understand what offside means), I did follow the last World Cup with more than average attention. Not only for the handsome players, but especially for all the fascinating statistics that appeared. It struck me that heat maps popped

SAS Colombia 0
Manténgase activo y con empleo en la era del Big Data

Los datos importan más que nunca. Las organizaciones en progreso como Netflix y La Universidad de Texas, entre otras, están usando herramientas de visualización de datos para encontrarle forma a las grandes cantidades de información que se generan diariamente. En el libro The visual organization, Phil Simon, reconocido experto en

Jim Harris 0
Mapping ethics in a data-driven world

In my previous post, I examined ethics in a data-driven world with an example of how Facebook experiments on its users. Acknowledging the conundrum facing users of free services like Facebook, Phil Simon commented that “users and customers aren’t the same thing. Maybe users are there to be, you know... used.” What about when a

Jim Harris 0
Facing ethics in a data-driven world

I have previously blogged about how the dark side of our mood skews the sentiment analysis of customer feedback negatively since we usually only provide feedback when we have a negative experience with a product or service. Reading only negative reviews from its customers could make a company sad, but could reading only

Jim Harris 0
Data science and decision science

Data science, as Deepinder Dhingra recently blogged, “is essentially an intersection of math and technology skills.” Individuals with these skills have been labeled data scientists and organizations are competing to hire them. “But what organizations need,” Dhingra explained, “are individuals who, in addition to math and technology, can bring in

Jim Harris 0
The data that supported the decision

Data-driven journalism has driven some of my recent posts. I blogged about turning anecdote into data and how being data-driven means being question-driven. The latter noted the similarity between interviewing people and interviewing data. In this post I want to examine interviewing people about data, especially the data used by people to drive

Jim Harris 0
Being data-driven means being question-driven

At the Journalism Interactive 2014 conference, Derek Willis spoke about interviewing data, his advice for becoming a data-driven journalist. “The bulk of the skills involved in interviewing people and interviewing data are actually pretty similar,” Willis explained. “We want to get to know it a little bit. We want to figure

Data Management
Alyssa Farrell 0
A must-read for petroleum professionals

Oil companies are being forced to explore in geologically complex and remote areas to exploit more unconventional hydrocarbon deposits.  New engineering technology has pushed the envelope of previous upstream experience.  No guidebook existed on how computing methodologies can contribute to E&P performance at reduced risk.  Until now. A new book

SAS Colombia 0
Cinco características esenciales de la computación en la nube

Puede llamarlo transformación, cambio de paradigma, evolución o revolución. No importa  qué nombre reciban: en la actualidad, los servicios en la nube están cambiando la realidad de los negocios. La posibilidad y necesidad de acceder a la información desde cualquier sitio, en cualquier momento y con prácticamente cualquier dispositivo, ha

Analytics
Maggie Miller 0
Analytics 2014: Top trends in big data

The Analytics 2014 conference in Frankfurt, Germany gets started tomorrow, but today many of the speakers and attendees are arriving at the Frankfurt Marriott. I caught up with Beth Schultz, editor-in-chief of AllAnalytics.com to hear about her keynote presentation and to find out the benefits of being part of the

Data Management
David Pope 0
Your "SAS view" on steroids

Data federation is a relatively new term used to describe a form of data virtualization. Data virtualization, however, is not new. It has been around since at least the 1960's when virtual memory was introduced to simulate additional memory beyond what was physically available on a machine.  While data federation is a

Analytics
Maggie Miller 0
A big world of big data

In just three short weeks I’ll be in Frankfurt, Germany for the Analytics 2014 conference. There will be hundreds of people from 32 different countries in attendance. It will be exciting to hear how people all over the world, in many different industries, are using analytics. As host of the

Jim Harris 0
A double take on sampling

My previous post made the point that it’s not a matter of whether it is good for you to use samples, but how good the sample you are using is. The comments on that post raised two different, and valid, perspectives about sampling. These viewpoints reflected two different use cases for data,

Jim Harris 0
Survey says sampling still sensible

In my previous post, I discussed sampling error (i.e., when a randomly chosen sample doesn’t reflect the underlying population, aka margin of error) and sampling bias (i.e., when the sample isn’t randomly chosen at all), both of which big data advocates often claim can, and should, be overcome by using all the data. In this

1 19 20 21 22 23 25