Uncategorized

Rick Wicklin 0
Order variables by values of a statistic

When I create a graph of data that contains a categorical variable, I rarely want to display the categories in alphabetical order. For example, the box plot to the left is a plot of 10 standardized variables where the variables are ordered by their median value. The ordering makes it

Data Management
Steve Polilli 0
Your data is in Hadoop, so what?

Okay, let's say your data is in Hadoop. The distributed, open source framework is configured as it should be across low-cost servers and your data is sitting in those clusters. It's been a meaningful effort to get to this point but how does it benefit your organization? If it's not doing something

Advanced Analytics | Analytics
Mike Gilliland 0
5 steps to setting forecasting performance objectives (Part 2)

And now for the five steps: 1. Ignore industry benchmarks, past performance, arbitrary objectives, and what management "needs" your accuracy to be. Published benchmarks of industry forecasting performance are not relevant. See this prior post The perils of forecasting benchmarks for explanation. Previous forecasting performance may be interesting to know, but

Rick Wicklin 0
Ciphers, keys, and cryptoquotes

Today is my fourth blog-iversary: the anniversary of my first blog post in 2010. To celebrate, I am going to write a series of fun posts based on The Code Book by Simon Singh, a fascinating account of the history of cryptography from ancient times until the present. While reading

SAS Colombia 0
Comprenda el valor de la gestión de datos de referencia

Para comenzar este post, vamos a precisar en qué son los datos de referencia para luego pasar a hablar sobre el valor que representan para una empresa. Los datos de referencia tienen un significado contextual y semántico, simplemente se refieren a información que sirve para referenciar elementos y clasificar otros

Jim Harris 0
The Chicken Man versus the Data Scientist

In my previous post Sisyphus didn’t need a fitness tracker, I recommended that you only collect, measure and analyze big data if it helps you make a better decision or change your actions. Unfortunately, it’s difficult to know ahead of time which data will meet that criteria. We often, therefore, collect, measure and analyze

Rick Wicklin 0
How to create a hexagonal bin plot in SAS

While I was working on my recent blog post about two-dimensional binning, a colleague asked whether I would be discussing "the new hexagonal binning method that was added to the SURVEYREG procedure in SAS/STAT 13.2." I was intrigued: I was not aware that hexagonal binning had been added to a

Jim Harris 0
Sisyphus didn’t need a fitness tracker

In his pithy style, Seth Godin’s recent blog post Analytics without action said more in 32 words than most posts say in 320 words or most white papers say in 3200 words. (For those counting along, my opening sentence alone used 32 words). Godin’s blog post, in its entirety, stated: “Don’t measure

0
Let’s chat about big data and innovation

 “The best data scientists are those that combine deep statistical / data / machine learning skills with domain knowledge.” “[Most companies] haven't properly addressed the need for cultural change!... There's still this prevailing perception that it's a technology & skills problem.” “Analytics only ever tells you one of two things—it

Leo Sadovy 0
Why analytic forecasting?

Because you are already halfway there and you should want the entire process to be data-driven, not just the historical reporting and analysis.  You are making decisions and using data to support those decisions, but you are leaving value on the table if the analytics don't carry through to forecasting.  In the

Learn SAS
Rick Wicklin 0
Choosing bins for histograms in SAS

When you create a histogram with statistical software, the software uses the data (including the sample size) to automatically choose the width and location of the histogram bins. The resulting histogram is an attempt to balance statistical considerations, such as estimating the underlying density, and "human considerations," such as choosing

1 189 190 191 192 193 281