All Posts
The Flowing Data blog posted some data about how much TV actors get paid per episode. About a dozen folks have created various visualizations of the data (see the comments in the Flowing Data blog), several of them very glitzy and fancy. One variable in the data is a categorical
I've known several people who were raised during the Great Depression, and I've observed that they are very mindful of waste. My wife's grandmother used to save plastic bags, twist ties, and relatively clean aluminum foil for potential reuse in the household -- because such materials were once scarce. The
Suppose that you want to create a matrix in SAS/IML software that has a special structure, such as a tridiagonal matrix. How do you do it? Or suppose that you want to find elements of a matrix A such that A[i,j] satisfies a certain condition. How do you get the
Anne Milley and I recently had the chance to catch up with Eric Siegel, Chair of Predictive Analytics World, for some Q&A regarding the upcoming conference that takes place during Data Driven Business Week March 14-15 in San Francisco. Question: What a great line-up we see shaping up for Predictive
How important is accurate telecom forecasting? Sitting through another presentation this week on the explosive growth of mobile data, and the eye-popping statistics about how many Facebook and Twitter users there are, I can’t help but be excited by the possibilities. But I also have this feeling of déjà vu
Brian Whittington mentioned last month that government is innovating, and at great speed. Based on what I’ve seen in our line of work with state and local governments around the country in the past year, I have to agree. In the face of today’s daunting fiscal challenges, state and local
If you tell my wife that she's married to a statistical geek, she'll nod knowingly. She is used to hearing sweet words of affection such as "You are more beautiful than Euler's identity" or "My love for you is like the exponential function: increasing, unbounded, and transcendental." But those are
The Winter 2011 issue of Foresight is now available. Here is Editor Len Tashman's preview: This issue of Foresight—our 20th since the journal’s genesis in 2005—begins with a new feature, Joe and Simon Sez. Joe is Joe Smith of Dean Foods and author of three valuable Foresight articles on forecast
In a previous blog post, I described the rules for a tic-tac-toe scratch-off lottery game and showed that it is a bad idea to generate the game tickets by using a scheme that uses equal probabilities. Instead, cells that yield large cash awards must be assigned a small probability of
Because of this week's story about a geostatistician, Mohan Srivastava, who figured out how to predict winning tickets in a scratch-off lottery, I've been thinking about scratch-off games. He discovered how to predict winners when he began to "wonder how they make these [games]." Each ticket has a set of "lucky
We are getting ready to kick off this year's Applying Business Analytics Webinar Series. You can participate in the monthly live webinars and hear directly from organizations that are actively applying analytics, BI and data management to create business value. We promise it will be worth your time! The series
I enjoyed the Dataists' data-driven blog on the best numbers to choose in a Super Bowl betting pool. It reminded me of my recent investigation of which initials are most common. Because the Dataists' blog featured an R function that converts Arabic numerals into Roman numerals, the blog post also
The other day, someone asked me how to compute a matrix of pairwise differences for a vector of values. The person asking the question was using SQL to do the computation for 2,000 data points, and it was taking many hours to compute the pairwise differences. He asked if SAS/IML
On Friday, I posted an article about using spatial statistics to detect whether a pattern of points is truly random. That day, one of my colleagues asked me whether there are any practical applications of detecting spatial randomness or non-randomness. "Oh, sure," I replied, and rattled off a list of
When you pass a matrix as a parameter (argument) to a SAS/IML module, the SAS/IML language does not create a copy of the matrix. Copying the matrix, an approach known as "calling by value," is inefficient. It is well known that languages that implement call-by-value semantics suffer performance penalties. In the SAS/IML language, matrices