Basketball tournaments, Moneyball, and sports analytics

A big part of  ”winning” these days (be it sports or a business) is performing analytics better than your competition.  This is demonstrated in awe-inspiring fashion in the book (and movie) “Moneyball.”  And on that topic, I’d like to show you a few ways SAS can be used to analyze sports data [...]

Post a Comment

The Human Side of Statistical Process Control: Three Applications of SAS/QC You Might Not Have Thought About

When you think of statistical process control, or SPC for short, what industry first comes to your mind? In the past 10 or 15 years, diverse industries have begun to standardize processes and administrative tasks with statistical process control. While the top two bars of the industrial Pareto chart are [...]

Post a Comment

Who Ate My Lunch? Discriminant Thresholds to Reduce False Accusations

Lunch. For some workers, it’s the sweetest part of an otherwise bitter day at the grindstone. Nothing can turn that sweetness sour like going into the breakroom to discover that someone has taken your lunch and eaten it themselves. Nothing like that ever happens here at SAS. But if it [...]

Post a Comment

The Punchline: MANOVA or a Mixed Model?

Edited to add: Thanks for Larry Madger for noticing an important omission in my code below. I have updated the programs to include the response variables, which enables the responses to have different means. So, if you were reading last week, we talked about how to structure your data for [...]

Post a Comment

Data Structure for Repeated Measures Analysis... A Teaser

Next week’s blog entry will build on this one, so I want you to take notes, OK? It’s not headline news that in most cases, the best way to handle a repeated measures analysis is with a mixed models approach, especially for Normal reponses (for other distributions in the exponential [...]

Post a Comment

Discriminant Analysis, Priors, and Fairy-Selection

A student in my multivariate class last month asked a question about prior probability specifications in discriminant function analysis: What if I don’t know what the probabilities are in my population? Is it best to just use the default in PROC DISCRIM? First, a quick refresher of priors in discriminant [...]

Post a Comment

We Wanted to Call It "All the Statistics You Missed in B-School"

Happy New Year!! This is a good time to think about what was going on here in SAS Education one year ago, and to introduce you to a big project that I’m really excited to “take public.” In January 2010 (as well as throughout 2009), we kept getting cries for [...]

Post a Comment

Weekday Morning Quick-trick: How to Score from PROC VARCLUS

Have you used multivariate procedures in SAS and wanted to save out scores? Some procedures, such as FACTOR, CANDISC, CANCORR, PRINCOMP, and others have an OUT= option to save scores to the input data set. However, to score a new data set, or to perform scoring with multivariate procedures that [...]

Post a Comment

Game-changing Analytics

The M2010 Data Mining Conference team has arrived in Las Vegas and we’re getting ready to host the 750+ analysts, statisticians and database managers who will be ascending upon Sin City this weekend for our 13th annual conference. tags: data analysis, data mining

Post a Comment

Mixed Feelings about Logistic Regression: Eight Hints for Getting Started with PROC GLIMMIX

Delicious Mixed Model Goodness Imagine the scene: You’re in your favorite coffee shop, laptop and chai. The last of the data from a four-year study are validated and ready for analysis. You’ve explored the plots, preliminary results are promising, and now it is time to fit the model. It’s not [...]

Post a Comment