The DO Loop

Tag: Data Analysis

Rick Wicklin | October 11, 2013 | 0

In my last blog post I described how to implement a "runs test" in the SAS/IML language. The runs test determines whether a sequence of two values (for example, heads and tails) is likely to have been generated by random chance. This article describes two applications of the runs test.
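As a rough illustration of the idea behind a runs test, here is a minimal Python sketch of the Wald-Wolfowitz runs test with a normal approximation. It is not the SAS/IML implementation from the post; the function name `runs_test` and its return values are hypothetical choices made for this example, and the normal approximation is only reasonable for moderately long sequences.

```python
import math

def runs_test(seq):
    """Wald-Wolfowitz runs test (normal approximation) for a two-valued sequence.

    Hypothetical helper for illustration; returns (number of runs, z statistic,
    two-sided p-value).
    """
    values = sorted(set(seq))
    if len(values) != 2:
        raise ValueError("sequence must contain exactly two distinct values")

    n1 = sum(1 for x in seq if x == values[0])  # count of the first value
    n2 = len(seq) - n1                          # count of the second value
    n = n1 + n2

    # A run is a maximal block of identical consecutive values, so the number
    # of runs is one more than the number of adjacent positions that differ.
    runs = 1 + sum(1 for a, b in zip(seq, seq[1:]) if a != b)

    # Mean and variance of the number of runs under the null hypothesis
    # that the sequence order is random.
    mean = 2.0 * n1 * n2 / n + 1.0
    var = 2.0 * n1 * n2 * (2.0 * n1 * n2 - n) / (n * n * (n - 1.0))
    if var <= 0:
        raise ValueError("sequence is too short for the normal approximation")

    z = (runs - mean) / math.sqrt(var)
    p = math.erfc(abs(z) / math.sqrt(2.0))      # two-sided p-value
    return runs, z, p

# Example: a perfectly alternating sequence has many more runs than expected.
runs, z, p = runs_test("HTHTHTHTHT")  # runs == 10
```

A large positive z suggests too many runs (values alternate more often than chance would predict), and a large negative z suggests too few runs (values cluster together).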

English

Rick Wicklin | October 9, 2013 | 17

How to tell whether a sequence of heads and tails is random

While walking in the woods, a statistician named Goldilocks wanders into a cottage and discovers three bears. The bears, being hungry, threaten to eat the young lady, but Goldilocks begs them to give her a chance to win her freedom. The bears agree. While Mama Bear and Papa Bear block

Rick Wicklin | September 16, 2013 | 25

What versions of R are supported by SAS?

SAS 9 has supported calling R from the SAS/IML language since 2009. The interface to R is part of the SAS/IML language. However, there have been so many versions of SAS and R since 2009 that it is hard to remember which SAS release supports which versions of R. The

[Image: Radar chart of word categories used in debates]

Rick Wicklin | August 21, 2013 | 8

Comparing two groups? Two tips that make a difference

A common visualization task is to compare characteristics of two groups. This article emphasizes two tips that help make the comparison clear. First, consider graphing the differences between the groups. Second, in any plot that has a categorical axis, sort the categories by a meaningful quantity. This article is motivated

Rick Wicklin | August 14, 2013 | 6

Dryer balls and drying time: A statistical analysis

Earlier this week I posted a "guest blog" in which my 8th grade son described a visualization of data for the 2013 ASA Poster Competition. The purpose of today's blog post is to present a higher-level statistical analysis of the same data. I will use a t test and a

Rick Wicklin | August 12, 2013 | 15

Do dryer balls reduce drying time?

Editor's Note: My 8th grade son, David, created a poster that he submitted to the 2013 ASA Poster Competition. The competition encourages students to display "two or more related graphics that summarize a set of data, look at the data from different points of view, and answer specific questions about

Rick Wicklin | July 31, 2013 | 1

Read hundreds of data sets into matrices

Do you have dozens (or even hundreds) of SAS data sets that you want to read into SAS/IML matrices? In a previous blog post, I showed how to iterate over a series of data sets and analyze each one. Inside the loop, I read each data set into a matrix

Rick Wicklin | July 17, 2013 | 2

A simple implementation of two-dimensional binning

In a previous article I discussed how to bin univariate observations by using the BIN function, which was added to the SAS/IML language in SAS/IML 9.3. You can generalize that example and bin bivariate or multivariate data. Over two years ago I wrote a blog post on 2D binning in

Rick Wicklin | July 15, 2013 | 8

Bin observations by using custom cut points and unevenly spaced bins

It is often useful to partition observations for a continuous variable into a small number of intervals, called bins. This familiar process occurs every time that you create a histogram, such as the one on the left. In SAS you can create this histogram by calling the UNIVARIATE procedure. Optionally,

Rick Wicklin | June 26, 2013 | 7

How to color clusters in a dendrogram

The CLUSTER procedure in SAS/STAT software creates a dendrogram automatically. The black-and-white dendrogram is nice, but plain. A SAS customer wanted to know whether it is possible to add color to the dendrogram to emphasize certain clusters. For example, the plot at the left emphasizes a four-cluster scenario for clustering
