The DO Loop

Data Visualization | Learn SAS | Programming Tips

Rick WicklinNovember 14, 2022 1

Profile plots in SAS

A profile plot is a compact way to visualize many variables for a set of subjects. It enables you to investigate which subjects are similar to or different from other subjects. Visually, a profile plot can take many forms. This article shows several profile plots: a line plot of the

English

Analytics | Learn SAS | Programming Tips

Rick WicklinNovember 2, 2022 0

The area and perimeter of a convex hull

The area of a convex hull enables you to estimate the area of a compact region from a set of discrete observations. For example, a biologist might have multiple sightings of a wolf pack and want to use the convex hull to estimate the area of the wolves' territory. A

English

Learn SAS | Programming Tips

Rick WicklinSeptember 19, 2022 2

Generate random ID values for subjects in SAS

A common question on SAS discussion forums is how to use SAS to generate random ID values. The use case is to generate a set of random strings to assign to patients in a clinical study. If you assign each patient a unique ID and delete the patients' names, you

English

Analytics | Learn SAS | Programming Tips

Rick WicklinSeptember 7, 2022 0

A test for monotonic sequences and functions

Monotonic transformations occur frequently in math and statistics. Analysts use monotonic transformations to transform variable values, with Tukey's ladder of transformations and the Box-Cox transformations being familiar examples. Monotonic distributions figure prominently in probability theory because the cumulative distribution is a monotonic increasing function. For a continuous distribution that is

English

Analytics | Learn SAS

Rick WicklinAugust 22, 2022 2

The univariate Box-Cox transformation

A SAS customer asked how to use the Box-Cox transformation to normalize a single variable. Recall that a normalizing transformation is a function that attempts to convert a set of data to be as nearly normal as possible. For positive-valued data, introductory statistics courses often mention the log transformation or

English

Analytics | Learn SAS

Rick WicklinAugust 17, 2022 1

The Box-Cox transformation for a dependent variable in a regression

In the 1960s and '70s, before nonparametric regression methods became widely available, it was common to apply a nonlinear transformation to the dependent variable before fitting a linear regression model. This is still done today, with the most common transformation being a logarithmic transformation of the dependent variable, which fits

English

Analytics | Data Visualization | Learn SAS

Rick WicklinAugust 15, 2022 0

Tukey's ladder of variable transformations

John Tukey was an influential statistician who proposed many statistical concepts. In the 1960s and 70s, he was fundamental in the discovery and exposition of robust statistical methods, and he was an ardent proponent of exploratory data analysis (EDA). In his 1977 book, Exploratory Data Analysis, he discussed a small

English

Analytics | Learn SAS

Rick WicklinAugust 8, 2022 1

Means and medians as minimizers of a loss function

On Twitter, I saw a tweet from @DataSciFact that read, "The sum of (x_i - x)^2 over a set of data points x_i is minimized when x is the sample mean." I (@RickWicklin) immediately tweeted out a reply: "And the sum of |x_i - x| is minimized by the sample

English

Learn SAS | Programming Tips

Rick WicklinMay 16, 2022 0

How to unroll frequency data

In categorical data analysis, it is common to analyze tables of counts. For example, a researcher might gather data for 18 boys and 12 girls who apply for a summer enrichment program. The researcher might be interested in whether the proportion of boys that are admitted is different from the

English

Programming Tips

Rick WicklinMay 4, 2022 0

Bootstrap estimates for nonlinear regression models in SAS

In The Essential Guide to Bootstrapping in SAS, I note that there are many SAS procedures that support bootstrap estimates without requiring the analyst to write a program. I have previously written about using bootstrap options in the TTEST procedure. This article discusses the NLIN procedure, which can fit nonlinear

English

Blogs

Blogs

Tag: Data Analysis