Rick Wicklin, Author at The DO Loop

Analytics | Data Visualization | Programming Tips

Rick Wicklin | October 20, 2025 | 3

Overlay multiple custom density curves on a histogram in SAS

A previous article discusses various ways to overlay a density curve on a histogram in SAS. SAS provides several procedures that handle this task for common univariate probability distributions such as normal, lognormal, and gamma. If you define and use a less common distribution, you can write a GTL template

Data Visualization | Learn SAS | Programming Tips

Rick Wicklin | October 13, 2025 | 1

Use a high-low plot to emulate a histogram in SAS

SAS has several procedures that can fit a probability distribution to data, plot a histogram, and overlay one or more density estimates: PROC UNIVARIATE in Base SAS enables you to overlay parametric density curves from about 20 common continuous probability distributions, such as normal, lognormal, and gamma. It also enables

Analytics | Programming Tips

Rick Wicklin | October 6, 2025 | 0

The inverse power method for eigenvalues

The power method is a well-known iterative scheme to approximate the largest eigenvalue (in absolute value) of a symmetric matrix. It is useful in practice when you need only the largest eigenvalue and eigenvector of a large matrix. The method requires only matrix-vector multiplication and vector scaling. There is a
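
The iteration described in this excerpt (repeated matrix-vector multiplication followed by vector scaling) can be sketched as follows. This is a minimal illustration in Python rather than SAS. The function name `power_method`, the all-ones starting vector, the tolerance, and the iteration cap are assumptions made for the example and are not taken from the article. The eigenvalue estimate uses the Rayleigh quotient of the current unit vector.

```python
import math

def power_method(A, max_iters=1000, tol=1e-12):
    """Approximate the dominant eigenpair of a symmetric matrix A.

    A is a list of row lists. The starting vector, tolerance, and
    iteration cap are illustrative assumptions, not values from the article.
    """
    n = len(A)

    def matvec(M, x):
        # Plain matrix-vector product
        return [sum(M[i][j] * x[j] for j in range(n)) for i in range(n)]

    def unit(x):
        # Scale x to unit Euclidean length
        norm = math.sqrt(sum(t * t for t in x))
        return [t / norm for t in x]

    # Assumed start: all ones. It must not be orthogonal to the dominant eigenvector.
    v = unit([1.0] * n)
    lam = 0.0
    for _ in range(max_iters):
        w = matvec(A, v)
        if all(t == 0.0 for t in w):
            break  # v lies in the null space of A; the iteration cannot continue
        v = unit(w)
        # Rayleigh quotient v'Av for a unit vector v
        lam_new = sum(vi * avi for vi, avi in zip(v, matvec(A, v)))
        converged = abs(lam_new - lam) < tol
        lam = lam_new
        if converged:
            break
    return lam, v

# Example: a 2 x 2 symmetric matrix whose largest eigenvalue is (7 + sqrt(5)) / 2
eigenvalue, eigenvector = power_method([[4.0, 1.0], [1.0, 3.0]])
```

The sketch converges slowly when the two largest eigenvalues are close in absolute value, and it assumes that a single dominant eigenvalue exists.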

Analytics | Programming Tips

Rick Wicklin | September 29, 2025 | 1

Visualize Rayleigh quotients and eigenvectors

When I encounter a new function, I usually graph it to gain intuition about how the function transforms its inputs. Recently, I needed to use the Rayleigh quotient function, which is connected to the estimation of eigenvalues and eigenvectors for symmetric matrices. It has been several years since I last

Analytics | Programming Tips

Rick Wicklin | September 22, 2025 | 2

Birthdays and the coupon collector's problem

The new school year had barely started when I got a call from a friend who is an elementary school principal. She told me that every morning she announces the names of students who are celebrating a birthday. "One student noticed that we've already had two days on which no

Analytics | Programming Tips

Rick Wicklin | September 15, 2025 | 0

Stirling numbers in SAS

In probability and statistics, special numbers are used to compute probabilities by counting the number of ways certain events can occur. The most famous are combinations and permutations. Both are used to count the ways to arrange or select items from a set. If a set contains n elements: A

Data Visualization | Learn SAS

Rick Wicklin | September 8, 2025 | 2

The CYCLEATTRS option in PROC SGPLOT

I've often wondered about the logic that the SGPLOT procedure in SAS uses to determine whether a set of graphical overlays will receive identical attributes or different attributes. (Recall that color, size, line style, and marker symbol are all examples of attributes.) I know that when you plot grouped data

Learn SAS | Programming Tips

Rick Wicklin | September 2, 2025 | 1

A SAS macro technique for running a one-time task

In data analysis, sometimes we need to perform a preliminary task before we can analyze data. Often the task needs to be performed only once per session. For example, you might need to download or merge data prior to your analysis. Or you might need to define or load a

Analytics | Learn SAS | Programming Tips

Rick Wicklin | August 25, 2025 | 5

Implement the generalized extreme value distribution in SAS

SAS supports more than 25 common probability distributions for the PDF, CDF, QUANTILE, and RAND functions. If you need a less-common distribution, you can implement new distributions by using Base SAS (specifically, PROC FCMP) or the SAS/IML language. On the SAS Support Communities, a SAS programmer asked how to implement

Analytics | Learn SAS | Programming Tips

Rick Wicklin | August 18, 2025 | 0

Confidence intervals for Cohen's d statistic in SAS

A previous article discusses Cohen's d statistic and how to compute it in SAS. For a two-sample independent design, Cohen's d estimates the standardized mean difference (SMD). Because Cohen's d is a biased statistic, the previous article also computes Hedges' g, which is an unbiased estimate of the SMD. Lastly,
