The DO Loop
Statistical programming in SAS with an emphasis on SAS/IML programs

Figure: Confidence intervals for eigenvalues of a correlation matrix. Graphical comparison of two methods for estimating confidence intervals of eigenvalues of a correlation matrix.
A fundamental principle of data analysis is that a statistic is an estimate of a parameter for the population. A statistic is calculated from a random sample. This leads to uncertainty in the estimate: a different random sample would have produced a different statistic. To quantify the uncertainty, SAS procedures
Figure: Generate random points in a polygon. Decomposition of a convex polygon into triangles.
The triangulation theorem for polygons says that every simple polygon can be triangulated. In fact, if the polygon has V vertices, you can decompose it into V-2 non-overlapping triangles. In this article, a "polygon" always means a simple polygon. Also, a "random point" means one that is drawn at random
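As a small illustration of the V-2 count, here is a minimal Python sketch that triangulates a convex polygon by "fanning" out from its first vertex. This is an assumption-laden example: the helper names `fan_triangulate` and `triangle_area` are hypothetical, the fan approach is valid only for convex polygons (a general simple polygon needs a method such as ear clipping), and it is not necessarily how the article performs the decomposition.

```python
def fan_triangulate(vertices):
    """Split a CONVEX polygon (vertices listed in order) into V-2 triangles.

    Hypothetical helper: connects the first vertex to every non-adjacent
    edge. This does not handle non-convex simple polygons.
    """
    if len(vertices) < 3:
        raise ValueError("a polygon needs at least 3 vertices")
    v0 = vertices[0]
    return [(v0, vertices[i], vertices[i + 1])
            for i in range(1, len(vertices) - 1)]


def triangle_area(a, b, c):
    """Area of a triangle from the cross product of two edge vectors."""
    return abs((b[0] - a[0]) * (c[1] - a[1])
               - (c[0] - a[0]) * (b[1] - a[1])) / 2.0


# Unit square: 4 vertices, so 4-2 = 2 triangles whose areas sum to 1
square = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
triangles = fan_triangulate(square)
total_area = sum(triangle_area(*t) for t in triangles)
```

The triangle areas are useful later if you want to choose a triangle with probability proportional to its area before drawing a point inside it.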
Figure: Generate random points in a triangle. Random uniform points in a triangle.
How can you efficiently generate N random uniform points in a triangular region of the plane? There is a very cool algorithm (which I call the reflection method) that makes the process easy. I no longer remember where I saw this algorithm, but it is different from the "weighted average"
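The teaser does not spell out the algorithm, so the following Python sketch is an assumption about what a "reflection" approach usually looks like: draw a uniform point in the parallelogram spanned by two edges of the triangle, and if the point lands in the far half, reflect it back into the triangle. The function name `random_points_in_triangle` and its parameters are hypothetical.

```python
import random


def random_points_in_triangle(p1, p2, p3, n, seed=None):
    """Return n uniform random points in the triangle with vertices p1, p2, p3.

    Assumed technique: sample (u, v) uniformly in the unit square, which maps
    to the parallelogram spanned by the edges a = p2 - p1 and b = p3 - p1.
    When u + v > 1 the point is in the half outside the triangle, so it is
    reflected by replacing (u, v) with (1 - u, 1 - v).
    """
    rng = random.Random(seed)
    ax, ay = p2[0] - p1[0], p2[1] - p1[1]
    bx, by = p3[0] - p1[0], p3[1] - p1[1]
    points = []
    for _ in range(n):
        u, v = rng.random(), rng.random()
        if u + v > 1.0:
            u, v = 1.0 - u, 1.0 - v  # reflect into the triangle
        points.append((p1[0] + u * ax + v * bx,
                       p1[1] + u * ay + v * by))
    return points


pts = random_points_in_triangle((0.0, 0.0), (1.0, 0.0), (0.0, 1.0), 1000, seed=1)
```

Because the reflection is a measure-preserving map from one half of the parallelogram onto the other, no draws are rejected and the result remains uniform on the triangle.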
Figure: A continuous band plot for visualizing uncertainty in regression predictions.
A previous article discusses the confidence band for the mean predicted value in a regression model. The article shows a "graded confidence band plot," which I saw in Claus O. Wilke's online book, Fundamentals of Data Visualization (Section 16.3). It communicates uncertainty in the predictions. A graded band plot is
Figure: Visualize uncertainty in regression predictions.
You've probably seen many graphs that are similar to the one at the right. This plot shows a regression line overlaid on a scatter plot of some data. Given a value for the independent variable (x), the regression line gives the best prediction for the mean of the response variable
Figure: The Poisson-binomial distribution for hundreds of parameters.
A previous article shows how to use a recursive formula to compute exact probabilities for the Poisson-binomial distribution. The recursive formula is an O(N²) computation, where N is the number of parameters for the Poisson-binomial (PB) distribution. If you have a distribution that has hundreds (or even thousands) of parameters,
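To make the quadratic cost concrete, here is a minimal Python sketch of one standard recursion for the Poisson-binomial probabilities. It is an assumption that this matches the formula in the earlier article; the function name `poisson_binomial_pmf` is hypothetical. Each of the N parameters triggers one pass over a vector of up to N+1 probabilities, which gives the quadratic operation count.

```python
def poisson_binomial_pmf(probs):
    """Exact PMF of the number of successes in independent Bernoulli trials.

    probs[i] is the success probability of trial i. The recursion adds one
    trial at a time: after adding a trial with probability p,
        new[k] = old[k] * (1 - p) + old[k - 1] * p.
    """
    pmf = [1.0]  # with zero trials, P(0 successes) = 1
    for p in probs:
        nxt = [0.0] * (len(pmf) + 1)
        for k, mass in enumerate(pmf):
            nxt[k] += mass * (1.0 - p)   # this trial fails
            nxt[k + 1] += mass * p       # this trial succeeds
        pmf = nxt
    return pmf


pmf = poisson_binomial_pmf([0.5, 0.5])
```

With two fair trials the result reduces to the Binomial(2, 0.5) probabilities, which is a quick sanity check on the recursion.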