The DO Loop
Statistical programming in SAS with an emphasis on SAS/IML programs
The 80-20 rule for blogs
You've probably heard about the "80-20 Rule," which describes many natural and manmade phenomena. This rule is sometimes called the "Pareto Principle" because it was discovered by Vilfredo Pareto (1848–1923) who used it to describe the unequal distribution of wealth. Specifically, in his study, 80% of the wealth was held
The sweep operator: A fundamental operation in regression
The sweep operator performs elementary row operations on a system of linear equations. The sweep operator enables you to build regression models by "sweeping in" or "sweeping out" particular rows of the X`X matrix. As you do so, the estimates for the regression coefficients, the error sum of squares, and
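As a rough illustration of "sweeping in" rows, here is a minimal Python sketch (not SAS/IML, and not necessarily the implementation the post uses). The function name `sweep`, the sign convention for the off-pivot column, and the tiny data set are all assumptions made for this example. It sweeps the augmented cross-product matrix built from X`X, X`y, and y`y on the first two pivots, after which the last column holds the coefficient estimates and the lower-right cell holds the error sum of squares.

```python
def sweep(a, k):
    """Sweep the square matrix `a` (list of lists) on pivot k; return a new matrix.

    Hypothetical helper for illustration. Sign conventions for the swept
    column differ between references; this one stores -b/d off the pivot.
    """
    n = len(a)
    d = a[k][k]
    if d == 0:
        raise ValueError("zero pivot: cannot sweep on this row")
    out = [row[:] for row in a]
    # Scale the pivot row.
    out[k] = [v / d for v in a[k]]
    # Eliminate the pivot column from every other row.
    for i in range(n):
        if i == k:
            continue
        b = a[i][k]
        out[i] = [a[i][j] - b * out[k][j] for j in range(n)]
        out[i][k] = -b / d
    out[k][k] = 1.0 / d
    return out


# Assumed toy data: x = 0, 1, 2 with an intercept column, and y = 1 + 2x exactly.
# Augmented matrix [[X`X, X`y], [y`X, y`y]]:
a = [[3.0, 3.0, 9.0],
     [3.0, 5.0, 13.0],
     [9.0, 13.0, 35.0]]

swept = sweep(sweep(a, 0), 1)      # sweep in the intercept, then the slope
coefficients = [swept[0][2], swept[1][2]]
error_ss = swept[2][2]
print(coefficients, error_ss)
```

Because the toy response is an exact linear function of x, the error sum of squares is zero here; with noisy data the same cell would hold the residual sum of squares.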
Random permutations without duplicates
A colleague and I recently discussed how to generate random permutations without encountering duplicates. Given a set of n items, there are n! permutations. My colleague wants to generate k unique permutations at random from among the total of n!. Said differently, he wants to sample without replacement from the
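One simple way to think about the problem is rejection sampling: shuffle repeatedly and discard any permutation that has already been seen. The Python sketch below is only an illustration under that assumption. The function name `unique_random_permutations` and its `seed` parameter are hypothetical, and the post itself may use a different approach.

```python
import math
import random


def unique_random_permutations(n, k, seed=None):
    """Return k distinct random permutations of 0..n-1 (rejection sampling)."""
    if k > math.factorial(n):
        raise ValueError("k cannot exceed n!")
    rng = random.Random(seed)
    items = list(range(n))
    seen = set()      # permutations generated so far, stored as tuples
    result = []       # same permutations, in the order they were drawn
    while len(result) < k:
        rng.shuffle(items)
        candidate = tuple(items)
        if candidate not in seen:   # reject duplicates
            seen.add(candidate)
            result.append(list(candidate))
    return result


perms = unique_random_permutations(4, 10, seed=1)
print(len(perms))
```

Rejection works well when k is small relative to n!, but it slows down as k approaches n! because most draws are then duplicates.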
Find the unique rows of a numeric matrix
Sometimes it is important to ensure that a matrix has unique rows. When the data are all numeric, there is an easy way to detect (and delete!) duplicate rows in a matrix. The main idea is to subtract one row from another. Start with the first row and subtract it
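The excerpt is cut off, so the following Python sketch is only one plausible reading of the "subtract one row from another" idea: take the first remaining row, subtract it from each later row, and treat any row whose differences are all zero as a duplicate. The function name `unique_rows` is hypothetical, and the comparison is exact, so rows that differ only by rounding error count as distinct.

```python
def unique_rows(matrix):
    """Keep the first occurrence of each numeric row and drop later duplicates.

    Assumes every row has the same length.
    """
    remaining = [list(row) for row in matrix]
    kept = []
    while remaining:
        first = remaining[0]
        kept.append(first)
        # A later row duplicates `first` when every element-wise difference is zero.
        remaining = [
            row for row in remaining[1:]
            if any(a - b != 0 for a, b in zip(row, first))
        ]
    return kept


m = [[1, 2], [3, 4], [1, 2], [3, 4], [5, 6]]
print(unique_rows(m))
```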
Taking in. Giving back.
When we breathe, we breathe in and breathe out. If we choose only one or the other, the results are disastrous. The same principle applies to professional growth and development. Whether we are programmers, statisticians, teachers, students, or writers, we benefit from taking in and giving back. We "take in"
Distance correlation
Correlation is a statistic that measures how closely two variables are related to each other. The most popular definition of correlation is the Pearson product-moment correlation, which is a measurement of the linear relationship between two variables. Many textbooks stress the linear nature of the Pearson correlation and emphasize that
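To make the linear nature of the Pearson correlation concrete, here is a small Python sketch with made-up data. The helper name `pearson` is an assumption for this example. It computes the product-moment correlation for an exactly linear relationship and for a symmetric quadratic one.

```python
import math


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


x = [-2, -1, 0, 1, 2]
linear = [2 * v + 1 for v in x]    # y depends linearly on x
quadratic = [v ** 2 for v in x]    # y depends on x, but not linearly

r_linear = pearson(x, linear)
r_quadratic = pearson(x, quadratic)
print(r_linear, r_quadratic)
```

In the quadratic case y is completely determined by x, yet the Pearson correlation is zero because the positive and negative deviations cancel.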