Blogs

Search Results: sgplot (969)

Analytics

Rick Wicklin | March 17, 2021

The Farey sequence

Here is an interesting math question: How many reduced fractions in the interval (0, 1) have a denominator less than 100? The question is difficult because of the word "reduced." If we only care about the total number of fractions in (0,1) whose denominator is less than 100, we

Read More

Data Visualization | Programming Tips

Robert Allison | March 15, 2021

SAS graphs for R programmers - bar charts

This is another in my series of blog posts where I take a deep dive into converting customized R graphs into SAS graphs. Today we'll be working on bar charts ... And to give you a hint about what data I'll be using this time, here's a picture from a SAS

Read More

Analytics | Programming Tips

Rick Wicklin | March 15, 2021

The generalized gamma distribution

A SAS customer wanted to compute the cumulative distribution function (CDF) of the generalized gamma distribution. For any continuous distribution, the CDF is the integral of the probability density function (PDF), which usually has an explicit formula. Accordingly, he wanted to compute the CDF by using the QUAD function in

Read More

Programming Tips

Rick Wicklin | March 10, 2021

Pi and products

This is my Pi Day post for 2021. Every year on March 14th (written 3/14 in the US), geeky mathematicians and their friends celebrate "all things pi-related" because 3.14 is the three-decimal approximation to pi. Most years I write about lower-case pi (π), which is the ratio of a circle's

Read More

Analytics | Data Visualization | Learn SAS

Rick Wicklin | March 8, 2021

The conditional distribution of a response variable

I recently learned about a new feature in PROC QUANTREG that was added in SAS/STAT 15.1 (part of SAS 9.4M6). Recall that PROC QUANTREG enables you to perform quantile regression in SAS. (If you are not familiar with quantile regression, see an earlier article that describes quantile regression and provides

Read More

Data Visualization | Learn SAS | Programming Tips

Rick Wicklin | March 1, 2021

Create a wind chill chart in SAS

I recently wrote about a simple statistical formula that approximates the wind chill temperature, which is the cumulative effect of air temperature and wind on the human body. The formula uses two independent variables (air temperature and wind speed) to predict the wind chill temperature. This article describes how to

Read More

Data Visualization | Learn SAS | Programming Tips

Rick Wicklin | February 22, 2021

How to use the #BYVAR and #BYVAL keywords to customize graph titles in SAS

A previous article describes how to use the SGPANEL procedure to visualize subgroups of data. It focuses on using headers to display information about each graph. In the example, the data are time series for the price of several stocks, and the headers include information about whether the stock price

Read More

Data Visualization | Learn SAS

Rick Wicklin | February 17, 2021

Data-driven titles for graphs

Many characteristics of a graph are determined by the underlying data at run time. A familiar example is when you use colors to indicate different groups in the data. If the data have three groups, you see three colors. If the data have four groups, you see four colors. The

Read More

Data Visualization | Programming Tips

Robert Allison | February 15, 2021

SAS graphs for R programmers - diverging bars

This is another in my series of blogs where I take a deep dive into converting a customized R graph into a SAS graph. Today I'm focusing on a diverging bar chart (where one bar segment is above the zero line, and the other is below). What type of data

Read More

Data Visualization | Programming Tips

Robert Allison | February 5, 2021

SAS graphs for R programmers - needle plots

This is another in my series of blogs where I take a deep dive into converting a customized R graph into a SAS ODS Graphics graph. This time the example is a needle plot (that's essentially like a bar plot, with lots of tiny bars, plotted along a continuous x-axis).

Read More

Analytics | Programming Tips

Rick Wicklin | February 3, 2021

Generate random points on a sphere

In a previous article, I showed how to generate random points uniformly inside a d-dimensional sphere. In that article, I stated the following fact: If Y is drawn from the uncorrelated multivariate normal distribution, then S = Y / ||Y|| has the uniform distribution on the unit sphere. I was

Read More

Data Visualization | Programming Tips

Robert Allison | January 28, 2021

SAS graphs for R programmers - overlay lines

In the past, Sanjay showed how to create several basic graphs using both R and SAS ODS Graphics code. I'm going to take a bit of a "deeper dive" and focus a series of blog posts on highly customized graphs. Hopefully the code for these customizations will provide you with

Read More

Analytics | Programming Tips

Rick Wicklin | January 27, 2021

The inverse gamma distribution in SAS

The inverse gamma distribution is a continuous probability distribution that is used in Bayesian analysis and in some statistical models. The inverse gamma distribution is closely related to the gamma distribution. For any probability distribution, it is essential to know how to compute four functions: the PDF function, which returns

Read More

Programming Tips

Rick Wicklin | January 25, 2021

How to compute the incomplete gamma function in SAS

Years ago, I wrote about how to compute the incomplete beta function in SAS. Recently, a SAS programmer asked about a similar function, called the incomplete gamma function. The incomplete gamma function is a "special function" that arises in applied math, physics, and statistics. You should not confuse the gamma

Read More

Data Visualization | Programming Tips

Robert Allison | January 20, 2021

Mobile phone market share - stacked bar charts

I recently had a discussion with a friend, and we were wondering about Apple's market share. This led me to look into the actual data ... finding the online charts lacking, and then designing my own charts. Follow along if you're curious about the process of improving the charts, or

Read More

Data Visualization | Programming Tips

Chris Hemedinger | January 18, 2021

Visualizing our Netflix Trip through "The Office"

Over 57 billion minutes of The Office was streamed in 2020. My family bears some responsibility. Here's our activity visualized -- using SAS.

Read More

Data Visualization | Learn SAS

Rick Wicklin | January 18, 2021

The DOLIST syntax: Specify a list of numerical values in SAS

Have you ever heard of the DOLIST syntax? You might know the syntax even if you are not familiar with the name. The DOLIST syntax is a way to specify a list of numerical values to an option in a SAS procedure. Applications include: Specify the end points for bins

Read More

Data Visualization | Programming Tips

Robert Allison | January 8, 2021

List of 'big' movies you might not have seen yet

If you've been stuck at home a lot lately, and think you have run out of movies to watch -- think again! Here is a list of big-budget movies you might not have seen, because they flopped (lost lots of money). Follow along as I show you how I created

Read More

Sports & Entertainment

Analytics | Learn SAS | Programming Tips

Rick Wicklin | January 6, 2021

The simple block bootstrap for time series in SAS

For ordinary least squares (OLS) regression, you can use a basic bootstrap of the residuals (called residual resampling) to perform a bootstrap analysis of the parameter estimates. This is possible because an assumption of OLS regression is that the residuals are independent. Therefore, you can reshuffle the residuals to get

Read More

Analytics | Data Visualization | Programming Tips

Rick Wicklin | January 4, 2021

Top posts from The DO Loop in 2020

Last year, I wrote more than 100 posts for The DO Loop blog. In previous years, the most popular articles were about SAS programming tips, statistical analysis, and data visualization. But not in 2020. In 2020, when the world was ravaged by the coronavirus pandemic, the most-read articles were related

Read More

Analytics | Data Visualization | Learn SAS

Rick Wicklin | December 14, 2020

Segmented regression models in SAS

A segmented regression model is a piecewise regression model that has two or more sub-models, each defined on a separate domain for the explanatory variables. For simplicity, assume the model has one continuous explanatory variable, X. The simplest segmented regression model assumes that the response is modeled by one parametric

Read More

Analytics | Learn SAS | Programming Tips

Rick Wicklin | December 9, 2020

Horn's method: A simulation-based method for retaining principal components

One purpose of principal component analysis (PCA) is to reduce the number of important variables in a data analysis. Thus, PCA is known as a dimension-reduction algorithm. I have written about four simple rules for deciding how many principal components (PCs) to keep. There are other methods for deciding how

Read More

Analytics | Data Visualization

Rick Wicklin | December 7, 2020

Can you transplant an indoor Christmas tree?

"O Christmas tree, O Christmas tree, how lovely are your branches!" The idealized image of a Christmas tree is a perfectly straight conical tree with lush branches and no bare spots. Although this ideal exists only on Christmas cards, forest researchers are always trying to develop trees that approach the

Read More

Analytics | Data Visualization | Programming Tips

Rick Wicklin | December 2, 2020

How to score a logistic regression model that was not fit by PROC LOGISTIC

A SAS customer asked a great question: "I have parameter estimates for a logistic regression model that I computed by using multiple imputations. How do I use these parameter estimates to score new observations and to visualize the model? PROC LOGISTIC can do the computation I want, but how do

Read More

Analytics | Data Visualization

Rick Wicklin | November 23, 2020

Decile plots in SAS

I previously showed how to create a decile calibration plot for a logistic regression model in SAS. A decile calibration plot (or "decile plot," for short) is used in some fields to visualize agreement between the data and a regression model. It can be used to diagnose an incorrectly specified

Read More

Analytics | Data Visualization

[Image: Predicted probabilities for a logistic regression model]

Rick Wicklin | November 18, 2020

Create scoring data when regressors are correlated

To help visualize regression models, SAS provides the EFFECTPLOT statement in several regression procedures and in PROC PLM, which is a general-purpose procedure for post-fitting analysis of linear models. When scoring and visualizing a model, it is important to use reasonable combinations of the explanatory variables for the visualization. When

Read More

Data Visualization | Learn SAS

Rick Wicklin | November 16, 2020

Three tips for plotting discontinuous functions in SAS

I have previously written about how to plot a discontinuous function in SAS. That article shows how to use the GROUP= option on the SERIES statement to graph a discontinuous function. An alternative approach is to place a missing value for the Y variable at the locations at which the

Read More

Analytics

[Image: Monte Carlo distribution of skewness statistic (B=10000, N=100)]

Rick Wicklin | October 28, 2020

The sample skewness is a biased statistic

The skewness of a distribution indicates whether a distribution is symmetric or not. The Wikipedia article about skewness discusses two common definitions for the sample skewness, including the definition used by SAS. In the middle of the article, you will discover the following sentence: In general, the [estimators] are both

Read More

Data Visualization | Programming Tips

Robert Allison | October 26, 2020

Do low mortgage rates bring you joy(plots)?

When it comes to plotting mortgage rate data, I often look to Len Kiefer for inspiration. He recently posted a retro-looking graph on Twitter that caught my eye ... and of course I had to see if I could create something similar using SAS. For lack of a better term,

Read More

Analytics | Programming Tips

[Image: Graphical comparison of two methods for estimating confidence intervals of eigenvalues of a correlation matrix]

Rick Wicklin | October 26, 2020

Confidence intervals for eigenvalues of a correlation matrix

A fundamental principle of data analysis is that a statistic is an estimate of a parameter for the population. A statistic is calculated from a random sample. This leads to uncertainty in the estimate: a different random sample would have produced a different statistic. To quantify the uncertainty, SAS procedures

Read More
