Blogs

Blogs

Tag: Regression

Learn SAS | Programming Tips

Rick WicklinJuly 24, 2024 0

QPSOLVE: A new SAS IML function for quadratic optimization

Since the pandemic began in 2020, the SAS IML developers have added about 50 new functions and enhancements to the SAS IML language in SAS Viya. Among these functions are new modern methods for optimization that have a simplified syntax as compared to the older 'NLP' functions that are available

Read More

Analytics | Learn SAS | Programming Tips

Rick WicklinJuly 15, 2024 0

Isotonic regression: An application of quadratic optimization

Isotonic regression (also called monotonic regression) is a type of regression model that assumes that the response variable is a monotonic function of the explanatory variable(s). The model can be nondecreasing or nonincreasing. Certain physical and biological processes can be analyzed by using an isotonic regression model. For example, a

Read More

Analytics | Data Visualization | Learn SAS

Rick WicklinJune 3, 2024 0

Visualize a multivariate regression model when using spline effects

A SAS analyst read my previous article about visualizing the predicted values for a regression model that uses spline effects. Because the original explanatory variable does not appear in the model, the analyst had several questions: How do you score the model on new data? The previous example has only

Read More

Analytics | Learn SAS | Programming Tips

Rick WicklinMarch 13, 2023 0

Use the metalog distribution in SAS

A previous article describes the metalog distribution (Keelin, 2016). The metalog distribution is a flexible family of distributions that can model a wide range of shapes for data distributions. The metalog system can model bounded, semibounded, and unbounded continuous distributions. This article shows how to use the metalog distribution in

Read More

Analytics | Data Visualization | Learn SAS

Rick WicklinJanuary 18, 2023 0

Visualize how parameters in a binary logistic regression model affect the probability of the event

A previous article shows that you can use the Intercept parameter to control the ratio of events to nonevents in a simulation of data from a logistic regression model. If you decrease the intercept parameter, the probability of the event decreases; if you increase the intercept parameter, the probability of

Read More

Analytics | Programming Tips

Rick WicklinJanuary 16, 2023 0

Simulate data from a logistic regression model: How the intercept parameter affects the probability of the event

This article shows that you can use the intercept parameter to control the probability of the event in a simulation study that involves a binary logistic regression model. For simplicity, I will simulate data from a logistic regression model that involves only one explanatory variable, but the main idea applies

Read More

Analytics | Learn SAS | Programming Tips

Rick WicklinOctober 24, 2022 0

Implement binary logistic regression from first principles

I recently gave a presentation about the SAS/IML matrix language in which I emphasized that a matrix language enables you to write complex analyses by using only a few lines of code. In the presentation, I used least squares regression as an example. One participant asked how many additional lines

Read More

Analytics | Learn SAS

Rick WicklinJuly 11, 2022 0

Confidence bands for partial leverage regression plots

I previously wrote about partial leverage plots for regression diagnostics and why they are useful. You can generate a partial leverage plot in SAS by using the PLOTS=PARTIALPLOT option in PROC REG. One useful property of partial leverage plots is the ability to graphically represent the null hypothesis that a

Read More

Analytics | Learn SAS

Rick WicklinJune 8, 2022 0

The effect of weight functions in a robust regression method

M estimation is a robust regression technique that assigns a weight to each observation based on the magnitude of the residual for that observation. Large residuals are downweighted (assigned weights less than 1) whereas observations with small residuals are given weights close to 1. By iterating the reweighting and fitting

Read More

Analytics | Learn SAS

Rick WicklinJune 6, 2022 0

Weights for residuals in robust regression

An early method for robust regression was iteratively reweighted least-squares regression (Huber, 1964). This is an iterative procedure in which each observation is assigned a weight. Initially, all weights are 1. The method fits a least-squares model to the weighted data and uses the size of the residuals to determine

Read More

Programming Tips

Rick WicklinMay 4, 2022 0

Bootstrap estimates for nonlinear regression models in SAS

In The Essential Guide to Bootstrapping in SAS, I note that there are many SAS procedures that support bootstrap estimates without requiring the analyst to write a program. I have previously written about using bootstrap options in the TTEST procedure. This article discusses the NLIN procedure, which can fit nonlinear

Read More

Analytics | Programming Tips

Rick WicklinFebruary 14, 2022 0

Passing-Bablok regression in SAS

This article implements Passing-Bablok regression in SAS. Passing-Bablok regression is a one-variable regression technique that is used to compare measurements from different instruments or medical devices. The measurements of the two variables (X and Y) are both measured with errors. Consequently, you cannot use ordinary linear regression, which assumes that

Read More

Analytics | Learn SAS | Programming Tips

Rick WicklinJanuary 10, 2022 0

12 blog posts from 2021 that deserve a second look

On this blog, I write about a diverse set of topics that are relevant to statistical programming and data visualization. In a previous article, I presented some of the most popular blog posts from 2021. The most popular articles often deal with elementary or familiar topics that are useful to

Read More

Analytics | Learn SAS

Rick WicklinOctober 27, 2021 0

Interpret estimates for a Weibull regression model in SAS

It can be frustrating when the same probability distribution has two different parameterizations, but such is the life of a statistical programmer. I previously wrote an article about the gamma distribution, which has two common parameterizations: one that uses a scale parameter (β) and another that uses a rate parameter

Read More

Analytics | Learn SAS

Rick WicklinMay 17, 2021 0

Standardized regression coefficients in PROC GLIMMIX

I previously wrote about how to understand standardized regression coefficients in PROC REG in SAS. You can obtain the standardized estimates by using the STB option on the MODEL statement in PROC REG. Several readers have written to ask whether I could write a similar article about the STDCOEF option

Read More

Analytics | Data Visualization | Learn SAS

Rick WicklinMarch 8, 2021 0

The conditional distribution of a response variable

I recently learned about a new feature in PROC QUANTREG that was added in SAS/STAT 15.1 (part of SAS 9.4M6). Recall that PROC QUANTREG enables you to perform quantile regression in SAS. (If you are not familiar with quantile regression, see an earlier article that describes quantile regression and provides

Read More

Analytics | Data Visualization | Learn SAS

Rick WicklinDecember 14, 2020 0

Segmented regression models in SAS

A segmented regression model is a piecewise regression model that has two or more sub-models, each defined on a separate domain for the explanatory variables. For simplicity, assume the model has one continuous explanatory variable, X. The simplest segmented regression model assumes that the response is modeled by one parametric

Read More

Advanced Analytics | Machine Learning

Flow chart shows which algorithms to use when

Hui LiDecember 9, 2020 0

Which machine learning algorithm should I use?

This resource is designed primarily for beginner to intermediate data scientists or analysts who are interested in identifying and applying machine learning algorithms to address the problems of their interest. A typical question asked by a beginner, when facing a wide variety of machine learning algorithms, is “which algorithm should

Read More

Analytics | Data Visualization | Programming Tips

Rick WicklinDecember 2, 2020 0

How to score a logistic regression model that was not fit by PROC LOGISTIC

A SAS customer asked a great question: "I have parameter estimates for a logistic regression model that I computed by using multiple imputations. How do I use these parameter estimates to score new observations and to visualize the model? PROC LOGISTIC can do the computation I want, but how do

Read More

Analytics | Learn SAS

Rick WicklinSeptember 21, 2020 0

Regression with inequality constraints on parameters

A previous article discussed how to solve regression problems in which the parameters are constrained to be a specified constant (such as B1 = 1) or are restricted to obey a linear equation such as B4 = –2*B2. In SAS, you can use the RESTRICT statement in PROC REG to

Read More

Programming Tips

Rick WicklinJuly 29, 2020 0

Simulate regression models that incorporate CLASS parameterizations

When you write a program that simulates data from a statistical model, you should always check that the simulation code is correct. One way to do this is to generate a large simulated sample, estimate the parameters in the simulated data, and make sure that the estimates are close to

Read More

Analytics

Rick WicklinJune 8, 2020 0

Interactions with spline effects in regression models

A SAS customer asked how to specify interaction effects between a classification variable and a spline effect in a SAS regression procedure. There are at least two ways to do this. If the SAS procedure supports the EFFECT statement, you can build the interaction term in the MODEL statement. For

Read More

Analytics | Data Visualization

Rick WicklinMay 13, 2020 0

Find points where a regression curve has zero slope

This article shows how to find local maxima and maxima on a regression curve, which means finding points where the slope of the curve is zero. An example appears at the right, which shows locations where the loess smoother in a scatter plot has local minima and maxima. Except for

Read More

Analytics | Data Visualization | Machine Learning

Rick WicklinJanuary 13, 2020 0

10 posts from 2019 that deserve a second look

Did you add "learn something new" to your list of New Year's resolutions? Last week, I wrote about the most popular articles from The DO Loop in 2019. The most popular articles are about elementary topics in SAS programming or univariate statistics because those topics have broad appeal. Advanced topics

Read More

Analytics | Learn SAS

Rick WicklinJanuary 8, 2020 0

3 ways to add confidence limits to regression curves in SAS

Many SAS procedures can automatically create a graph that overlays multiple prediction curves and their prediction limits. This graph (sometimes called a "fit plot" or a "sliced fit plot") is useful when you want to visualize a model in which a continuous response variable depends on one continuous explanatory variable

Read More

Analytics | Learn SAS

Rick WicklinNovember 20, 2019 0

Predicted values in generalized linear models: The ILINK option in SAS

In a linear regression model, the predicted values are on the same scale as the response variable. You can plot the observed and predicted responses to visualize how well the model agrees with the data, However, for generalized linear models, there is a potential source of confusion. Recall that a

Read More

Analytics | Learn SAS | Programming Tips

Rick WicklinJune 26, 2019 0

Jump-start PROC LOGISTIC by using parameter estimates from PROC HPLOGISTIC

SAS/STAT software contains a number of so-called HP procedures for training and evaluating predictive models. ("HP" stands for "high performance.") A popular HP procedure is HPLOGISTIC, which enables you to fit logistic models on Big Data. A goal of the HP procedures is to fit models quickly. Inferential statistics such

Read More

Analytics | Data Visualization | Learn SAS

Rick WicklinMay 30, 2019 0

Visualize interaction effects in regression models

Knowing how to visualize a regression model is a valuable skill. A good visualization can help you to interpret a model and understand how its predictions depend on explanatory factors in the model. Visualization is especially important in understanding interactions between factors. Recently I read about work by Jacob A.

Read More

Analytics | Programming Tips

Rick WicklinMay 6, 2019 0

How to simulate data from a generalized linear model

Here's a simulation tip: When you simulate a fixed-effect generalized linear regression model, don't add a random normal error to the linear predictor. Only the response variable should be random. This tip applies to models that apply a link function to a linear predictor, including logistic regression, Poisson regression, and

Read More

Analytics | Learn SAS

Rick WicklinMay 1, 2019 0

Encodings of CLASS variables in SAS regression procedures: A cheat sheet

SAS regression procedures support several parameterizations of classification variables. When a categorical variable is used as an explanatory variable in a regression model, the procedure generates dummy variables that are used to construct a design matrix for the model. The process of forming columns in a design matrix is called

Read More

1 2 3 Next