Rick Wicklin, Author at The DO Loop

Author

Rick Wicklin RSS
Distinguished Researcher in Computational Statistics

Rick Wicklin, PhD, is a distinguished researcher in computational statistics at SAS and is a principal developer of SAS/IML software. His areas of expertise include computational statistics, simulation, statistical graphics, and modern methods in statistical data analysis. Rick is author of the books Statistical Programming with SAS/IML Software and Simulating Data with SAS.

Data Visualization | Learn SAS

Rick WicklinFebruary 27, 2019 4

3 ways to create nested box plots in SAS

Box plots are a great way to compare the distributions of several subpopulations of your data. For example, box plots are often used in clinical studies to visualize the response of patients in various cohorts. This article describes three techniques to visualize responses when the cohorts have a nested or

English

Analytics | Data Visualization | Learn SAS

Rick WicklinFebruary 25, 2019 0

Graphs of bootstrap statistics in PROC TTEST

When I run a bootstrap analysis, I create graphs to visualize the distribution of the bootstrap statistics. For example, in my article about how to bootstrap the difference of means in a two-sample t test, I included a histogram of the bootstrap distribution and added reference lines to indicate a

English

Analytics | Data Visualization

Rick WicklinFebruary 20, 2019 7

An easier way to create a calibration plot in SAS

Last year I published a series of blogs posts about how to create a calibration plot in SAS. A calibration plot is a way to assess the goodness of fit for a logistic model. It is a diagnostic graph that enables you to qualitatively compare a model's predicted probability of

English

Programming Tips

Rick WicklinFebruary 18, 2019 34

An easier way to perform regression with restricted cubic splines in SAS

Maybe if we think and wish and hope and pray It might come true. Oh, wouldn't it be nice? The Beach Boys Months ago, I wrote about how to use the EFFECT statement in SAS to perform regression with restricted cubic splines. This is the modern way to use splines

English

Analytics | Learn SAS | Programming Tips

Rick WicklinFebruary 13, 2019 0

3 ways to obtain the Hessian at the MLE solution for a regression model

When you use maximum likelihood estimation (MLE) to find the parameter estimates in a generalized linear regression model, the Hessian matrix at the optimal solution is very important. The Hessian matrix indicates the local shape of the log-likelihood surface near the optimal value. You can use the Hessian to estimate

Analytics | Learn SAS

Rick WicklinFebruary 11, 2019 13

4 reasons to use PROC PLM for linear regression models in SAS

Have you ever run a regression model in SAS but later realize that you forgot to specify an important option or run some statistical test? Or maybe you intended to generate a graph that visualizes the model, but you forgot? Years ago, your only option was to modify your program

English

Advanced Analytics | Machine Learning

Rick WicklinFebruary 6, 2019 0

Feature generation and correlations among features in machine learning

Feature generation (also known as feature creation) is the process of creating new features to use for training machine learning models. This article focuses on regression models. The new features (which statisticians call variables) are typically nonlinear transformations of existing variables or combinations of two or more existing variables. This

English

Advanced Analytics | Machine Learning

Rick WicklinFebruary 4, 2019 5

Model selection with PROC GLMSELECT

I previously discussed how you can use validation data to choose between a set of competing regression models. In that article, I manually evaluated seven models for a continuous response on the training data and manually chose the model that gave the best predictions for the validation data. Fortunately, SAS

English

Advanced Analytics | Machine Learning

Rick WicklinJanuary 30, 2019 3

Model assessment and selection in machine learning

Machine learning differs from classical statistics in the way it assesses and compares competing models. In classical statistics, you use all the data to fit each model. You choose between models by using a statistic (such as AIC, AICC, SBC, ...) that measures both the goodness of fit and the

English

Learn SAS | Programming Tips

Parameter estimates for synthetic (simulated) data that follows a regression model.

Rick WicklinJanuary 28, 2019 2

Simulate data for a regression model with categorical and continuous variables

This article shows how to use SAS to simulate data that fits a linear regression model that has categorical regressors (also called explanatory or CLASS variables). Simulating data is a useful skill for both researchers and statistical programmers. You can use simulation for answering research questions, but you can also

English

Blogs

Blogs

Author

Follow Us

What is...