Blogs

Tag: Simulation

Analytics | Data Visualization | Programming Tips

Rick Wicklin | June 10, 2024

The distribution of the R-square statistic

A SAS analyst ran a linear regression model and obtained an R-square statistic for the fit. However, he wanted a confidence interval, so he posted a question to a discussion forum asking how to obtain a confidence interval for the R-square parameter. Someone suggested a formula from a textbook (Cohen,

Analytics | Learn SAS

Rick Wicklin | May 20, 2024

On the correctness of a discrete simulation

After writing a program that simulates data, it is important to check that the statistical properties of the simulated (synthetic) data match the properties of the model. As a first step, you can generate a large random sample from the model distribution and compare the sample statistics to the expected

Analytics | Programming Tips

Rick Wicklin | May 13, 2024

The distribution of p-values under the null hypothesis

A SAS statistical programmer recently asked a theoretical question about statistics. "I've read that 'p-values are uniformly distributed under the null hypothesis,'" he began, "but what does that mean in practice? Is it important?" I think data simulation is a great way to discuss the conditions for which p-values are

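As a rough illustration of the claim quoted in this teaser, the following Python sketch simulates many samples for which the null hypothesis is true and looks at the resulting p-values. It is not the SAS program from the article. The two-sided z-test with known standard deviation, the sample size, the seed, and the function names `z_test_p_value` and `simulate_p_values` are all assumptions chosen for illustration.

```python
import math
import random

def z_test_p_value(sample, mu0=0.0, sigma=1.0):
    """Two-sided p-value for H0: mean = mu0, assuming a known sigma."""
    n = len(sample)
    z = (sum(sample) / n - mu0) / (sigma / math.sqrt(n))
    # 2 * (1 - Phi(|z|)) equals erfc(|z| / sqrt(2))
    return math.erfc(abs(z) / math.sqrt(2.0))

def simulate_p_values(n_sims=10_000, n=20, seed=2024):
    """Draw n_sims samples from N(0, 1), so H0 (mean = 0) is true each time."""
    rng = random.Random(seed)
    return [
        z_test_p_value([rng.gauss(0.0, 1.0) for _ in range(n)])
        for _ in range(n_sims)
    ]

p_values = simulate_p_values()

# If the p-values are uniform on (0, 1), about 5% should fall below 0.05
# and each tenth of the interval should hold about 10% of them.
frac_below_05 = sum(p < 0.05 for p in p_values) / len(p_values)
decile_counts = [0] * 10
for p in p_values:
    decile_counts[min(int(p * 10), 9)] += 1

print("Proportion of p-values below 0.05:", frac_below_05)
print("Counts per decile:", decile_counts)
```

In this sketch the test statistic is continuous and the model assumptions hold exactly, which are the conditions under which a roughly flat histogram of p-values is expected.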

Learn SAS | Programming Tips

Rick Wicklin | February 19, 2024

The linear distribution on an interval

In a recent Monte Carlo project, I needed to simulate numbers on an interval by using a continuous linear probability density function (PDF). An example is shown to the right. In this example, the linear density function is decreasing on the interval, but the function could also be constant or

Analytics | Learn SAS

Rick Wicklin | January 22, 2024

Angles vs slopes: The statistics of steepness

There are two popular ways to express the steepness of a line or ray. The most-often used mathematical definition is from high-school math where the slope is defined as "rise over run." A second way is to report the angle of inclination to the horizontal, as introduced in basic trigonometry.

Learn SAS | Programming Tips

Rick Wicklin | January 15, 2024

Simulate correlated continuous and discrete variables

Statistical software provides methods to simulate independent random variates from continuous and discrete distributions. For example, in the SAS DATA step, you can use the RAND function to simulate variates from continuous distributions (such as the normal or lognormal distributions) or from discrete distributions (such as the Bernoulli or Poisson).

Advanced Analytics

Harry Snart | October 13, 2023

Simulating a rugby tournament with SAS® Viya®

The world’s largest rugby tournament returns for the knockout stages. This blog post explores how probability and simulation can be used to predict likely winners in each of the knockout stages. Team sports are dynamic, time-varying and complex topics to model. When modeling regular competitions, such as domestic leagues, it

Sports & Entertainment

Analytics | Learn SAS | Programming Tips

Rick Wicklin | September 6, 2023

Model data from published summary statistics

There are many ways to model a set of raw data by using a continuous probability distribution. It can be challenging, however, to choose the distribution that best models the data. Are the data normal? Lognormal? Is there a theoretical reason to prefer one distribution over another? The SAS has

Learn SAS | Programming Tips

Rick Wicklin | August 30, 2023

Simulate the use of personal checks in the US

Does anyone write paper checks anymore? According to researchers at the Federal Reserve Bank of Atlanta (Greene, et al., 2020), the use of paper checks has declined 63% among US consumers since the year 2000. The researchers surveyed more than 3,000 consumers in 2017-2018 and discovered that only 7% of

Analytics | Data Visualization | Programming Tips

Rick Wicklin | August 28, 2023

Generate random uniform points in an ellipse

I have previously written about how to efficiently generate points uniformly at random inside a sphere (often called a ball by mathematicians). The method uses a mathematical fact from multivariate statistics: If X is drawn from the uncorrelated multivariate normal distribution in dimension d, then S = r*X / ||X|| has

Learn SAS | Programming Tips

Rick Wicklin | August 7, 2023

Construct an envelope function for the acceptance-rejection method

The acceptance-rejection method (sometimes called rejection sampling) is a method that enables you to generate a random sample from an arbitrary distribution by using only the probability density function (PDF). This is in contrast to the inverse CDF method, which uses the cumulative distribution function (CDF) to generate a random

Learn SAS | Programming Tips

Rick Wicklin | July 10, 2023

Simulate from a Markov model

A previous article shows an example of a Markov chain model and computes the probability that the system ends up in a terminal state (called an absorbing state). As explained previously, you can often compute exact probabilities for questions about Markov chains. Nevertheless, it can be useful to know how

Analytics | Learn SAS | Programming Tips

Rick Wicklin | April 17, 2023

Should you use the Wald confidence interval for a binomial proportion?

The "Teacher’s Corner" of The American Statistician enables statisticians to discuss topics that are relevant to teaching and learning statistics. Sometimes, the articles have practical relevance, too. Andersson (2023) "The Wald Confidence Interval for a Binomial p as an Illuminating 'Bad' Example," is intended for professors and masters-level students in

Analytics | Learn SAS | Programming Tips

Rick Wicklin | March 13, 2023

Use the metalog distribution in SAS

A previous article describes the metalog distribution (Keelin, 2016). The metalog distribution is a flexible family of distributions that can model a wide range of shapes for data distributions. The metalog system can model bounded, semibounded, and unbounded continuous distributions. This article shows how to use the metalog distribution in

Analytics | Learn SAS | Programming Tips

Rick Wicklin | February 20, 2023

Simulate from a bounded distribution that has a specified mean

A SAS programmer asked for help to simulate data from a distribution that has certain properties. The distribution must be supported on the interval [a, b] and have a specified mean, μ, where a < μ < b. It turns out that there are infinitely many distributions that satisfy these

Data Visualization | Learn SAS | Programming Tips

Rick Wicklin | February 8, 2023

A random walk inside a heart

SAS programmers love to make special graphs for Valentine's Day. In fact, there is a long history of heart-shaped graphs and love-inspired programs written in SAS! Last year, I added to the collection by showing how a ball bounces on a heart-shaped billiards table. This year, I create a similar

Analytics | Data Visualization | Learn SAS

Rick Wicklin | January 18, 2023

Visualize how parameters in a binary logistic regression model affect the probability of the event

A previous article shows that you can use the Intercept parameter to control the ratio of events to nonevents in a simulation of data from a logistic regression model. If you decrease the intercept parameter, the probability of the event decreases; if you increase the intercept parameter, the probability of

Analytics | Programming Tips

Rick Wicklin | January 16, 2023

Simulate data from a logistic regression model: How the intercept parameter affects the probability of the event

This article shows that you can use the intercept parameter to control the probability of the event in a simulation study that involves a binary logistic regression model. For simplicity, I will simulate data from a logistic regression model that involves only one explanatory variable, but the main idea applies

Advanced Analytics

Bahar Biller | December 14, 2022

Accelerating what-if analysis with machine learning

SAS' Bahar Biller expounds on the idea that stochastic simulations are large-data generation programs for highly complex and dynamic stochastic systems.

Learn SAS | Programming Tips

Rick Wicklin | November 30, 2022

Ladders: A probabilistic card trick

A probabilistic card trick is a trick that succeeds with high probability and does not require any skill from the person performing the trick. I have seen a certain trick mentioned several times on social media. I call it "ladders" or the "ladders game" because it reminds me of the

Learn SAS | Programming Tips

Rick Wicklin | November 28, 2022

Simulate poker hands in SAS

A SAS programmer was trying to simulate poker hands. He was having difficulty because the sampling scheme for simulating card games requires that you sample without replacement for each hand. In statistics, this is called "simple random sampling." If done properly, it is straightforward to simulate poker hands in SAS.

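The sampling scheme described in this teaser, drawing cards without replacement, can be sketched in a few lines of Python. This is only an illustration of the idea and not the SAS code from the article. The deck encoding, the seed, and the function name `deal_hands` are hypothetical choices.

```python
import random

RANKS = ["2", "3", "4", "5", "6", "7", "8", "9", "T", "J", "Q", "K", "A"]
SUITS = ["C", "D", "H", "S"]
DECK = [rank + suit for suit in SUITS for rank in RANKS]  # 52 distinct cards

def deal_hands(n_hands, hand_size=5, seed=2022):
    """Deal n_hands hands from one deck by sampling without replacement."""
    rng = random.Random(seed)
    # random.sample never returns the same list element twice, so no card
    # can appear in more than one position or more than one hand.
    cards = rng.sample(DECK, n_hands * hand_size)
    return [cards[i * hand_size:(i + 1) * hand_size] for i in range(n_hands)]

hands = deal_hands(4)
for player, hand in enumerate(hands, start=1):
    print("Player", player, ":", " ".join(hand))
```

Because all hands come from a single call to `random.sample`, the sketch models one deal from one deck. To simulate many independent deals, call the function repeatedly with different seeds.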

Analytics | Data Visualization | Learn SAS | Programming Tips

Rick Wicklin | November 7, 2022

The area of the convex hull of random points

I recently blogged about how to compute the area of the convex hull of a set of planar points. This article discusses the expected value of the area of the convex hull for n random uniform points in the unit square. The article introduces an exact formula (due to Buchta,

Learn SAS | Programming Tips

Rick Wicklin | October 10, 2022

The expected volume of a random tetrahedron in a cube

One of the benefits of social media is the opportunity to learn new things. Recently, I saw a post on Twitter that intrigued me. The tweet said that the expected volume of a random tetrahedron in the unit cube (in 3-D) is E[Volume] = 0.0138427757.... This number seems surprisingly small!

Learn SAS | Programming Tips

Rick Wicklin | September 19, 2022

Generate random ID values for subjects in SAS

A common question on SAS discussion forums is how to use SAS to generate random ID values. The use case is to generate a set of random strings to assign to patients in a clinical study. If you assign each patient a unique ID and delete the patients' names, you

Analytics | Learn SAS

Rick Wicklin | August 1, 2022

Introductory examples of Monte Carlo simulation in SAS

When I was writing Simulating Data with SAS (Wicklin, 2013), I read a lot of introductory textbooks about Monte Carlo simulation. One of my favorites is Sheldon Ross's book Simulation. (I read the 4th Edition (2006); the 5th Edition was published in 2013.) I love that the book brings together

Learn SAS | Programming Tips

Rick Wicklin | July 25, 2022

Monte Carlo estimates of area

I've previously shown how to use Monte Carlo simulation to estimate probabilities and areas. I illustrated the Monte Carlo method by estimating π ≈ 3.14159... by generating points uniformly at random in a unit square and computing the proportion of those points that were inside the unit circle. The previous

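The Monte Carlo estimate of π mentioned in this teaser can be sketched as follows. This is a minimal Python illustration, not the SAS program from the article. It assumes the unit square is [0, 1] x [0, 1], so that the points inside the unit circle fill a quarter disk of area π/4. The seed and the function name `estimate_pi` are hypothetical.

```python
import random

def estimate_pi(n_points, seed=12345):
    """Estimate pi from uniform random points in the square [0, 1] x [0, 1]."""
    rng = random.Random(seed)
    inside = 0
    for _ in range(n_points):
        x = rng.random()
        y = rng.random()
        # The point lies inside the unit circle when x^2 + y^2 <= 1
        if x * x + y * y <= 1.0:
            inside += 1
    # The proportion of points inside estimates pi/4, so multiply by 4
    return 4.0 * inside / n_points

pi_hat = estimate_pi(100_000)
print("Monte Carlo estimate of pi:", pi_hat)
```

With 100,000 points the standard error of this estimate is roughly 0.005, so the result is typically accurate to about two decimal places.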

Advanced Analytics | Analytics

Bahar Biller | July 20, 2022

Digital twin development: Why simulations are critical

SAS' Bahar Biller reveals how simulations enable KPI generation, risk quantification, risk management and more.

Analytics | Learn SAS | Programming Tips

Rick Wicklin | May 2, 2022

Simulate the null distribution for a hypothesis test

Recently, I wrote about Bartlett's test for sphericity. The purpose of this hypothesis test is to determine whether the variables in the data are uncorrelated. It works by testing whether the sample correlation matrix is close to the identity matrix. Often statistics textbooks or articles include a statement such as

Analytics | Customer Intelligence | Learn SAS | Students & Educators

Alex Coop | March 21, 2022

Hooked on data science: gamification drives engagement among students and trainees

While studying business intelligence as an undergraduate student at business school HEC Montreal, Camille Duchesne encountered Cortex, an analytics simulation that pits participants against each other to develop the most accurate models for a particular task. In this case, the simulation supports a fictional charity by predicting which subjects from

Programming Tips

Rick Wicklin | January 20, 2022

How often do different statistical tests agree? A simulation study

Here's a fun problem to think about: Suppose that you have two different valid ways to test a statistical hypothesis. For a given sample, will both tests reject or fail to reject the hypothesis? Or might one test reject it whereas the other does not? The answer is that two
