Blogs

Blogs

Tag: Simulation

Analytics | Programming Tips

Rick WicklinSeptember 25, 2017 0

Simulate multivariate normal data in SAS by using PROC SIMNORMAL

My article about Fisher's transformation of the Pearson correlation contained a simulation. The simulation uses the RANDNORMAL function in SAS/IML software to simulate multivariate normal data. If you are a SAS programmer who does not have access to SAS/IML software, you can use the SIMNORMAL procedure in SAS/STAT software to

Read More

Advanced Analytics | Learn SAS | Programming Tips

Simulate clustered data from a Gaussian mixture distribution

Rick WicklinSeptember 13, 2017 0

Simulate multivariate clusters in SAS

This article shows how to simulate data from a mixture of multivariate normal distributions, which is also called a Gaussian mixture. You can use this simulation to generate clustered data. The adjacent graph shows three clusters, each simulated from a four-dimensional normal distribution. Each cluster has its own within-cluster covariance,

Read More

Programming Tips

Broken stick problem: What is the probability that three randomly chosen points will break a segment into a triangle?

Rick WicklinJuly 26, 2017 0

Random segments and broken sticks

A classical problem in elementary probability asks for the expected lengths of line segments that result from randomly selecting k points along a segment of unit length. It is both fun and instructive to simulate such problems. This article uses simulation in the SAS/IML language to estimate solutions to the

Read More

Programming Tips

Rick WicklinJune 5, 2017 0

Runs in coin tosses; patterns in random seating

If you toss a coin 28 times, you would not be surprised to see three heads in a row, such as ...THHHTH.... But what about eight heads in a row? Would a sequence such as THHHHHHHHTH... be a rare event? This question popped into my head last weekend as I

Read More

Learn SAS | Programming Tips

How to generate random numbers in SAS

Rick WicklinJune 1, 2017 0

How to choose a seed for generating random numbers in SAS

Last week I was asked a simple question: "How do I choose a seed for the random number functions in SAS?" The answer might surprise you: use any seed you like. Each seed of a well-designed random number generator is likely to give rise to a stream of random numbers,

Read More

Learn SAS | Programming Tips

Rick WicklinMay 10, 2017 0

Simulate lognormal data in SAS

A SAS customer asked how to simulate data from a three-parameter lognormal distribution as specified in the PROC UNIVARIATE documentation. In particular, he wanted to incorporate a threshold parameter into the simulation. Simulating lognormal data is easy if you remember an important fact: if X is lognormally distributed, then Y=log(X)

Read More

Advanced Analytics | SAS Events

Ed HughesMarch 31, 2017 0

Operations Research Talks at SAS Global Forum 2017

The 2017 edition of SAS Global Forum, the largest annual SAS user group meeting, will be held at the Swan and Dolphin Resort in Orlando, Florida on April 2-5. Among the many analytic talks at SAS Global Forum 2017, several focus on operations research topics like optimization and simulation. If

Read More

Advanced Analytics

Rick WicklinMarch 1, 2017 0

Monte Carlo estimates of joint probabilities

Monte Carlo techniques have many applications, but a primary application is to approximate the probability that some event occurs. The idea is to simulate data from the population and count the proportion of times that the event occurs in the simulated data. For continuous univariate distributions, the probability of an

Read More

Advanced Analytics

Rick WicklinFebruary 1, 2017 0

Simulate many samples from a linear regression model

In a previous article, I showed how to simulate data for a linear regression model with an arbitrary number of continuous explanatory variables. To keep the discussion simple, I simulated a single sample with N observations and p variables. However, to use Monte Carlo methods to approximate the sampling distribution

Read More

Advanced Analytics

Rick WicklinJanuary 25, 2017 0

Simulate data for a linear regression model

This article shows how to simulate a data set in SAS that satisfies a least squares regression model for continuous variables. When you simulate to create "synthetic" (or "fake") data, you (the programmer) control the true parameter values, the form of the model, the sample size, and magnitude of the

Read More

Rick WicklinDecember 28, 2016 0

The contaminated normal distribution

How can you generate data that contains outliers in a simulation study? The contaminated normal distribution is a simple but useful distribution you can use to simulate outliers. The distribution is easy to explain and understand, and it is also easy to implement in SAS. What is a contaminated normal

Read More

Rick WicklinNovember 28, 2016 0

Goodness-of-fit tests: A cautionary tale for large and small samples

In the classic textbook by Johnson and Wichern (Applied Multivariate Statistical Analysis, Third Edition, 1992, p. 164), it says: All measures of goodness-of-fit suffer the same serious drawback. When the sample size is small, only the most aberrant behaviors will be identified as lack of fit. On the other hand,

Read More

Rick WicklinNovember 23, 2016 0

Sampling variation in small random samples

Somewhere in my past I encountered a panel of histograms for small random samples of normal data. I can't remember the source, but it might have been from John Tukey or William Cleveland. The point of the panel was to emphasize that (because of sampling variation) a small random sample

Read More

Advanced Analytics

Ed HughesNovember 10, 2016 0

SAS at the 2016 INFORMS Annual Meeting

The 2016 INFORMS Annual Meeting will be held at the Music City Center and Omni Nashville Hotel in downtown Nashville, TN on November 13-16, with pre-conference events starting on Saturday, November 12. SAS will be a major participant in this conference. Over two dozen people from SAS will attend, with

Read More

Rick WicklinOctober 26, 2016 0

Create patterns of missing data

When simulating data or testing algorithms, it is useful to be able to generate patterns of missing data. This article shows how to generate random and systematic patterns of missing values. In other words, this article shows how to replace nonmissing data with missing data. Generate a random pattern of

Read More

Rick WicklinSeptember 21, 2016 0

Simulate data from a generalized Gaussian distribution

Although statisticians often assume normally distributed errors, there are important processes for which the error distribution has a heavy tail. A well-known heavy-tailed distribution is the t distribution, but the t distribution is unsuitable for some applications because it does not have finite moments (means, variance,...) for small parameter values.

Read More

Rick WicklinSeptember 8, 2016 0

Coverage probability of confidence intervals: A simulation approach

The article uses the SAS DATA step and Base SAS procedures to estimate the coverage probability of the confidence interval for the mean of normally distributed data. This discussion is based on Section 5.2 (p. 74–77) of Simulating Data with SAS. What is a confidence interval? Recall that a confidence

Read More

Advanced Analytics

Ed HughesSeptember 7, 2016 0

Operations Research Talks at Analytics Experience 2016

Analytics Experience 2016 will be held on Sept. 12-14, 2016 at the Bellagio in Las Vegas, NV. There will be a great number of excellent talks and demonstrations at the conference, covering many aspects of SAS analytics and many practical applications. Several of these sessions deal directly with the use

Read More

Advanced Analytics

Rob PrattApril 15, 2016 0

SAS/OR at SAS Global Forum 2016

This year's SAS Global Forum conference will take place April 18-21 at The Venetian in Las Vegas. For SAS/OR, SAS staff will present two Super Demos and three papers:

Read More

Rick WicklinApril 13, 2016 0

Head-tail versus head-head: A counterintuitive property of coin tosses

I saw an interesting mathematical result in Wired magazine. The original article was about mathematical research into prime numbers, but the article included the following tantalizing fact: If Alice tosses a [fair]coin until she sees a head followed by a tail, and Bob tosses a coin until he sees two

Read More

Advanced Analytics

Ed HughesApril 8, 2016 0

SAS at the 2016 INFORMS Conference on Business Analytics and Operations Research

SAS will have a major presence at the 2016 INFORMS Conference on Business Analytics and Operations Research, which will be held at the Hyatt Regency Grand Cypress hotel in Orlando, FL on April 10-12. Many SAS staff will participate in this conference. SAS/OR, the SAS Global Academic Program, and JMP

Read More

Generate random points uniformly in a ball

Rick WicklinApril 6, 2016 0

Generate points uniformly inside a d-dimensional ball

Last week I showed how to generate random points uniformly inside a 2-d circular region. That article showed that the distance of a point to the circle's center cannot be distributed uniformly. Instead, you should use the square root of a uniform variate to generate 2-D distances to the origin.

Read More

Rick WicklinMarch 30, 2016 0

Generate points uniformly inside a circular region in 2-D

It is easy to generate random points that are uniformly distributed inside a rectangle. You simply generate independent random uniform values for each coordinate. However, nonrectangular regions are more complicated. An instructive example is to simulate points uniformly inside the ball with a given radius. The two-dimensional case is to

Read More

Rick WicklinMarch 16, 2016 0

Simulate from the multinomial distribution in the SAS DATA step

There are several ways to simulate multinomial data in SAS. In the SAS/IML matrix language, you can use the RANDMULTINOMIAL function to generate samples from the multinomial distribution. If you don't have a SAS/IML license, I have previously written about how to use the SAS DATA step or PROC SURVEYSELECT

Read More

Rick WicklinMarch 14, 2016 0

Monte Carlo estimates of pi and an important statistical lesson

Today is March 14th, which is annually celebrated as Pi Day. Today's date, written as 3/14/16, represents the best five-digit approximation of pi. On Pi Day, many people blog about how to approximate pi. This article uses a Monte Carlo simulation to estimate pi, in spite of the fact that

Read More

Rick WicklinFebruary 15, 2016 0

Four essential sampling methods in SAS

Many simulation and resampling tasks use one of four sampling methods. When you draw a random sample from a population, you can sample with or without replacement. At the same time, all individuals in the population might have equal probability of being selected, or some individuals might be more likely

Read More

Rick WicklinFebruary 10, 2016 0

Sample with replacement and unequal probability in SAS

How do you sample with replacement in SAS when the probability of choosing each observation varies? I was asked this question recently. The programmer thought he could use PROC SURVEYSELECT to generate the samples, but he wasn't sure which sampling technique he should use to sample with unequal probability. This

Read More

Nicole TschauderDecember 21, 2015 0

Simulation - Verladung der Geschenke (SAS Adventskalender 21. Pforte)

Die gescorten Rentiere scharren heute schon ganz nervös mit den Hufen. Bald geht es los! Sie freuen sich schon so auf die Reise. Überall auf der Erde ist es so schön geschmückt, alles leuchtet und blinkt! Und vielleicht liegt sogar ein bisschen Schnee. Heute wird mal wieder simuliert. Unsere bisherigen

Read More

Advanced Analytics

Rob PrattOctober 30, 2015 0

SAS-Related Talks at the INFORMS 2015 Annual Meeting

The INFORMS 2015 Annual Meeting will be held in Philadelphia November 1-4. More than two dozen SAS staff will participate, and SAS will have three adjacent booths representing SAS/OR (and all of Advanced Analytics), JMP, and the SAS Global Academic Program. SAS is well-represented among the presentations at this meeting,

Read More

Rick WicklinOctober 28, 2015 0

Monte Carlo simulation for contingency tables in SAS

The FREQ procedure in SAS supports computing exact p-values for many statistical tests. For small and mid-sized problems, the procedure runs very quickly. However, even though PROC FREQ uses efficient methods to avoid unnecessary computations, the computational time required by exact tests might be prohibitively expensive for certain tables. If

Read More

Previous 1 2 3 4 5 6 7 Next