Strengthen your programming skills with tips and techniques from the experts
By using data provided by a Game of Thrones fan, we use SAS to look at screen time for scene locations and characters in this crazy popular show.
Strengthen your programming skills with tips and techniques from the experts
By using data provided by a Game of Thrones fan, we use SAS to look at screen time for scene locations and characters in this crazy popular show.
Did you recently lock in a mortgage rate of 3.5-5%, or perhaps even lower? How do today's rates compare to rates in the past? Let's dig a little deeper into this topic, with some graphs! But before we get started, here's a picture to get you into the mood for
I think every course in exploratory data analysis should begin by studying Anscombe's quartet. Anscombe's quartet is a set of four data sets (N=11) that have nearly identical descriptive statistics but different graphical properties. They are a great reminder of why you should graph your data. You can read about
This blog post is based on the Code Snippets tutorial video in the free SAS® Viya® Enablement course from SAS Education. Keep reading to learn more about code snippets or check out the video to follow along with the tutorial in real-time. Has there ever been a block of code
In the past, I created some graphs about our record low unemployment rate (US unemployment, and state-level unemployment), but does low unemployment also mean there are jobs available? Let's have a look at the data!... Existing Graphs I knew I couldn't be the only one interested in this kind of
A quadratic form is a second-degree polynomial that does not have any linear or constant terms. For multivariate polynomials, you can quickly evaluate a quadratic form by using the matrix expression x` A x This computation is straightforward in a matrix language such as SAS/IML. However, some computations in statistics
At the end of my SAS Users blog post explaining how to install SAS Viya on the Azure Cloud for a SAS Hackathon in the Nordics, I promised to provide some technical background. I ended up with only one manual step by launching a shell script from a Linux machine
The past few years, I've created maps showing things you might visit and do during the big SAS Global Forum (SASGF) conference. This year the conference is in Dallas, and ... you guessed it ... I've created a Dallas map! If you're familiar with my previous maps, you know that
This blog post could be subtitled "To Catch a Thief" or maybe "Go ahead. Steal this blog. I dare you."* That's because I've used this technique several times to catch and report other web sites who lift the blog content from blogs.sas.com and present it as their own. Syndicating blog
In numerical linear algebra, there are often multiple ways to solve a problem, and each way is useful in various contexts. In fact, one of the challenges in matrix computations is choosing from among different algorithms, which often vary in their use of memory, data access, and speed. This article
As word spreads that SAS integrates with open source technologies, people are beginning to explore how to connect, interact with, and use SAS in new ways. More and more users are examining the possibilities and with this comes questions like: How do I code A, integrate B, and accomplish C?
SAS Global Forum 2019 (SGF) is rapidly approaching - and which of the hundreds of presentations are you planning to attend? Well, no matter what types of analyses you perform with SAS software, you'll most likely want to present your findings in a really nice/informative graph! Therefore I highly recommend
Suppose you need to assign 100 patients equally among 3 treatment groups in a clinical study. Obviously, an equal allocation is impossible because the second number does not evenly divide the first, but you can get close by assigning 34 patients to one group and 33 to the others. Mathematically,
Recently, you may have heard about the release of the new SAS Cloud. The platform allows fast access to data-science applications in the cloud! Running on the SAS Cloud and using the latest container technology, SAS Cloud eliminates the need to install, update, or maintain software or related infrastructure. SAS
You are a data scientist, in your office, doing data scientist-y things when, your manager's, manager's, manager makes an impossible request. She wants you take a raw data set from the stem cell research team, scrub the data, create and score models, and be ready to rescore when new data
I recently saw an interesting graph that showed the number of motor vehicle crash deaths has been going down. The graph showed deaths per mile. That's a good statistic, but I wondered whether there were other ways to look at the data? An Interesting Graph Here's the graph, from an
I've previously written about how to deal with nonconvergence when fitting generalized linear regression models. Most generalized linear and mixed models use an iterative optimization process, such as maximum likelihood estimation, to fit parameters. The optimization might not converge, either because the initial guess is poor or because the model
Many SAS procedures support the BY statement, which enables you to perform an analysis for subgroups of the data set. Although the SAS/IML language does not have a built-in "BY statement," there are various techniques that enable you to perform a BY-group analysis. The two I use most often are
Whether you are a strong believer in the power of dividing by zero, agnostic, undecided, a supporter, denier or anything in between and beyond, this blog post will bring all to a common denominator. History of injustice For how many years have you been told that you cannot divide by
Also celebrating is your bank's branch manager. She was skeptical when headquarters analysts equipped branches for "Cloud-based application using SAS" , saying it would speed up loan applications. But your quick, frictionless transaction proved them right.The bank's accountants are happy too. The new pay-as-you-go mode of using SAS software in
During the year 2020, many countries and areas will be conducting their decennial census, and making projections to estimate what their population will be in the future. Therefore I decided to dust off one of my old SAS/Graph samples based on the 2010 census, and rewrite it using more modern
An important concept in multivariate statistical analysis is the Mahalanobis distance. The Mahalanobis distance provides a way to measure how far away an observation is from the center of a sample while accounting for correlations in the data. The Mahalanobis distance is a good way to detect outliers in multivariate
Flooding has been in the news the past few days, and that makes me want to analyze some data! I hang out at Jordan Lake (here in central North Carolina) a lot, so I decided to download the data for that lake, and do a graphical analysis. If you're interested
Here in the US, there's a lot of talk about the flu each year. First, people discuss whether or not to get the flu shot. Then there are discussions about whether or not you or your friends have the flu (or something else). Then the discussions about what strain of
Statisticians often emphasize the dangers of extrapolating from a univariate regression model. A common exercise in introductory statistics is to ask students to compute a model of population growth and predict the population far in the future. The students learn that extrapolating from a model can result in a nonsensical
SAS makes it easy for you to create a large amount of procedure output with very few statements. However, when you create a large amount of procedure output with the Output Delivery System (ODS), your SAS session might stop responding or run slowly. In some cases, SAS generates a “Not
US farmers grow a lot of food ... but did you know some of them also grow fuel for our vehicles? Follow along and you'll learn how much fuel they grow, and also learn some tips about plotting this type of data! These days most gasoline in the US has
It's time to celebrate Pi Day! Every year on March 14th (written 3/14 in the US), math-loving folks celebrate "all things pi-related" because 3.14 is the three-decimal approximation to the mathematical constant, π. Although children learn that pi is approximately 3.14159..., the actual definition of π is the ratio of
A SAS programmer posted an interesting question on a SAS discussion forum. The programmer wanted to iterate over hundreds of SAS data sets, read in all the character variables, and then do some analysis. However, not every data set contains character variables, and SAS complains when you ask it to
According to the most recent data, the child poverty rate in China is 33.1% - the rate in Denmark is 2.9%. Where do other countries fall in between these two extremes? Let's build a graph and find out! (or, if you're not interested in the code - jump to the