A well-formed WHERE statement or subsetting IF can narrow down the output of your SAS DATA step. The SAS log does a good job of telling you how many records were processed by the action. For example, let's look at this simple DATA step with my "poor man's random sample",
Uncategorized
data:image/s3,"s3://crabby-images/077ec/077ece051acd64bd9a57c3b665f6c2d40898ac4d" alt=""
A few weeks ago, in Northern Virginia, a 30 foot highway sign fell onto I-66 and landed on a passing pickup truck. Fortunately, no one was hurt, but it drew media attention and caused motorists in the area to wonder about the safety of other signs and the transportation network
data:image/s3,"s3://crabby-images/7b8ec/7b8ec85b31ddd31a4a56b6bb5c2ff680095e1347" alt=""
There is something that 90% of us admit to doing, and the other 10% will lie about. That, of course, is Googling yourself. As an avid follower of myself, and everything I do, I look forward to a weekly Google Alert that tells me all about what I've been up to.
data:image/s3,"s3://crabby-images/89cd7/89cd7ddd11cbfab2f96c911cdf089cb4a349d0b7" alt=""
After unwittingly getting involved recently in a code vs GUI discussion another pro GUI vote came in yesterday when presenting to a customer's internal user group. When creating and using prompts in SAS Enterprise Guide, it is a no-brainer to recommend leveraging the %_eg_WhereParam as it handles all the special
data:image/s3,"s3://crabby-images/ad1df/ad1df43f9b4b5748c449e902112a387ab7f2659e" alt=""
Most statistical programmers have seen a graph of a normal distribution that approximates a binomial distribution. The figure is often accompanied by a statement that gives guidelines for when the approximation is valid. For example, if the binomial distribution describes an experiment with n trials and the probability of success
data:image/s3,"s3://crabby-images/04c10/04c102886cd27d3a8c96aee1f2cee92b6cea5894" alt=""
Did you oversleep this morning? If you live in the United States of America, Monday morning seems to have arrived just a bit earlier, accompanied by a bit more "dark" than usual. That's because as good time-fearing citizens, we have all set our clocks ahead by one hour so as
data:image/s3,"s3://crabby-images/23198/231982a57a5655a8c69ace4adb4d42736c299e9a" alt=""
SAS provides several ways to compute sample quantiles of data. The UNIVARIATE procedure can compute quantiles (also called percentiles), but you can also compute them in the SAS/IML language. Prior to SAS/IML 9.22 (released in 2010) statistical programmers could call a SAS/IML module that computes sample quantiles. With the release
data:image/s3,"s3://crabby-images/98bec/98becdc787c3bfdc96d5d5a4deb9d279798f2a49" alt=""
In the United States, this upcoming weekend is when we turn our clocks forward one hour as we adopt daylight saving time. (Some people will also flip their mattresses this weekend!) Daylight saving time (DST) in the US begins on the second Sunday in March and ends on the first
data:image/s3,"s3://crabby-images/7b8ec/7b8ec85b31ddd31a4a56b6bb5c2ff680095e1347" alt=""
Apparently the prolonged use of OxyContin will give you a pompous and surley demeanor, and make you say a lot of really ignorant things. So I implore you, dear readers, to withhold your use of such a substance, preserve your good attitude and brain cells, and participate in a research study
A few years ago I had the privilege of presenting the last technical paper at SAS Global Forum. This year, conference chair Andy Kuligowski asked me to go one better than that, and present a talk at the official Closing Session. What will I talk about? That's a mystery (maybe
data:image/s3,"s3://crabby-images/dc8d7/dc8d78c22dcbea850cc2bf8f71d6d8b63a0b2260" alt="Get a quick summary of your data using this task."
During IFSUG yesterday, Sunil Gupta gave attendees to his presentation a special homework assignment. Look into the SAS Enterprise Guide task 'Characterize Data'. Sunil suggested that this was a simple approach to quickly getting a summary of all the variables within your data table. Of course, some programmers will use
data:image/s3,"s3://crabby-images/2a1ff/2a1ff9f96145ce59607fc60ef272b821e6463ab6" alt=""
I work with continuous distributions more often than with discrete distributions. Consequently, I am used to thinking of the quantile function as being an inverse cumulative distribution function (CDF). (These functions are described in my article, "Four essential functions for statistical programmers.") For discrete distributions, they are not. To quote
data:image/s3,"s3://crabby-images/dc3f8/dc3f8c35f19651a458b6ab69dc9b397fbb534c3e" alt=""
Today at IFSUG, Chuck Patridge presented a wonderful talk about how to complete fuzzy matching using BASE SAS tools. Chuck has been programming SAS since 1979 and has been tasked multiple times with coming up with in-house solutions to address business needs without the software costs that are typically associated
The next time you write a DATA step, try to express it in iambic pentameter. Or instead of a SAS macro function, how about a SAS macro sonnet? (Or, for the more base among you, a limerick?) That's the spirit behind the code {poems} project. You write a poem in
data:image/s3,"s3://crabby-images/63cd7/63cd7b7ec6e9e75411a4fc3a2d03bb299819d533" alt=""
As a SAS developer, I am always looking ahead to the next release of SAS. However, many SAS customer sites migrate to new releases slowly and are just now adopting versions of SAS that were released in 2010 or 2011. Consequently, I want to write a few articles that discuss
data:image/s3,"s3://crabby-images/8b850/8b8509856669a3a755084fe90a1e448a2e879b3b" alt=""
I've blogged several times about multivariate normality, including how to generate random values from a multivariate normal distribution. But given a set of multivariate data, how can you determine if it is likely to have come from a multivariate normal distribution? The answer, of course, is to run a goodness-of-fit
data:image/s3,"s3://crabby-images/78804/788047c28e53829d8d55afa8fc6fd5b9bdead68a" alt=""
About once a month, a customer approaches SAS and asks a question of significance. By "significance", I don't necessarily mean "of great importance", but instead I mean "of how SAS handles large numbers, or floating-point values with many significant digits". In response, we always first ask why they asked. This
In the previous post, “Roses are red, violets are blue…”, I discussed the general problem of getting style attributes to line up with specific group values and some ways to overcome the problem. In this installment, I want to elaborate on the attribute map functionality in the Graph Template Language
data:image/s3,"s3://crabby-images/b9b92/b9b92c494f12bed89b4dd252d642428707de7ec9" alt=""
Sometimes in matrix computations you need to obtain the values of certain submatrices such as the diagonal elements or the super- or subdiagonal elements. About a year ago, I showed one way to do that: convert subscripts to indices and vice-versa. However, a tip from @RLangTip on Twitter got me
data:image/s3,"s3://crabby-images/c6282/c6282ad3b1dbb289dcc9865821af5ea910af9b0a" alt=""
Neil Constable is a Principal Education Consultant at SAS in the United Kingdom, where he applies his extensive knowledge of Base SAS, SAS Enterprise Guide, and the SAS business intelligence tools. He's also the author of SAS Programming for Enterprise Guide Users, Second Edition--and this week's featured tip. You can get to know Neil
data:image/s3,"s3://crabby-images/0dcb9/0dcb927bc5e02a48e3fd7cc667874d5237eb2573" alt=""
This question came up while helping my presales colleague with a client’s question: “Is Microsoft Excel in direct competition with SAS?” Not really. I like to think of SAS as partnering with Excel. Because of the following Excel benefits and its synergy with SAS. Excel has been around for years.
data:image/s3,"s3://crabby-images/c56cf/c56cfeb1bcde6174ec594304d23eadd6fda404aa" alt=""
I recently saw a SAS Knowledge Base article called "How to stop processing your code if a certain condition is met." The article discusses the use of the %RETURN macro statement to abort the execution of a SAS program if some condition occurs. The "condition" is usually an error that
data:image/s3,"s3://crabby-images/bab49/bab494d8ff352afc4b9291b1039d9c2d68b31112" alt=""
Do you want your report to look good on the web, or to look good when you print it? Pick one. Before the SAS Report file format, that was the choice that you faced. HTML is perfect for the web browser. It's easy to scroll through tables, to apply an
data:image/s3,"s3://crabby-images/7b8ec/7b8ec85b31ddd31a4a56b6bb5c2ff680095e1347" alt=""
Citing online job postings reviewed by talent data firm Wanted Analytics, and a Software Advice blog by Michael Koploy, APICS e-News reports that "Demand planning analysts" are hot -- one of the five hottest careers in logistics. (Free subscription to APICS e-News) Clearly, APICS means there are a lot of good jobs
data:image/s3,"s3://crabby-images/3a27e/3a27eb1e8d148dd3c3e98710a2b9a051aed67bf2" alt=""
Unlike BASE SAS tables, OLAP cubes must exist in within the metadata in order to access from any of the OLAP Viewers. In addition to having some metadata OLAP cubes have a physical file structure presence (at least for MOLAP/HOLAP because it's a different story for ROLAP). When you refresh
data:image/s3,"s3://crabby-images/7a9e3/7a9e3ac522a1387ad9c17f23b8007066e3a175b3" alt=""
I recently blogged about Mahalanobis distance and what it means geometrically. I also previously showed how Mahalanobis distance can be used to compute outliers in multivariate data. But how do you compute Mahalanobis distance in SAS? Computing Mahalanobis distance with built-in SAS procedures and functions There are several ways to
data:image/s3,"s3://crabby-images/7b8ec/7b8ec85b31ddd31a4a56b6bb5c2ff680095e1347" alt=""
Recently I complained about the stock art used on The BFD blog header. So I was foaming with excitement when Alison Bolen (who oversees the SAS blogging platform) kindly took notice, and enlisted Mr. Blackwell to come up with something more pleasing and appropriate. OMG Mr. Blackwell!!! As a huge fan of
data:image/s3,"s3://crabby-images/d2020/d20205ae944a577dbac3261d4e5d6f13ae56ff72" alt=""
Teachers have more than enough to juggle each day, lacking the time to search for, and find, high-quality curricular resources online. When I would search for lesson plan supplements, I would often get lost in Google's abyss of results, spending far too much precious time sifting through mediocre materials. Until
data:image/s3,"s3://crabby-images/bca4e/bca4e311b21f280228009dd1a9bda909232be9e6" alt=""
The SAS DATA step supports a special syntax for determining whether a value is contained in an interval: y = (-2 < x < 2); This expression creates an indicator variable with the value 1 if x is in the interval (-2,2) and 0 otherwise. There is not a standard
data:image/s3,"s3://crabby-images/c29a8/c29a8cf6d7fb2487ddde2ac2ca45150b503c28df" alt="Converting between correlation and covariance matrices"
I have previously blogged about how to convert a covariance matrix into a correlation matrix in SAS (and the other way around). However, I still get questions about it, perhaps because my previous post demonstrated more than one way to accomplish each transformation. To eliminate all confusion, the following SAS/IML