I recently blogged about Mahalanobis distance and what it means geometrically. I also previously showed how Mahalanobis distance can be used to compute outliers in multivariate data. But how do you compute Mahalanobis distance in SAS? Computing Mahalanobis distance with built-in SAS procedures and functions There are several ways to
English
The smallest aircraft I’ve ever flown on was the one I took from Raleigh to Hilton Head, South Carolina to attend my first PharmaSUG – the Pharmaceutical Industry SAS Users Group. That was in 1997, and it’s been my pleasure to work with this great group of volunteer SAS leaders
Managing the deluge of smart grid data. Responding to the increased demand for customer intelligence. Implementing analytics for grid optimization. These were just a few of the topics covered at this week’s inaugural “Utility Analytics Summit.” Utilities have been grappling with implementation of smart grid devices for several years and
One of the cool things with SAS OLAP Viewer in Add-in to Microsoft Office is your ability to skip right into a specific value. Out of the box, viewing OLAP cubes can lead you to believe that in order to view a specific value you need to click-thru a hierarchy
Teachers have more than enough to juggle each day, lacking the time to search for, and find, high-quality curricular resources online. When I would search for lesson plan supplements, I would often get lost in Google's abyss of results, spending far too much precious time sifting through mediocre materials. Until
What're you wearing? I get that question all of the time …. OK, let me re-phrase that. In regards to events, I am often asked, “What are you going to wear?” It’s always hard to answer as the standard for events, including SAS Global Forum, is business casual.
The SAS DATA step supports a special syntax for determining whether a value is contained in an interval: y = (-2 < x < 2); This expression creates an indicator variable with the value 1 if x is in the interval (-2,2) and 0 otherwise. There is not a standard
In this blog we have been discussing graphs useful for analysis of data for many domains such as clinical research, forecasting and more. SG Procedures and GTL are particularly suited for these use cases. So, when I came upon a dashboard image from Steven Few's Visual Business Intelligence blog, showing the
When I read the way that this user is working with SAS, I took a look around support.sas.com to find examples to allow you to work with your own data. (Take a look at this.) If this example isn't quite right for you, plug in your own key words.
I have previously blogged about how to convert a covariance matrix into a correlation matrix in SAS (and the other way around). However, I still get questions about it, perhaps because my previous post demonstrated more than one way to accomplish each transformation. To eliminate all confusion, the following SAS/IML
The graph showing the distribution of the maximum liver function test values by treatment for all participants in a study is commonly used for the analysis of safety data in clinical trials. The data is often structured in multiple columns (one per treatment) as below on the left, or grouped by
Bank Systems & Technology just published a special issue focused on "big data" - and how high-performance analytics helps solve the big data problem. Clicking on the cover image will take you to the ebook, so you can flip through 24 pages full of information about the benefits of big data for banking.
I previously described how to use Mahalanobis distance to find outliers in multivariate data. This article takes a closer look at Mahalanobis distance. A subsequent article will describe how you can compute Mahalanobis distance. Distance in standard units In statistics, we sometimes measure "nearness" or "farness" in terms of the
Do you love SAS? I do! And there is a special place for me to share that love. Don't worry there is a special place for you, too. It just so happens that it is the same place – SAS users groups. SAS users groups hold events to allow attendees
So, there’s a lot of talk about staying for the magic of Disney World after SAS Global Forum. What about arriving early for some SAS magic? A selection of some of the most popular SAS training courses is being offered Wednesday – Friday, April 18-20 prior to the conference. Here’s
How we love shortcuts. Our life seems more efficient with them. I try to take the earlier train for a good parking spot that lets me exit early. Actually change that to any spot (my train station has tons of commuters with FEW parking spaces so getting up early is the
The Walt Disney World Swan and Dolphin Resort (the headquarters hotel) is offering a great rate to SAS Global Forum attendees. This amazing facility is where all of the action is. No cars to rent, no gas tanks to fill. No parking to find and pay for – which means no searching
Way back when I learned to program, I remember a computer instructor explaining that an IF-THEN statement can be a relatively slow operation. He said "If a multiplication takes one unit of time, an IF statement requires about 70 units." I don't know where his numbers came from, or even
The parable of beer and diapers is often related when teaching data mining techniques. Whether fact or fiction, a Heat Map is useful to view the claimed associations. A co-worker recently enquired about possible ways to display associations or dependency between variables. One option is to show the dependency as a node
I have a love-hate relationship with spreadsheet data. I am interested in finding data to analyze, and if it arrives in the form of a spreadsheet, I'll take it. And I like to deliver results and reports, but often my constituents ask for it as a spreadsheet that they can
Big data is one of the hottest topics in business. When you hear those words - BIG Data - you almost surely think of: HUGE financial services firms scoring terabytes of historical and current risk data GLOBAL telecommunications companies mining petabytes of structured and unstructured data INTERNATIONAL retailers repricing hundreds of thousands of products across
The American Association of State Highway Transportation Officials recently released a top 10 list of transportation issues that will be “talked, written or tweeted and legislated about” in 2012. As expected, funding constraints and Congressional action on reauthorization appear on the list but the group also notes that natural disaster
Unlike prior versions of SAS OLAP technology, 9.2 provides more options for maintaining and refreshing OLAP cubes. With this comes some discussion about what each one does alone, and how pairing these techniques really provides SAS OLAP Server Administrators and cube developers a cornucopia of OLAP options. PROC OLAPOPERATE Remember
Here is the second installment of sample graphs from the SG Procedures book - The Adverse Event Timeline. This is a graph commonly used in patient profiles for clinical trials where we track the progress of a patient through a hospitalization event, tracking the dates and severity of the adverse events. The
A variance-covariance matrix expresses linear relationships between variables. Given the covariances between variables, did you know that you can write down an invertible linear transformation that "uncorrelates" the variables? Conversely, you can transform a set of uncorrelated variables into variables with given covariances. The transformation that works this magic is
Many of the chapters in my new book Building Business Intelligence Using SAS: Content Development Examples were straightforward to write but one in particular was the beast. Neither Tricia nor I wanted to tackle the SAS BI Dashboard chapter. Just chalk it up to no major work experience with this
Have you ever wanted to run a sample program from the SAS documentation or wanted to use a data set that appears in the SAS documentation? You can: all programs and data sets in the documentation are distributed with SAS, you just have to know where to look! Sample data
Recently a user posted a question on the SAS/GRAPH and ODS Graphics Communities page on how to plot the normal density curves for two classification levels in the same graph. We have often seen examples of a distribution plot of one variable using a histogram with normal and kernel density curves. Here is a simple example: Code Snippet:
Sara started it, and I like the idea - the Unofficial Official Guide to SAS Global Forum. My contribution to this unofficial guide is “The Agenda Builder and why it matters to me." In case you're wondering, I use it to:
My colleague, Steve Serrao, just published a blog post on the proliferation of varied law enforcement approaches and their related phraseologies. After reading, I concur – hence, this blog’s title. I am not a huge fan of the term “predictive policing”. While others may love it and live it, and