How to graph NBA data with SAS

13

People have always been fascinated by sports statistics, and with the recent popularity of fantasy sports there is an increased demand for custom analyses of the sports data. With those folks in mind, I have created a simple example that SAS programmers can use as a starting point for analyzing NBA data.

Before we get into the nitty gritty details, here's a picture of my friend Simone's son playing basketball (in the bright shirt, with the ball). He's tall, in shape, and smart, so I think he'll do well in basketball. Who knows, maybe one of these days we'll all be plotting his data in our NBA graphs!

simones_son

I had recently read about some examples that demonstrate how to use the Python and R programming languages to analyze the NBA data, and decided to try my hand at using SAS to do something similar. With a bit of digging, I found the magic url that can be used to download the data for a specified player & season. I then wrote some SAS code to import the data directly from the Web page, into a SAS dataset.

After scrutinizing the data a bit, I determined that the 0,0 origin coordinate was in the middle of the basket, and all of the shots were shown in relation to one end of the court. I looked up the dimensions of an NBA basketball court, then determined the coordinates of the 4 corners, and created a map polygon I could use to represent the court in Proc Gmap. I then converted the shot data into an annotate dataset that would plot the missed shots as red x's and the made shots as blue o's. Here's what things looked like so far:

nba_tracker_no_markings

The above graph is nice, but it would be even better with some points-of-reference so we can see 'where' the player was when he made the shot. Therefore I worked out the coordinates of all the markings on the court, and created a special annotate dataset to draw them on the map (using annotate draw and polygon functions). Wow - what a difference that makes!How to graph NBA data with SAS #analytics Click To Tweet

nba_tracker

The Proc Gmap approach is a good starting place for a spatial analysis, but how about analyzing the data over time? It was a simple matter to feed the data into Proc Gplot, and generate the following. Do you notice any trends in Stephen's shot data? Can you explain the outliers?

nba_tracker1

 

 


 

Just for Fun:

Here's a little quiz, to test your NBA knowledge, combined with your visual analytics perception skills. Below are three graphs - can you tell which goes with Kevin Durant, Lebron James, and Marc Gasol:

nba_gasol

nba_durant

nba_lebron

(Once you've made your guess, you can 'cheat' and look at the filenames of the images for a hint!)

Share

About Author

Robert Allison

The Graph Guy!

Robert has worked at SAS for over 20 years, and is perhaps the foremost expert in creating custom graphs using SAS/GRAPH. His educational background is in Computer Science, and he holds a BS, MS, and PhD from NC State University. He is the author of several conference papers, has won a few graphic competitions, and has written a book (SAS/GRAPH: Beyond the Basics).

13 Comments

  1. So cool! I went to Virginia Tech with Stephen's father, Dell Curry, before the school became well-known for its football program. Dell was year ahead of me.

  2. Pingback: A statistical analysis of Stephen Curry's shooting - The DO Loop

  3. Pingback: Nonparametric regression for binary response data in SAS - The DO Loop

    • Robert Allison
      Robert Allison on

      I can't remember exactly what page I got it from. I did a bunch of Google searches, and tried out several existing web-based NBA data query tools, and copy-n-pasted the URL from one of them, and trimmed it down a bit to do what I was wanting :)

  4. I wanted to imitate your work, but the code doesn't work!
    There was no observation in 'my data', and I am poor at SAS, so I could not find what is the problem.

    Please comment..

    • Robert Allison
      Robert Allison on

      Have you defined the 'my_proxy' macro variable, as described in the comments? Each site will have a different/unique proxy, so you need to define that based on your site...

      %let my_proxy=http://yourproxy.com:80;

      The my_proxy macro variable's value is then utilized in the filename statement, to get to the data...

      filename temp_url url &url proxy="&my_proxy";

  5. Pingback: Wie treffsicher war Jordan unter Druck? - Mehr Wissen

  6. Pingback: My top 10 graph blog posts of 2016! - SAS Learning Post

Leave A Reply

Back to Top