I'm ramping up my visualization skills in preparation for the next big election, and I invite you to do the same! Let's start by plotting some county-level election data on a map... To get you into the spirit of elections, here's a picture of my friend Sara's dad, when he was
English
Because finding analytical talent continues to be a challenge for most, here I offer tips 5, 6, and 7 of my ten tips for finding data scientists, based on best practices at SAS and illustrated with some of our own “unicorns.” You can read my first blog post for why they
Regulatory compliance is a principal driver for data quality and data governance initiatives in many organisations right now, particularly in the banking sector. It is interesting to observe how many financial institutions immediately demand longer timeframes to help get their 'house in order' in preparation for each directive. To the
Part 1 of this topic presented a simple Sudoku solver. By treating Sudoku as an exact cover problem, the algorithm efficiently found solutions to simple Sudoku problems using basic logic. Unfortunately, the simple solver fails when presented with more difficult Sudoku problems. The puzzle on the right was obtained from
In many ways it’s open season for open data; open data is one of those phrases we hear a lot but it’s not always appreciated as having value. The fact that it’s openly available is seen by some as proof that there’s no value in the data – unlike, for
It’s February, so love is in the air (or at least hearts, chocolate, and roses are lining the isles at the grocery store) in the weeks before Valentine’s Day. For the singles in the house, don’t stop here! The stats are in, and according to the http://www.pursuit-of-happiness.org/ , people who have
In this blog series, I am exploring if it’s wise to crowdsource data improvement, and if the power of the crowd can enable organizations to incorporate better enterprise data quality practices. In Part 1, I provided a high-level definition of crowdsourcing and explained that while it can be applied to a wide range of projects
In SAS, the order of variables in a data set is usually unimportant. However, occasionally SAS programmers need to reorder the variables in order to make a special graph or to simplify a computation. Reordering variables in the DATA step is slightly tricky. There are Knowledge Base articles about how
Staying competitive in a big data world means working fast and making decisions even faster. You need to assess conditions, approve access, stop transactions and reroute activities quickly so you can seize opportunities or prevent problems. With increasing data volumes from the Internet of Things (Cisco predicts that fifty billion
North Carolina is one of those lucky states that has a huge variety of scenic destinations, such as mountains, piedmont, coastal plains, beaches, and 'outer banks' islands. We have state parks in all of these areas, but can you guess which state park has been trending the most during the past
I stated in my previous blog about the value and benefits of volunteering that SAS Global Forum is designed to bring users with questions together with users with know-how. This goal is accomplished primarily in breakout and ePoster presentations. During his keynote address at SAS Global Forum 2014, Futurist Thornton
There are companies that have no data quality initiative, and truly do believe that if they see no data problem. In effect, they say that if it does not interfere with day-to-day business, then there is no data quality problem. From what I have seen in my consulting experience, it usually
We asked our partners at the Cornell Center for Hospitality Research to poll the research faculty at the Hotel School to understand their guidance about what to expect in 2015. We were also able to get a preview of what the faculty will be working on in terms of research
Over my last two posts, I suggested that our expectations for data quality morph over the duration of business processes, and it is only at a point that the process has completed that we can demand that all statically-applied data quality rules be observed. However, over the duration of the
I love to teach, but it took several years of teaching before I felt comfortable being in front of a class. And having taught for over 20 years, the fear of presenting in the classroom has passed, but what about presenting at professional meetings or in front of my peers?
A SAS/IML programmer asked a question on a discussion forum, which I paraphrase below: I've written a SAS/IML function that takes several arguments. Some of the arguments have default values. When the module is called, I want to compute some quantity, but I only want to compute it for the
Significant progress in reduction of Cancer mortality is shown in a graph that I noticed recently on the Cancer Network web site. This graph showed the actual and projected cancer mortality by year for males. The graph is shown on the right. The graph plots the projected and actual numbers
Google recently announced that they will be adding Google Fiber high speed network and TV to my area. This was great news, because it will give us more choices ... and a little competition among providers tends to make them all 'try harder' to please the customer. :-) I was curious what other
It's an exciting time for reality! We've been technologically enhancing reality for a long time -- eye glasses, telescopes, binoculars, microscopes, photography, moving pictures, live streaming video over the Internet, etc. But whether it's augmented reality, virtual reality or somewhere in between, a new wave of eye wear technology is
In the latest release of SAS Visual Analytics Designer, a parameter is a variable whose value can be changed and that can be referenced by other report objects. Why is this an important introduction? This addition means that, not only can you design interactive reports via prompt controls, those controls
This week, I finally ate some liver, for the first time in over 20 years - and I realized it's a lot like prepping data (which I'll explain in this blog post). Here are a few of the similarities: They're both good for you. Thinking about them makes you go
This year the American Statistical Association Conference on Statistical Practice (CSP) has some weighty themes including Big Data Prediction and Analysis and all of its exciting applications. But just as important is the theme Communication and Impact. Everyone knows that if you have a great idea or discovery but you
This is an exciting and busy time for the SAS Global Forum 2015 content and delivery teams. They have worked hard to finalize the content, enhance your scheduling experience and ensure that attendees have access to as much of the conference content as possible. Please set aside some time in
Happy New Year! For many the New Year means new beginnings which also means change. But change is hard. We’ve all heard that before, yet still we’re surprised when confronted with the prospect of change and just how challenging it can be. One of the reasons is that most of
In my book Simulating Data with SAS, I discuss a relationship between the skewness and kurtosis of probability distributions that might not be familiar to some statistical programmers. Namely, the skewness and kurtosis of a probability distribution are not independent. If κ is the full kurtosis of a distribution and
We asked our partners at the Cornell Center for Hospitality Research to comment on what they are seeing in terms of trends that will impact the hospitality industry in 2015. Cathy Enz, full professor in strategy and The Lewis G. Schaeneman Jr. Professor of Innovation and Dynamic Management at the
One of the significant problems data quality leaders face is changing people's perception of data quality. For example, one common misconception is that data quality represents just another data processing activity. If you have a data warehouse, you will almost certainly have some form of data processing in the form
“Here’s Johnny!!!” and well sometimes John and sometimes Jonathan and sometimes Jon. In the real world, you sometimes need to make matching character strings more flexible. This is especially common when merging data sets. Variables, especially names, are not always exactly the same in all sources of your data. When
The Internet of Things is going to be driven by innovative business models as much as by innovative technology. In order to ground the following discussion, I found it helpful to create this visual depiction of the IoT that defines and distinguishes the key elements that enter into these business models.
Have you ever thought about retiring in another country, where your money might go further? Well here's some quantitative data to help you make an informed decision! ... First, to get you in the mood, here's a picture of my friend Erik checking out the prices at a pedal-powered food