Do you want to know what will happen in the future? To gain true predictive insight, skip the tea leaves and look toward your data. SAS instructor Jeff Thompson is a high-energy data mining expert who will be demonstrating how to gain predictive insight from your data in his new
Uncategorized
It’s me again!! We're at the halfway point of meeting our Strata + Hadoop World dream team. So far, you’ve met machine learning guru Patrick Hall; data management expert Clark Bradley; and advanced analytics specialist Rachel Hawley. Next up … Dan Zaratsian! I met Dan a few years back while preparing for Analytics 2013
Last weekend I realized I wanted something very specific: a great-looking lawn by mid-October that would require minimal effort from me. This meant that the output I needed to produce this desired outcome was the target date on which I needed to have aeration and over-seeding done in my yard.
Even though the first papers in machine learning were in the 1950s, one could argue it goes back further to the work of Alan Turing and other early computer scientists. So why has this way of modeling seemingly become so popular now? Because data has become a commodity. Large amounts of many different
I read an article recently discussing how runners inevitably slow down with age, particularly after 50. Data from the New York Marathon and Boston Marathon back this up with generally flat average finishing times for ages 20-49 followed by a steady, almost exponential, increase after 50. I haven’t reached the
I just returned home from an expedition/adventure boat trip to Cuba, and Talk Like a Pirate day is coming up this Saturday - what a combination for an interesting blog! I hope you enjoy a few pictures, and a bit of data analysis on these topics! A couple of weeks ago,
In the UK, technology trends move a little slower than for our US counterparts. It was about 5 years ago when I first met a data leader at a conference on this side of the pond who was actively engaging in large scale big data projects. This wasn’t a presenter
A core SAS team of thought leaders, developers and executives will be in New York City on September 29 at Strata + Hadoop World, mixing and mingling with people like you who live and breathe analytics. We’d love to be a part of your Strata + Hadoop World agenda. Last week, I introduced
Among the tightly held cards, piles of chips and bright lights, there have been stories that have unfolded in Las Vegas that have been forever preserved in time, never seeing the light of day. But what if what happened in Vegas…could be shared with excitement with your friends and family?
Last week I discussed ordinary least squares (OLS) regression models and showed how to illustrate the assumptions about the conditional distribution of the response variable. For a single continuous explanatory variable, the illustration is a scatter plot with a regression line and several normal probability distributions along the line. The
Data integration, on any project, can be very complex – and it requires a tremendous amount of detail. The person I would pick for my data integration team would have the following skills and characteristics: Has an enterprise perspective of data integration, data quality and extraction, transformation and load (ETL): Understands
I've previously written about how to generate a sequence of evenly spaced points in an interval. Evenly spaced data is useful for scoring a regression model on an interval. In the previous articles the endpoints of the interval were hard-coded. However, it is common to want to evaluate a function
In today’s information economy, the ability to engage and develop meaningful digital relationships is fundamental to any business. A growing number of organisations, including small-to-medium sized enterprises, are investing in easy-to-use analytical software and services to extract insights from data about their business. As a result, we're now experiencing the
with Natalie Osborn, Senior Industry Consultant, Hospitality and Gaming Practice, SAS This week, we continue our fall “back to the basics” refresher series on analytics for hoteliers. Last week, in part one, Natalie and I reviewed the analytic methods that can be utilized by hoteliers. This week we will explore
Meet Clark Bradley: SAS technical architect by day and comedian by night. When he’s not demoing SAS Data Loader for Hadoop, he’s blogging about it on The Data Roundtable. Clark and a core SAS team of thought leaders, developers and executives will be in New York City on September 29 at Strata
The benefits of big data often depend on taming unstructured data. However, in international contexts, customer comments, employee notes, external websites, and the social media labyrinth are not exclusively written in English, or any single language for that matter. The Tower of Babel lives and it is in your unstructured
I had the pleasure of speaking at the inaugural “Accounting IS Big Data” conference this past week in New York City, a meeting organized by the American Accounting Association. In addition to giving several talks, I participated in breakout sessions in which attendees discussed how analytics is used to monitor
A friend who teaches courses about statistical regression asked me how to create a graph in SAS that illustrates an important concept: the conditional distribution of the response variable. The basic idea is to draw a scatter plot with a regression line, then overlay several probability distributions along the line,
The epicenter of big data moves to New York City on September 29 at Strata + Hadoop World. It’s a great chance to mix and mingle with people that live and breathe analytics, including a core SAS team of thought leaders, developers and executives. We’d love to be a part
There are four key areas that require continuous investment in order to become demand-driven: people, process, analytics, and technology. However the intent of your demand forecasting process along with business interdependencies need to be horizontally aligned in order to gain sustainable adoption. Adoption alone doesn't necessarily mean it will be sustainable. As
Over my career, I've attended many events where the audience was blasted with the value of analytics -- and the pitch usually fell on deaf ears. The 2015 SAS Energy Analytics Forum (which doubled in size from last year) was vastly different. The attendees had more questions and more urgency to take
In my prior two posts, I explored some of the issues associated with data integration for big data and particularly, the conceptual data lake in which source data sets are accumulated and stored, awaiting access from interested data consumers. One of the distinctive features of this approach is the transition
Statistical programmers often have to use the results from one SAS procedure as the input to another SAS procedure. Because ODS enables you to you to create a SAS data set from any ODS table or graph, it is easy to obtain a data set that contains the value of
I realized a little while ago that I may have more loyalty cards and memberships than the average person. (And that I more actively prove my loyalty than the average person). But as anybody who has ever signed up to a mailing list or for a store card knows, having
Along with the data scientist hype, analytics and the people who make them work have found themselves in the spotlight. The trend has also put an emphasis on the "science" aspects of analysis, such as a data focus, statistical rigor, controlled experiments and the like. Now, I’m not at all against adding more
Since our last VirtualOil update in May, oil prices have continued to take a beating. As the chart of the rolling five-year portfolio shows, much of our strip of options is now out-of-the-money and the average value per barrel of that optionality has sunk below $7. No surprise then that
with Natalie Osborn, Senior Industry Consultant, Hospitality and Gaming Practice, SAS. It’s back to school time, and back to school reminds me of getting back to the basics. So, we thought we’d start the fall with a “back to the basics” refresher series on analytics. To accomplish this, Natalie and
Suppose you wish to select a random sample from a large SAS dataset. No problem. The PROC SURVEYSELECT step below randomly selects a 2 percent sample: proc surveyselect data=large out=sample method=srs /* simple random sample */ n=1000000; /* sample size */ run; Do you have a SAS/STAT license? If not,
In my previous blog post, I showed how you can use SAS to program a "weaving" algorithm that takes an image, cuts it into strips, and weaves the strips together to create mathematical art. I used matrices and heat maps for the computations and visualization. At the end of the
I think everyone can agree that being able to debug programs is an important skill for SAS programmers. That’s why Susan Slaughter and I devoted a whole chapter to it in The Little SAS® Book. I don’t know about you, but I think figuring out what’s wrong with my program