Crucially, this data not only needs to be standardised but also it needs to be actionable. By that, I mean this behavioural data needs.
English
How can a solar farm ensure peak energy production? And what factors can be adjusted to optimize production throughout the day, the week and season-by-season? These are just some of the questions that a team of data scientists have asked and answered about the SAS solar farm using data, drones
Well SAS, you outdid yourselves once again! For 8 years running, we've been collecting and donating leftover and excess Halloween candy from SAS families. After last year's record breaking 680 pounds of donated candy, we challenged you to raise the {candy} bar even higher this year...and man did you do
A 2-D "bin plot" counts the number of observations in each cell in a regular 2-D grid. The 2-D bin plot is essentially a 2-D version of a histogram: it provides an estimate for the density of a 2-D distribution. As I discuss in the article, "The essential guide to
I saw an article that claimed Donald Trump recently tweeted 123 times in one day. This got me wondering how many times he typically tweets during a day, and whether this number has changed over the years. This seems like it might be a good topic to analyze with a
I have a theory about parents (most often mothers, but I might be biased based on my experience!). When we are born, we pick up a suitcase filled with guilt. The contents may shift during flight, but it is almost always full. Some examples are: during pregnancy – whether or not
The next time you pick up a book, you might want to pause and think about the work that has gone into producing it – and not just from the authors.
Data scientists naturally use a lot of machine learning algorithms, which work well for detecting patterns, automating simple tasks, generalizing responses and other data heavy tasks. As a subfield of computer science, machine learning evolved from the study of pattern recognition and computational learning theory in artificial intelligence. Over time, machine learning has borrowed from many
We’re proud of our unique company culture that spans across offices, divisions and geographies. It’s one of the reasons SAS was recently named among the top workplaces in the world by Fortune’s Great Place to Work! If you’re curious about what it is that makes SAS such a great place to work, one answer
Rockin' around the Christmas tree At the Christmas party hop. – Brenda Lee Last Christmas, I saw a fun blog post that used optimization methods to de-noise an image of a Christmas tree. Although there are specialized algorithms that remove random noise from an image, I am not going to
I can tell that my area (Wake county, NC) has a growing population, because the traffic keeps getting worse and worse. But it's a little difficult to quantitatively gauge growth by looking at traffic congestion. Therefore let's have a look at a more direct measurement - the actual population data!
Parts 1 and 2 of this blog post discussed exploring and preparing your data using SASPy. To recap, Part 1 discussed how to explore data using the SASPy interface with Python. Part 2 continued with an explanation of how to prepare your data to use it with a machine-learning model.
Bringing the power of SAS to your Python scripts can be a game changer. An easy way to do that is by using SASPy, a Python interface to SAS allowing Python developers to use SAS® procedures within Python. However, not all SAS procedures are included in the SASPy library. So,
I caught up with Ioannis Tsiliras, Director at OTE-COSMOTE, to discuss innovation, real-time and customer contact.
Jeff Stander helps us understand the different options of preparing data for analytics.
Who says that holidays have to be stuffed with belly-stuffing traditions? While the average American gains about one to two pounds between Thanksgiving and New Year’s Day, weight gain, indigestion and post-meal sluggishness do not have to be on your wish list.
Binary matrices are used for many purposes. I have previously written about how to use binary matrices to visualize missing values in a data matrix. They are also used to indicate the co-occurrence of two events. In ecology, binary matrices are used to indicate which species of an animal are
For the last two months I’ve seen commercial after commercial with smiling faces, catchy tunes, and kids running around houses full of good food, presents, and family. This season, we’re encouraged to remember what “matters”. Here’s something else to remember: the goal of these commercials is for the company to
As we're getting into December, and the weather is getting colder, I thought it would be cool to plot some Antarctica data. You might remember I did this about 1.5 years ago, using good-old Proc Gmap, a special projection, and lots of tricky annotation. Well, this time let's use the
The DATA step remains a popular way to create and manipulate SAS data sets. Whether you are reshaping a data set entirely or simply assigning values to a new variable, there are numerous tips and tricks that you can use to save time and keystrokes.
SAS' Leonid Batkhan shows you how to compare SAS data sets that include common and uncommon columns. You'll learn how to check mark commonalities and color-code differences in data tables side-by-side columns and add a comments field to see greater detail.
Recently I showed how to visualize and analyze longitudinal data in which subjects are measured at multiple time points. A very common situation is that the data are collected at two time points. For example, in medicine it is very common to measure some quantity (blood pressure, cholesterol, white-blood cell
Another year, another traditional Christmas song or carol turned into a fun technology-related version! This is the sixth year and my ninth song. I hope you enjoy your 2019 holiday song, based on this famous tune. The Data Science and AI Song Computer vision processing on an open stack The
One analysis suggests that in the first year of using process mining, the telecoms company had decreased its cost per process order by almost 40% and improved its productivity.
This is a second article about analyzing longitudinal data, which features measurements that are repeatedly taken on subjects at several points in time. The previous article discusses a response-profile analysis, which uses an ANOVA method to determine differences between the means of an experimental group and a placebo group. The
SAS Viya is a cloud-enabled, in-memory analytics engine which allows for rapid analytics insights. Viya utilizes the SAS Cloud Analytics Services (CAS) to perform various actions and tasks. Best of all, CAS is accessible from various interfaces including R. In this blog, I will go through a few blocks one of my notebooks, which moves through an analytics workflow using R and SAS.
If you’re like me, you struggle to buy gifts. Most folks in my inner circle already have everything they need and most of what they want. Most folks, that is, except the tech-lovers. That’s because there’s always something new on the horizon. There’s always a new gadget or program. Or
Site relaunches with improved content, organization and navigation. In 2016, a cross-divisional SAS team created developer.sas.com. Their mission: Build a bridge between SAS (and our software) and open source developers. The initial effort made available basic information about SAS® Viya® and integration with open source technologies. In June 2018, the
If you don't have a SAS/Graph license, then you're probably using the ODS Graphics 'sg' procedures that come with Base SAS to create your graphs and maps. And if you've tried plotting data on a map, you probably noticed that SGmap lets you overlay point-data on an OpenStreetmap, but you
Longitudinal data are used in many health-related studies in which individuals are measured at multiple points in time to monitor changes in a response variable, such as weight, cholesterol, or blood pressure. There are many excellent articles and books that describe the advantages of a mixed model for analyzing longitudinal