A common question on SAS discussion forums is how to repeat an analysis multiple times. Most programmers know that the most efficient way to analyze one model across many subsets of the data (perhaps each country or each state) is to sort the data and use a BY statement to
Uncategorized
Are you struggling to hire talented data scientists to glean insights from your corporate data? There’s currently a lack of big data talent hampering corporate analytics and causing nightmares for CIO’s, but I have good news for you: You may already have all the data scientists you need! There are
People come from all over the world to attend this highlight of the season. It’s been a tradition for decades. Hotels book months in advance. Traffic is horrendous in the city center. The coveted tickets can cost thousands of dollars, but tens of thousands of people are lucky enough to score them. In
In 2014, the federal government lost more than $125 billion to fraud, waste and abuse. And that’s just what we know about. While that number may sound incredible, those on the front lines of the government fraud fight know that it's all too real. The US government needs to change
Good Judgment® Open Ever wondered how good you are at forecasting? As a business forecaster, you can do the usual comparison against a naive model (and hopefully you are beating it!). You might also compare your forecast accuracy to published industry benchmarks -- although I would strongly recommend against this.
A topic that's been in the news a lot lately is the presidential power to grant pardons, commutations, and such. But all the articles I've seen just quoted numeric totals - I haven't seen a graph of the data anywhere! So I set out to find the data and graph
How important is reading to the skills gap? It's crucial. Through third grade, children are learning to read. After that, they read to learn. That is why reading proficiently by the end of third grade is one of the most reliable predictors of future success for children. Students who develop
.@philsimon chimes in with trust- and privacy-related recommendations
On discussion forums, I often see questions that ask how to Winsorize variables in SAS. For example, here are some typical questions from the SAS Support Community: I want an efficient way of replacing (upper) extreme values with (95th) percentile. I have a data set with around 600 variables and
Colors are the subject of many romantic poems and songs, but there isn't much romance to be found in their hexadecimal values. With apologies to Van Morrison: ...Skipping and a jumping In the misty morning fog with Our hearts a thumpin' and you My cx662F14 eyed girl When it comes
Doing business in a global economy, have you ever found yourself wanting to show Chinese (or Korean, or Japanese) labels on a map? If so, then this blog is for you! Before we get started, here is a photo of some Chinese characters to get you into the mood. This
On any inauguration day in our country’s history people probably found themselves in one of three categories: happy & hopeful, disappointed & apprehensive, or apathetic & checked-out. Change is difficult, whether you perceive it as positive or negative. This blog is not to share which category I fall into but
The financial sector has always been subjected to regulatory compliance laws and directives. Consumers, lawmakers and politicians would expect no less. But it's fair to say that the financial sector has witnessed a "hockey stick" trend regarding new regulations in recent years.
Suppose you create a scatter plot in SAS with PROC SGPLOT. What color does PROC SGPLOT use for the markers? If you specify the GROUP= option so that markers are colored by a grouping variable, what colors are used to represent the various groups? The following scatter plot shows the
Having addressed the adaptability and power of an analytics environment in my last two posts, I thought I'd close out this mini-series of blogs by providing the business and technology implications of three attributes that need to define any truly open and unified analytics environment: Cohesion Business: The platform enables
It was just a few years ago that the idea of an Internet of Things (IoT) seemed far off, something out of a science-fiction movie. After all, why would a vehicle need to talk to the road? Why would our utility meters need to talk to the central office? The
„Die IT liefert nicht, der Fachbereich weiß nicht, was er heute oder morgen an Daten haben will“… Beide haben recht, ein Dilemma, das darin endet, dass Selbsthilfe betrieben wird. Der Informationshunger besteht weiterhin, und was nicht geliefert wird, besorgt man sich auf anderem Wege. Da wären: die SAP-Maske, Excel, Datenbank(en),
To make it easy to identify non-value adding areas, you can build a simple application using SAS® Visual Analytics software. Such an application lets you point and click your way through the organization’s forecasting hierarchy, and at each point view performance of the Naïve, Manual, Statistical, and Automated forecasts (or
.@philsimon says that, once again, there's quite a bit to learn from Amazon.
2017 stehen die Themen Big Data und Analytics immer noch ganz oben auf der Agenda. Doch Gott sei Dank ist die Diskussion nun einen Schritt weiter und dreht sich um die geschäftlichen Auswirkungen der Technologie. Wie kann der Einsatz von Analytics-as-a-Service den Umsatz erhöhen, Kosten reduzieren oder einen Wettbewerbsvorteil sichern?
In a previous article, I showed how to simulate data for a linear regression model with an arbitrary number of continuous explanatory variables. To keep the discussion simple, I simulated a single sample with N observations and p variables. However, to use Monte Carlo methods to approximate the sampling distribution
To properly evaluate (and improve) forecasting performance, we recommend our customers use a methodology called Forecast Value Added (FVA) analysis. FVA lets you identify forecasting process waste (activities that are failing to improve the forecast, or are even making it worse). The objective is to help the organization generate forecasts
I usually create very technical maps, to display data spatially - and they usually have a certain look. They're clear, crisp, and to the point. I typically only use color to represent the data, and I choose a font that is simple and easy to read (such as arial). But
Did you know that January is Human Trafficking Awareness Month in the United States? If not, you may not know that researchers estimate that human trafficking is a global industry with revenues from $51 to $99 billion annually, up from $32 billion just a few years ago. In this case,
You have all seen, or perhaps even created, some really bad graphics: Cluttered, confusing, too small, incomprehensible. Or worse, the author may have committed one of the three unforgivable sins of data visualization by deceptively distorting a map, truncating the axis so as to misrepresent the data, or used double
Unendliche Weiten … nein, heute meine ich damit mal nicht das gute alte Raumschiff Enterprise (übrigens 50-jähriges Jubiläum), sondern die Möglichkeiten, die sich einem auftun, wenn man bereit ist, sich auf etwas Neues einzulassen. Eigentlich ist das ziemlich einfach: Man schaut sich bei Leuten um, die aus einer vollkommen anderen
If you are a SAS programmer and use the GROUP= option in PROC SGPLOT, you might have encountered a thorny issue: if you use a WHERE clause to omit certain observations, then the marker colors for groups might change from one plot to another. This happens because the marker colors
I’ve had several meetings lately on data management, and especially integration, where the ability to explore alternatives has been critical. And the findings from our internet of things (IoT) early adopters survey confirms that the ecosystem nature of data sources in IoT deployments means we need to expand the traditional
Am 28. Januar war Europäischer Datenschutztag und ab sofort gilt dann verschärftes EU-Recht – so kommt es einem zumindest vor bei Gesprächen mit Datenschutz-Experten, bei ihrem Streben, die neue EU-Datenschutz-Grundverordnung (DSGVO) zu bewältigen. Was ist schutzwürdig? Alles bekannt Personenbezogene, sowieso. Mehr aber noch das Unbekannte.
Preview of the Winter 2017 issue of Foresight Foresight begins the new year with our 44th issue since the journal began publishing in 2005, and in this Winter 2017 collection we’re showcasing a broad range of incisive and entertaining pieces. We’re looking at new research on the effectiveness of collaboration