Im diesem Gastbeitrag von Cloudera geht es darum, wie Big Data effizient gemananged werden können. Cloudera präsentiert sich auf dem diesjährigen SAS Forum in Bonn (28. April) mit einem eigenen Stand. Lassen wir ab jetzt Cloudera sprechen: Analytik ist wertschöpfend für Unternehmen. SAS Anwender wissen das besser als alle anderen.
Uncategorized
Many data quality issues are a result of the distance separating data from the real-world object or entity it attempts to describe. This is the case with master data, which describes parties, products, locations and assets. Customer (one of the roles within party) master data quality issues are rife with examples, especially
We continue to ready ourselves for this year’s SAS Global Forum coming up next month. On Monday, April 18, there will be two SAS Certification testing events. Then after the conference wraps up, SAS training instructors will be offering unique, hands-on training that you won’t find anywhere else. In this
If a picture is worth a thousand words, then visualizing data in Hadoop would be like a billion. Over the last few years, organizations have rushed to leverage the low-cost distributed computing and storage power of Hadoop clusters. As Hadoop environments mature and move away from their initial focus of
You’ve likely heard the news that the Google DeepMind “AlphaGo” computer not only beat a human expert at the game of Go, defeating the European Go champion, Fan Hui in five straight games, but also beat the reigning world champion grandmaster, South Korea’s Lee Sedol, 4 games to 1. Go
My previous blog post shows how to use PROC LOGISTIC and spline effects to predict the probability that an NBA player scores from various locations on a court. The LOGISTIC procedure fits parametric models, which means that the procedure estimates parameters for every explanatory effect in the model. Spline bases
Modeling risk to meet regulatory requirements is costly and complex. Because of that, some have suggested that financial services institutions (FSIs) move toward a set of standardized models. The argument is that central banks and regulatory authorities could then more easily monitor systemic risk and compare apples to apples. But
Even the most casual observers of the IT space over the last few years are bound to have heard about Hadoop and the advantages it brings. Consider its ability to store data in virtually any format and process it in parallel. Hadoop distributors, such as Hortonworks, can also provide enterprise-level
“We could send a juvenile justice youth to Harvard for what we pay for incarceration, and we don't get very good outcomes.” That was said by Gladys Carrion when she was Director of the New York State Office of Children and Family Services. (She’s now Commissioner of NYC Administration for
According to Lloyd Dean, president and CEO, "At Dignity Health, we are committed to developing partnerships and opportunities that harness the tremendous potential of technology, from improving the patient experience to providing caregivers with tools that will support their day-to-day care decisions." Dignity Health, one of the largest health systems
I recently bought a vehicle that has FlexFuel capability and can use E85 (mostly ethanol) fuel. But can you guess whether it is more economical for me to use E85, or regular gasoline? Read the SAS analysis below to see if you guessed right! I've been the happy owner of
Did you know one of the attendees' favorite events at SAS Global Forum is to meet our bestselling authors? This year at SAS Global Forum 2016 we are planning a "Top Tips from Your Favorite SAS Press Authors" lunch where we will ask 3 or 4 authors to present a top tip
@philsimon on what we can learn about data quality from Jeff Bezos's behemoth.
Anknüpfend an meinen Einstieg in die Big-Data-Welt und nach meiner Reise in die Vergangenheit mit „In-Memory“ hat mich die Neugier gepackt. Was hat es mit anderen Technologien auf sich, die gerade dabei sind, unsere Welt zu revolutionieren? Blicken wir zunächst einmal auf „Event Stream Processing“ (ESP). Ein Thema, das gerade
Last week Robert Allison showed how to download NBA data into SAS and create graphs such as the location where Stephen Curry took shots in the 2015-16 season to date. The graph at left shows the kind of graphs that Robert created. I've reversed the colors from Robert's version, so
The new book Business Forecasting: Practical Problems and Solutions contains a large section of recent articles on forecasting performance evaluation and reporting. Among the contributing authors is Rob Hyndman, Professor of Statistics at Monash University in Australia. To anyone needing an introduction, Hyndman's credentials include: Editor-in-chief of International Journal of
As the big data era continues to evolve, Hadoop remains the workhorse for distributed computing environments. MapReduce has been the dominant workload in Hadoop, but Spark -- due to its superior in-memory performance -- is seeing rapid acceptance and growing adoption. As the Hadoop ecosystem matures, users need the flexibility to use either traditional MapReduce
Im diesem Gastbeitrag von Accantec geht es um den Datenschutz im Big Data Umfeld. Accantec präsentiert sich auf dem diesjährigen SAS Forum in Bonn (28. April) mit einem eigenen Stand. Lassen wir ab jetzt Gero Hentschel von Accantec sprechen. Big Data ist längst keine Modeerscheinung mehr, sondern in vielen Unternehmen mittlerweile
Many of us here at SAS are hustling to prepare for SAS Global Forum set to begin April 18 in Las Vegas. This year’s agenda is shaping up to be a “can’t-miss” event. Monday, before SAS Global Forum begins, there are two SAS Certification testing events. Then after the conference
In an upcoming paper for SAS Global Forum, several of us from the SAS Text Analytics team explore shifting the context of our underlying representation from documents to the sentences that are within the documents. We then look at how this shift can allow us to answer new text mining
At a recent TDWI conference, I was strolling the exhibition floor when I noticed an interesting phenomenon. A surprising percentage of the exhibiting vendors fell into one of two product categories. One group was selling cloud-based or hosted data warehousing and/or analytics services. The other group was selling data integration products. Of
There are several ways to simulate multinomial data in SAS. In the SAS/IML matrix language, you can use the RANDMULTINOMIAL function to generate samples from the multinomial distribution. If you don't have a SAS/IML license, I have previously written about how to use the SAS DATA step or PROC SURVEYSELECT
Nothing works today without an efficient data management – also in insurance business. A standard data model can be an important component of it. This article explains why. “Make or Buy”? This question has been raised very often by insurance companies planning to introduce a consistent data structure – a
Streaming analytics is a red hot topic in many industries. As the Internet of Things continues to grow, the ability to process and analyze data from new sources like sensors, mobile phones, and web clickstreams will set you apart from your competition. Event stream processing is a popular way to
In my SAS Press book Business Statistics Made Easy in SAS® I place a strong focus on the skill of extrapolating analytics/statistical outcomes to key business implications (similar techniques can be used to link statistics to other key societal outcomes). Unfortunately, business analytics often stops short of defining the impact
Math lovers, do you know what day it is? It's Pi Day, which we celebrate every year on March 14 because the date 3-14 matches the first three digits of pi, 3.14. This year, I'm celebrating with poetry, combining my love of math with my love of language. Word Spy explains that a pi-ku is
People have always been fascinated by sports statistics, and with the recent popularity of fantasy sports there is an increased demand for custom analyses of the sports data. With those folks in mind, I have created a simple example that SAS programmers can use as a starting point for analyzing NBA
In the past, we've always protected our data to create an integrated environment for reporting and analytics. And we tried to protect people from themselves when using and accessing data, which sometimes could have been considered a bottleneck in the process. We instituted guidelines and procedures around: Certification of the data
Today is March 14th, which is annually celebrated as Pi Day. Today's date, written as 3/14/16, represents the best five-digit approximation of pi. On Pi Day, many people blog about how to approximate pi. This article uses a Monte Carlo simulation to estimate pi, in spite of the fact that
I was answering questions about SAS in a forum the other day, and it struck me how much easier it is to help folks if they can provide a snippet of data to go along with their program when asking others to help troubleshoot. This makes it easy to run