When SAS is used for analysis on large volumes of data (in the gigabytes), SAS reads and writes the data using large block sequential IO. To gain the optimal performance from the hardware when doing these IOs, we strongly suggest that you review the information below to ensure that the
English
Since my opening blog post about our new partnership with British Rowing and the GB Rowing Team, I’ve had a deep dive into their data, and we’re now close to our first key milestone – having all the data about a rower in a single location, so we can then
Artificial sweeteners have been a controversial topic since they came into the market decades ago. There has been an ongoing debate over the potential health benefits and risks of artificial sweeteners, which are among the most common food additives and are consumed by hundreds of millions of people around the
SAS/IML 13.1 includes a handy function for programmers who write a lot of modules. The PARENTNAME function obtains the name of the symbol that was passed in as a parameter to a user-defined module. How is this useful? Well, suppose that you want to create a SAS/IML module that prints
SAS has been developing "secret sauce" technology for more than 38 years. Whether it has to do with being platform independent, processing in-database, running across a grid, or analyzing data in-memory like our SAS LASR Analytic Server or our High Performance Analytics offerings, secret sauce makes everything taste or, in
Do you order things online, to avoid paying sales tax? Those "good old days" might be coming to an end soon... Here's a snapshot of my latest purchase from Amazon.com (a little something for the Talk Like a Pirate party I had on Sept 19): In the US, each of
At most banks, data is stored in separate databases and data warehouses. Customer data is stored in marketing databases, fraud analyses are done on transactional data, and risk data is stored in risk data warehouses. Oftentimes even liquidity, credit, market, and operational risk data is stored separately as well. Bringing
Part 1: The challenge and the opportunity Mental illness continues to profoundly affect the nation’s population and, for the most part, remains greatly under analyzed. This is the first entry in a series about the mental health problem in the US, and how an analytic approach can improve care for
Next week Breffni Noone, Associate Professor, The Pennsylvania State University and I will be hosting a discussion at the Cornell Hospitality Research Summit on how user generated content impacts the purchase choices of the unmanaged business traveler. The discussion will take place on Monday October 13 at 2:15pm in Statler 396.
In the first installment of this series on Hadoop, I shared a little of Hadoop's genesis, framing it within four phases of connectivity that we are moving through. I also stated my belief that Hadoop has already arrived in the mainstream, and we are currently moving from phases three of connecting people to phase four
In response to my recent post about how to use the PALETTE function in SAS/IML to generate color ramps, a reader wrote the following: The PALETTE function returns an array of hexadecimal values such as CXF03B20. For those of us who think about colors as RGB values, is there an
One of the hottest trends today in the business intelligence and analytics spaces is “self-service.” The word self-service is thrown around lightly in many situations and often carries different expectations for different people and organizations. Before we go into the details of self-service analytics it is important to have a
Report design includes several phases. Granted, these phases aren’t official: they’re more a reflection of my own thought processes and how my report designs typically unfold: the initial “get the data on the screen to see what we have” phase the addition of filters and prompts to assist with guided
Is the type of car you drive more likely, or less likely, to get a speeding ticket? Let's analyze some data to find out! Do red cars attract more attention from the police, and get more tickets? How about cars with a 'racing stripe'? Or cars with a big chromed motor,
Double, double toil and trouble; Fire burn, and caldron bubble. Macbeth, Act IV, Scene I For the cyptanalyst or recreational puzzle solver, "double double" does not lead to toil or trouble. Just the opposite: The occurrence of a double-letter bigram in an enciphered word puzzle is quite fortunate. Certain double
Have you heard the old saying that "Banks only loan money to people who don't need it"? Let's analyze the data and see if that is true!... I'm very much a car-guy, and I love learning about all the new vehicles, and love the new-car feel ... and even the smell. It's hard to not like a
So, you've heard the Hadoop hype and you are looking – or have already invested – into Hadoop. Maybe you have also realized some benefits from the Hadoop ecosystem. But now you want to maximize those benefits by using advanced analytics, or you might have heard about algorithms or machine learning libraries available
Shades of Pink - Honoring Breast Cancer Awareness Month was written by Celeste Cooper-Peel. Some things you never forget. I remember my mom’s breast cancer diagnosis like it was yesterday. She was only 42. Who would have thought? My mom found the lump herself and although it was cancer, she was
So, with the simple introduction in Understanding Hadoop security, configuring Kerberos with Hadoop alone looks relatively straightforward. Your Hadoop environment sits in isolation within a separate, independent Kerberos realm with its own Kerberos Key Distribution Center. End users can happily type commands as they log into a machine hosting the
Have you ever looked as a statistical graph that uses bright garish colors and thought, "Why in the world did that guy choose those awful colors?" Don't be "that guy"! Your choice of colors for a graph can make a huge difference in how well your visualization is perceived by
Would you build a house without a proper foundation? Most of us wouldn’t dare, but that’s exactly what many retail businesses are doing today. When building a house, if you don’t get the foundation right, paint, wallpaper and fixtures won’t matter much. It’s no different in the retail industry. Success
Last year, after 15 years of benefiting from the SAS community, I thought it was time to give a little something back. So I decided to write a paper on two technologies I have a healthy interest in: SAS and Hadoop. My paper SASReduce: an implementation of MapReduce using BASE/SAS
In a previous article I introduced the HEATMAPCONT subroutine in SAS/IML 13.1, which makes it easy to visualize matrices by using heat maps with continuous color ramps. This article introduces a companion subroutine. The HEATMAPDISC subroutine, which also requires SAS/IML 13.1, is designed to visualize matrices that have a small
A HighLow plot is very popular in the financial industry, often used to track the periodic movement of a stock or some instrument or commodity. The CandleStick Chart is one specific type of high low plot, purportedly originating in Japan for tracking of financial instruments in the rice trade. Creating a
If you live in an English speaking country you are used to a relatively unadorned alphabet. Take a look at the French and Spanish languages, where vowels are decorated with accents like “acción” in Spanish, and the circumflex, or the hat used in “pâte” in French. Look at the gorgeous
People often talk about the customer experience and the engagement model. This is an easier task when a business has regular interactions with its customers like banks and retailers. However, for insurers, this is a challenge. First of all, insurers have infrequent interactions with their customers. When there is interaction,
It’s a great time to be a sports fan – and an even better time to be a sports and data fan as these two worlds continue to meld together. For the last couple of years nearly every conversation about sports, analytics or both had to have at least one
A student brought in this coding problem after her manager was struggling with this issue for a while. They played guessing games, but to no avail. Here’s what happened when they submitted data step and proc sql code using a WHERE clause with an INPUT function? data aileen; length hcn
In last week's article about the distribution of letters in an English corpus, I presented research results by Peter Norvig who used Google's digitized library and tabulated the frequency of each letter. Norvig also tabulated the frequency of bigrams, which are pairs of letters that appear consecutively within a word.
This guest post was written by Zubin Dowlaty. He has 20+ years’ experience in the business intelligence and analytics space. At Mu Sigma, he works closely with Fortune 500 companies counseling them on how to institutionalize data-driven decision-making. Zubin is focusing his efforts managing an agenda of rapidly implementing innovative analytics