Get the right information, with visual impact, to the people who need it

SAS's Kevin Scott explains how to set up and analyze a dataset for forecasting in SAS Viya with a particular emphasis on selecting lags for dynamic models
Get the right information, with visual impact, to the people who need it
SAS's Kevin Scott explains how to set up and analyze a dataset for forecasting in SAS Viya with a particular emphasis on selecting lags for dynamic models
After giving a talk about how to create effective statistical graphics in SAS, I was asked a question: "When do you suggest using the graph template language (GTL) to build graphs?" I replied that I turn to the GTL when I cannot create the graph I want by using PROC
A common task in statistics is to model data by using a parametric probability distribution, such as the normal, lognormal, beta, or gamma distributions. There are many ways to assess how well the model fits the data, including graphical methods such as a Q-Q plot and formal statistical tests such
En el vertiginoso mundo de la analítica, la teoría es fundamental, pero la experiencia práctica es la que realmente marca la diferencia. En SAS, lo sabemos bien. Por eso, hemos forjado una alianza estratégica con la Universidad Carlos III de Madrid para impulsar el talento emergente, brindándoles la oportunidad de
A colleague asked me an interesting question: Suppose you have a structured correlation matrix, such as a matrix that has a compound symmetric, banded, or an AR1(ρ) structure. If you generate a random correlation matrix that has the same eigenvalues as the structured matrix, does the random matrix have the
In a previous article, I presented some of the most popular blog posts from The DO Loop in 2024. In general, popular articles deal with elementary topics that have broad appeal. However, I also write technical articles about advanced topics, which typically do not make it onto a Top 10
SAS' Danny Sprukulis takes you step-by-step through geocoding incomplete geographic data sources for proper visualization and through its additional capabilities.
In 2024, I wrote about 80 articles for The DO Loop blog. My most popular articles were about SAS programming, data visualization, and statistics. If you missed any of these articles, here is the "Reader's Choice Awards" for some of the most popular articles from 2024! SAS Programming The following
O Christmas tree, O Christmas tree, How lovely are your branches! SAS programmers have a long history of creating yuletide-themed graphics. Christmas trees are a popular image because of their simplicity. I admit that I have indulged more than once in this holiday tradition: An old-school ASCII art image A
En la actualidad, los modelos analíticos son herramientas esenciales para tomar decisiones basadas en datos. Desde prever tendencias hasta optimizar operaciones, los modelos analíticos dependen en gran medida de la calidad de los datos de entrada. La precisión, integridad y relevancia de estos datos son cruciales para obtener resultados confiables
The article "Order two-dimensional vectors by using angles" shows how to re-order a set of 2-D vectors by their angles. Because angles are on a circle, which has no beginning and no end, you must specify which vector will appear first in the list. The previous article finds the largest
In SAS Viya, users can customize a project management environment by using a file that contains metadata about the organization’s project progress. This process allows the management team to track and interact with the project’s ongoing steps.
In SAS, range attribute maps enable you to specify the range of values that determine the colors used for graphical elements. There are various examples that use the GTL to define a range attribute map, but fewer examples that show how to use a range attribute map with PROC SGPLOT.
As part of this year's IEEE Visual Analytics Science and Technology (VAST) Challenge, a group of SAS data scientists puit SAS Viya and related machine learning tools to the ultimate test - to identify individuals in a complex fishing network. Excitedly, the team received the Honorable Mention Award for Breadth of Investigation!
Learn how an intern integrated SAS Viya® and open-source code (Python) into a Machine Learning project to combine their strengths within the context of predictive modeling, and to show off the variety of ways this integration can be accomplished.
SAS Visual Text Analytics can easily analyze similar words and phrases coming from various cultural heritage-related documents to construct a heritage wordbook that cultural workers can use to identify what relevant conservation technique to use on a structure/artifact.
This article discusses how to scale a probability density curve so that it fits appropriately on a histogram, as shown in the graph to the right. By definition, a probability density curve is scaled so that the area under the curve equals 1. However, a histogram might show counts or
A previous article discusses a formula for a confidence interval for R-square in a linear regression model (Olkin and Finn (1995) "Correlations redux", Psychological Bulletin) The formula is useful for large data sets, but should be used with caution for small samples. At the end of the previous article, I
A SAS analyst ran a linear regression model and obtained an R-square statistic for the fit. However, he wanted a confidence interval, so he posted a question to a discussion forum asking how to obtain a confidence interval for the R-square parameter. Someone suggested a formula from a textbook (Cohen,
Detecting illicit financial flows require much more than using traditional business methods. At this point, using centrality metrics in investigation and analytical models will provide wider detection approaches.
A SAS analyst read my previous article about visualizing the predicted values for a regression model that uses spline effects. Because the original explanatory variable does not appear in the model, the analyst had several questions: How do you score the model on new data? The previous example has only
Batch manufacturing involves producing goods in batches rather than in a continuous stream. This approach is common in industries such as pharmaceuticals, chemicals, and materials processing, where precise control over the production process is essential to ensure product quality and consistency. One critical aspect of batch manufacturing is the need to manage and understand inherent time delays that occur at various stages of the process.
A SAS programmer wanted to visualize density estimate for some univariate data. The data had several groups, so he wanted to create a panel of density estimate, which you can easily do by using PROC SGPANEL in SAS. However, the programmer's boss wanted to see filled density estimates, such as
En los últimos años, la ciencia de datos ha experimentado un crecimiento exponencial y se ha convertido en un pilar fundamental para las estrategias de las organizaciones en todas las industrias. Sin embargo, para los data scientist experimentados, el panorama del dato se encuentra en un proceso de cambio constante.
Years ago, I wrote an article that showed how to visualize patterns of missing data. During a recent data visualization talk, I discussed the program, which used a small number of SAS IML statements. An audience member asked whether it is possible to construct the same visualization by using only
Recently, I saw a scatter plot that displayed the ticks, values, and labels for a vertical axis on the right side of a graph. In the SGPLOT procedure in SAS, you can use the Y2AXIS option to move an axis on the right side of a graph. Similarly, you can
I sometimes see analysts overuse colors in statistical graphics. My rule of thumb is that you do not need to use color to represent a variable that is already represented in a graph. For example, it is redundant to use a continuous color ramp to represent the lengths of bars
This phenomenon has been in the news recently, so I've updated this article that I originally published in 2017. The paper currency in circulation in the US is mostly $100 bills. And not just by a little bit -- these account for 34% of the notes by denomination and nearly
Have you ever been curious about your monthly water consumption and how it compares to others in your community? Recently, I had this question and decided to get ahold of my family's water usage data for analysis. Harnessing the power of data visualization, I compared my family of four's monthly
In 2023, I wrote 90 articles for The DO Loop blog. My most popular articles were about SAS programming, data visualization, and statistics. In addition, several "general interest" articles were popular, including my article for Pi Day and an article about AI chatbots. If you missed any of these articles,