Learn about the latest tips, tutorials, upcoming events and certifications
SAS' Cindy Wang, inspired by a SAS Support Community post, reveals how to perform dynamic calculations for an XmR control chart.
From solutions engineers and students to analysts and data scientists, SAS® users consistently speak about the value of upskilling and lifelong learning. They say upskilling is vital to success – particularly for those in analytics. Why? Because when it comes to analytics, curiosity is king. Digging deeper and trying new
Did you know that you can embed one graph inside another by using PROC SGPLOT in SAS? A typical example is shown to the right. The large graph shows kernel density estimates for the distribution of the Cholesterol variable among male and female patients in a heart study. The small
I doubt any reader of this post will be surprised to hear that we at SAS are 100% committed to the academic world. After all, our company was born almost half a century ago in the classrooms of a university in North Carolina. Since
I don't often use the SG annotation facility in SAS for adding annotations to statistical graphics, but when I do, I enjoy the convenience of the SG annotation macros. I can never remember the details of the SG annotation commands, but I know that the SG annotation macros will create
Many SAS procedures support a BY statement that enables you to perform an analysis for each unique value of a BY-group variable. The SAS IML language does not support a BY statement, but you can program a loop that iterates over all BY groups. You can emulate BY-group processing by
Because of market complexity and change, organizations around the world are seizing opportunities to make better predictions, identify solutions, and take strategic, proactive steps, which means they depend more and more on big data. However, in their pursuit of resilience
There are many ways to model a set of raw data by using a continuous probability distribution. It can be challenging, however, to choose the distribution that best models the data. Are the data normal? Lognormal? Is there a theoretical reason to prefer one distribution over another? The SAS has
Does anyone write paper checks anymore? According to researchers at the Federal Reserve Bank of Atlanta (Greene, et al., 2020), the use of paper checks has declined 63% among US consumers since the year 2000. The researchers surveyed more than 3,000 consumers in 2017-2018 and discovered that only 7% of
Some readers read the article “how-to-evaluate-sas-expression-in-data-step-dynamically” and wondered whether DS2 offers the same mechanism or functionality. As indicated in that article, SAS provides similar features in the DATA step, PROC CAS, and PROC Python, but some projects, such as ESP (Event Stream Processing) projects, would store those expression definitions in
A previous article shows how to use the MODELAVERAGE statement in PROC GLMSELECT in SAS to perform a basic bootstrap analysis of the regression coefficients and fit statistics. A colleague asked whether PROC GLMSELECT can construct bootstrap confidence intervals for the predicted mean in a regression model, as described in
I've written many articles about bootstrapping in SAS, including several about bootstrapping in regression models. Many of the articles use a very general bootstrap method that can bootstrap almost any statistic that SAS can compute. The method uses PROC SURVEYSELECT to generate B bootstrap samples from the data, uses the
It has been more than a decade since SAS 9.3 changed the default ODS destination from the old LISTING destination to more modern destinations such as HTML. One of the advantages of modern output destinations is support for Unicode symbols, superscripts, subscripts, and for formatting text by using boldface, italics,
In ordinary least squares regression, there is an explicit formula for the confidence limit of the predicted mean. That is, for any observed value of the explanatory variables, you can create a 95% confidence interval (CI) for the predicted response. This formula assumes that the model is correctly specified and
A SAS programmer wanted to use PROC SGPLOT in SAS to visualize a regression model. The programmer wanted to visualize confidence limits for the predicted mean at certain values of the explanatory variable. This article shows two options for adding confidence limits to a scatter plot. You can use a
The acceptance-rejection method (sometimes called rejection sampling) is a method that enables you to generate a random sample from an arbitrary distribution by using only the probability density function (PDF). This is in contrast to the inverse CDF method, which uses the cumulative distribution function (CDF) to generate a random
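Learning Data Science the Easy Way with SoDA #2. The previous post introduced what kind of product SoDA (SAS OnDemand for Academics) is, focusing on its advantages and why it is needed. This post covers how to sign up for the service. We first look at the browser environments in which SoDA is supported, and then walk through each step of the sign-up process. * Editor's note: In this article, 한노아 of the SAS Korea consulting division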
There are dozens of common probability distributions for a continuous univariate random variable. Familiar examples include the normal, exponential, uniform, gamma, and beta distributions. Where did these distributions come from? Well, some mathematician needed a model for a stochastic process and wrote down the equation for the distribution, typically by
Wilcoxon's signed rank test is a popular nonparametric alternative to a paired t test. In a paired t test, you analyze measurements for subjects before and after some treatment or intervention. You analyze the difference in the measurements for each subject, and test whether the mean difference is significantly different
A previous article discusses standardized coefficients in linear regression models and shows how to compute standardized regression coefficients in SAS by using the STB option on the MODEL statement in PROC REG. It also discusses how to interpret a standardized regression coefficient. Recently, a SAS user wanted to know how
A previous article shows an example of a Markov chain model and computes the probability that the system ends up in a terminal state (called an absorbing state). As explained previously, you can often compute exact probabilities for questions about Markov chains. Nevertheless, it can be useful to know how
A previous article shows how to model the probabilities in a discrete-time Markov chain by using a Markov transition matrix. A Markov chain is a discrete-time stochastic process for which the current state of the system determines the probability of the next state. In this process, the probabilities for transitioning
Given a set of N points in k-dimensional space, can you find the location that minimizes the sum of the distances to the points? The location that minimizes the distances is called the geometric median of the points. For univariate data, the "points" are merely a set of numbers $${p_1,
While writing an article about labeling a polygon by using the centroid, I almost made a false claim about the centroid. I almost claimed that the centroid is the point in a polygon that minimizes the sum of the distances to the vertices. It is not. The point that
A colleague asked how to compute the barycentric coordinates of a point inside a triangle. Given a triangle in the plane with vertices p1, p2, and p3, every point in the triangle can be represented as a convex combination of the vertices: c1*p1 + c2*p2 + c3*p3, where c1,c2,c3 ≥
Part of the power of the SAS ODS system is the ability to visualize data by using ODS templates. An ODS template describes how to render data as a table or as a graph. A lot of papers and documentation have been written about how to define a custom template
While writing an article about Toeplitz matrices, I saw an interesting fact about the eigenvalues of tridiagonal Toeplitz matrices on Nick Higham's blog. Recall that a Toeplitz matrix is a banded matrix that is constant along each diagonal. A tridiagonal Toeplitz matrix is zero except for the main diagonal, the
A Toeplitz matrix is a banded matrix. You can construct it by specifying the parameters that are constant along each diagonal, including sub- and super-diagonals. For a square N x N matrix, there is one main diagonal, N-1 sub-diagonals, and N-1 super-diagonals, for a total of 2N-1 parameters. In statistics and applied
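Learning Data Science the Easy Way with SoDA #1. SoDA (SAS OnDemand for Academics) is an educational data analysis software program that SAS provides free of charge. Over the next four posts, we will introduce 'Learning Data Science the Easy Way with SoDA' in detail. What is SoDA? SoDA (SAS OnDemand for Academics), so that beginners who are learning data science for the first time can learn SAS for free,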
A previous article explains the Spearman rank correlation, which is a robust cousin to the more familiar Pearson correlation. I've also discussed why you might want to use rank correlation, and how to interpret the strength of a rank correlation. This article gives a short example that helps you to