All Posts

"Code golf" is a fun programming pastime that challenges you to solve a problem with the least amount of code possible. Like regular golf, the goal is to use fewest code "strokes" to hit the mark. Here's a recent challenge that was posted to me via Twitter. @cjdinger @SASJedi got

Joyce Norris-Montanari shares tips to help those who are still working to address GDPR requirements.

Quando, nos filmes vemos Tony Stark a combater o crime com a ajuda não só do Iron Man, mas também do Jarvis, um sistema de inteligência Artificial (AI) que gere basicamente toda a sua vida, imaginamos o tipo de ajuda que as autoridades poderiam ter no combate ao crime ou

Which character variables have the highest frequency count? You can easily determine this using a variety of procedures that calculate frequency count. For example, the FREQ Procedure or the MEANS Procedure. This blog post illustrates this process through two examples.

La Cuarta Revolución Industrial, el Internet de las Cosas y la Inteligencia Artificial son conceptos que hacen parte de la transformación digital, y tienen el poder de cambiar los modelos de negocios de una organización. Para que esta era ayude a que las empresas sigan el ritmo de las demandas

See how to sample unstructured (text) data using SAS Viya and CAS actions. This post includes complete code to cluster the text documents via k-means, and treats the cluster memberships as strata for analysis.

This article shows how to implement balanced bootstrap sampling in SAS. The basic bootstrap samples with replacement from the original data (N observations) to obtain B new samples. This is called "uniform" resampling because each observation has a uniform probability of 1/N of being selected at each step of the

텍스트 분석과 자연어처리(NLP; Natural Language Processing)는 소비자의 감성을 분석하는 방법으로 널리 알려져 있습니다. 실제 많은 기업이 비정형 데이터 분석 기술을 이용해 소셜 미디어(SNS) 상의 고객 불만이나 댓글을 분석하고 적절히 대응하고 있는데요. 오늘날 전체 데이터의 90% 이상은 텍스트, 음성, 이미지, 영상 등 구조화되어 있지 않은 비정형 데이터입니다. 수동 분석만으로 이 모든

SAS batch jobs can generate many log files that accumulate over time. In this post, we present a SAS program that cleans up old log files on your system.

The Base SAS DATA step has been a powerful tool for many years for SAS programmers. But as data sets grow and programmers work with massively parallel processing (MPP) computing environments such as Teradata, Hadoop or the SAS High-Performance Analytics grid, the data step remains stubbornly single-threaded. Welcome DS2 –

Like hiking without GPS directions, Jim Harris says businesses that don't use streaming data analytics will be lost in a crowd of competitors.

What’s the key to progressing your career? If you ask Manish Desai, a Sr. Director of Consulting, SAS Asia Pacific, based in India, it’s conquering the fear of failure and embracing opportunity – with a great mentor by your side. I sat down with Manish to hear more about his

Der neue International Financial Reporting Standard (IFRS 17) stellt Versicherer vor Veränderungen, wie sie die Branche selten zuvor erlebt hat. Die Vorgaben sollen die Transparenz im Finanzberichtswesen verbessern und außerdem eine Grundlage für die Vergleichbarkeit mit anderen Branchen schaffen. Analysten und andere Branchenspezialisten haben diese Vergleichbarkeit schon seit langer Zeit

My colleague Robert Allison recently blogged about using the diameter of Texas as a unit of measurement. The largest distance across Texas is about 801 miles, so Robert wanted to find the set of all points such that the distance from the point to Texas is less than or equal

You can use PROC SGPLOT, BY variables, and a SG annotation data set together to put separate annotations into each BY group. However, you need two more steps to make it happen. This post shows all of the steps necessary to put different annotations into each graph when you have a BY variable.