![](https://blogs.sas.com/content/sastraining/files/2017/12/PatternSearch.png)
Finding a pattern like a phone number or national ID number embedded in text can be difficult and time consuming.
I recently read an interesting article about petroleum coke (petcoke). A lot of it is produced in the US, and lately a lot of it is consumed (burned) in India ... contributing to air pollution there. The article mentioned some numbers in the text, but the data was really begging to
A steady drumbeat of news coverage makes one thing clear: Opioid abuse is rising and has reached epidemic levels throughout our country. Overdoses from the diversion and abuse of prescription opioids are one cause of the surge in deaths. Overdoses from heroin and other illicit synthetic opioids (such as heroin,
The internet is rich with data, and much of that data seems to exist only on web pages, which -- for some crazy reason -- are designed for humans to read. When students/researchers want to apply data science techniques to collect and analyze that data, they often turn to
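Why data management matters: We now live in the era of the 'data economy,' in which data drives society and the economy. Market research firm IDC has forecast that the worldwide volume of data will grow about 30% every year and reach 163 zettabytes (ZB) by 2025, ten times the current amount. Gartner has gone so far as to define this exploding volume of big data as the 'crude oil of the 21st century.' But now, looking at big data not simply as 'content' but from the perspective of 'process' and 'infrastructure'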
Imputing missing data is the act of replacing missing data by nonmissing values. Mean imputation replaces missing data in a numerical variable by the mean value of the nonmissing values. This article shows how to perform mean imputation in SAS. It also presents three statistical drawbacks of mean imputation. How
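As a rough illustration of the definition in this teaser, here is a minimal Python sketch of mean imputation. It is not the SAS approach the article describes; the function name `mean_impute` and the sample values are hypothetical, and missing values are assumed to be encoded as `None` or NaN.

```python
import math
from statistics import fmean


def is_missing(value):
    """Treat None and NaN as missing (an assumption of this sketch)."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def mean_impute(values):
    """Return a copy of values with each missing entry replaced by
    the mean of the nonmissing entries."""
    observed = [v for v in values if not is_missing(v)]
    if not observed:
        raise ValueError("no nonmissing values to average")
    mean = fmean(observed)
    return [mean if is_missing(v) else v for v in values]


# Hypothetical sample: two of the five values are missing.
data = [2.0, None, 4.0, float("nan"), 6.0]
imputed = mean_impute(data)
```

Every imputed entry receives the same value, so the filled-in variable has less spread than the observed data.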
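The worst hurricane in 100 years strikes Puerto Rico. On September 20, the extremely powerful Hurricane Maria made landfall in Puerto Rico, a US territory between the North Atlantic and the Caribbean Sea. Maria was a Category 5 hurricane, the highest category, with winds above 185 miles per hour (295 km/h), and it left the worst damage in 100 years. What is more, the disaster struck just two weeks after Irma, a Category 5 hurricane dubbed a monster hurricane, hitting the 3.4 million residents with an enormous shock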
Until recently state-of-the-art for trade area analytics still meant analyzing historical store sales by location, together with some Nielsen market data to select merchandise assortments and allocation. Contrast that with the upcoming holiday season where retailers know where and how demand is initiated, and use that new understanding to create
Since Trump became the US president, many people have noticed that he posts a lot of tweets. While some people choose to analyze and critique the content of those tweets, I was more curious about something a little less controversial - the timing and frequency. Follow along as I dig into
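A Korean medical team recently drew wide attention by developing a new analytical indicator that can predict the likelihood that older adults with normal cognitive function will develop Alzheimer's dementia. Published in the world-renowned neuroscience journal Scientific Reports, the work is raising hopes that preventive measures against the onset of dementia will become possible. For many scientists, one of the most feared diseases in today's super-aged society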
David Loshin explains what it means to be a data-driven business by describing three different models.
There are so many reasons why SAS programmers love SAS -- as a matter of fact, I wrote a blog on it back in 2012. I now realize that I could've written a whole series, not just a single post. And with the recent publishing of my first book, Big Data
In this education analytics series of blog posts, we have been on a journey to learn how education customers are turning their data into insights to become more data-informed and analytical organizations. In my first five posts in the education analytics blog series, we learned how education customers are using SAS,
How do you define artificial intelligence? Would you define it differently if it was your job to prevent fraud and financial crimes, where the risks are constantly shifting? In a recent meeting with banking executives responsible for fraud and financial crimes risk mitigation, Wayne Thompson, Manager of Data Science Technologies
Whether or not to legalize marijuana is a hotly debated topic these days. And no matter which side of the debate you're on, I think you will be interested in seeing several ways to visualize which states have legalized marijuana, and when ... Their Version Here's the original graph that
November is National Diabetes Awareness Month. Did you know that, according to the American Diabetes Association, an estimated 30 million Americans, or 9.4 percent, have diabetes? Over ten years ago, two of my family members were diagnosed. Hearing this news was both scary and overwhelming for the entire family.
Missing values present challenges for the statistical analyst and data scientist. Many modeling techniques (such as regression) exclude observations that contain missing values, which can reduce the sample size and reduce the power of a statistical analysis. Before you try to deal with missing values in an analysis (for example,
SAS Viya is an exciting addition to the SAS Platform, allowing you to conduct analysis faster than ever before and providing you the flexibility to utilize open source technologies and generate insights from data in any computing environment. The blog post “Top 12 Advantages of SAS Viya” does a great
If you’re like me, you probably feel like there's more bad news than good in the world today. And, it makes me that much more grateful when I hear some good news. That’s part of why I love Giving Tuesday so much – it’s a day where my social media
You never know where you’ll see great teaching in action. That was made abundantly clear to me when my family ventured to rural Lillington, North Carolina to learn about falconry, civilization’s oldest form of hunting. We are not hunters ourselves, but my husband is fascinated by birds of prey and
The most highly anticipated business announcement this fall is probably the location for Amazon's second headquarters (dubbed HQ2). Amazon plans to spend $5 billion on their HQ2, and employ about 50,000 people in high-tech jobs. They received 238 proposals before their October 19 deadline, but haven't announced a winner yet.
SAS, the world's leading company in analytics and Big Data solutions, predicts the scenarios in which companies and businesses will operate next year. The need to move more decisively in their digital transformation processes and to try out new possibilities by getting more out of their
When you run an optimization, it is often not clear how to provide the optimization algorithm with an initial guess for the parameters. A good guess converges quickly to the optimal solution whereas a bad guess might diverge or require many iterations to converge. Many people use a default value
A car accident and a positive customer experience? How do those go together? A few months ago it finally happened: one moment of inattention, and I caused a rear-end collision in the city. What a pain! Not really serious, just a bit of body damage, but extremely annoying ... After the car had been towed and the initial anger had faded
If you're preparing a big Thanksgiving dinner, then you don't want to leave out the most popular side dish, do you?!? But what is the most popular side dish? ... If you don't already know, then perhaps some data & analytics can help! But before we get started, here's a
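This article was translated and edited by SAS Korea and was originally written by Rick Wicklin. The original is here. In this blog post, I would like to introduce a SAS programming technique for efficiently modifying ODS templates, known as Kuhfeld's Template Modification Technique (TMT). In just five steps, you can implement this technique with fewer than 20 lines of SAS code. The method is simple, but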
A statistical programmer read my article about the beta-binomial distribution and wanted to know how to compute the cumulative distribution (CDF) and the quantile function for this distribution. In general, if you know the PDF for a discrete distribution, you can also compute the CDF and quantile functions. This article
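To make the idea in this teaser concrete, here is a minimal Python sketch (not the article's SAS code) that builds a CDF and a quantile function for a discrete distribution from its PDF alone. The helper names `cdf` and `quantile` and the parameter values are hypothetical. The beta-binomial PDF uses the standard formula C(n,k) B(k+a, n-k+b) / B(a,b), and the quantile is assumed to be the smallest k whose CDF is at least p.

```python
import math


def log_beta(x, y):
    """Logarithm of the beta function B(x, y)."""
    return math.lgamma(x) + math.lgamma(y) - math.lgamma(x + y)


def beta_binomial_pdf(k, n, a, b):
    """Beta-binomial PDF: C(n, k) * B(k + a, n - k + b) / B(a, b)."""
    log_choose = (math.lgamma(n + 1) - math.lgamma(k + 1)
                  - math.lgamma(n - k + 1))
    return math.exp(log_choose + log_beta(k + a, n - k + b) - log_beta(a, b))


def cdf(k, pdf, support):
    """CDF at k: sum of the PDF over all support points <= k."""
    return sum(pdf(j) for j in support if j <= k)


def quantile(p, pdf, support):
    """Smallest support point whose cumulative probability is >= p.
    Assumes support is sorted in increasing order."""
    total = 0.0
    for k in support:
        total += pdf(k)
        if total >= p:
            return k
    return support[-1]  # guards against rounding when p is close to 1


# Hypothetical parameters: with a = b = 1 the distribution is uniform on 0..n.
n, a, b = 4, 1.0, 1.0
support = list(range(n + 1))
pdf = lambda k: beta_binomial_pdf(k, n, a, b)

cdf_at_2 = cdf(2, pdf, support)
median = quantile(0.5, pdf, support)
```

The same two helpers work for any discrete distribution with a finite, sorted support; only the `pdf` callable changes.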
Social media has brought anniversary dates to the forefront. Every day, my view of Google Photos or Facebook shows me a collection of photos from exactly some number of years ago to remind me of how good things were back then. These apps are performing the simplest of date-based math
Here in the US, we're preparing to celebrate the Thanksgiving holiday. Therefore this Thursday most families in the US will be having a big turkey dinner. Although I'm a bachelor guy and eat out all the time, I'm actually a pretty good cook - and I'd like to share with
I recently spent two days with an innovative communications customer explaining exactly what SAS analytics can do to help them take their advertising platform to a whole new level. Media meets data resulting in addressable advertising. SAS would essentially be the brain behind all their advertising decisions, helping them ingest