Learn how to fit a decision tree and use your decision tree model to score new data. In Part 6 of this series we took our Home Equity data saved in Part 4 and fit a logistic regression to it. In this post we will use the same data and
Search Results: forest plot (77)
Empirical Mode Decomposition (EMD) is a powerful time-frequency analysis technique that allows for the decomposition of a non-stationary and non-linear signal into a series of intrinsic mode functions (IMFs). The method was first introduced by Huang et al. in 1998 and has since been widely used in various fields, such as signal processing, image analysis, and biomedical engineering.
Did you know that about 8% of the world's men are colorblind? (More correctly, 8% of men are "color vision deficient," since they see colors, but not all colors.) Because of the "birthday paradox," in a room that contains eight men, the probability is 50% that at least one is
Since 2008, SAS has supported an interface for calling R from the SAS/IML matrix language. Many years ago, I wrote blog posts that describe how to call R from PROC IML. For SAS 9.4, the process of installing R and calling R from PROC IML is documented in the SAS/IML
Data is crucial for the development of artificial intelligence (AI) applications. However, the rapid availability of data is a challenge due to increasingly strict privacy regulations. A possible solution is to use synthetic data. Gartner predicts by 2024 that 60% of the data used to develop AI and analytics applications
Many modern statistical techniques incorporate randomness: simulation, bootstrapping, random forests, and so forth. To use the technique, you need to specify a seed value, which determines pseudorandom numbers that are used in the algorithm. Consequently, the seed value also determines the results of the algorithm. In theory, if you know
SAS' Ricky Tharrington and Jagruti Kanjia explain two ways bias shows up in model predictions.
SAS' Brian Gaines provides a primer on GAMs.
Technological advancements in connectivity and global positioning systems (GPS) have led to increased data tracking and related business use cases to analyze such movements. Whether analyzing a vehicle, an animal or a population's movements - each use case requires analyzing underlying spatial information. Global challenges such as virus outbreaks, deforestation
Many cities have Open Data pages. But once you download this data, what can you do with it? I'm going to download several datasets from Cary, NC's open data page, and try to give you a few ideas to get you started on your own data exploration! And what data
I hope you're all doing well, in this year of plagues and locusts! I'm sure I don't even need to mention which plague I'm talking about. But what about the locusts? Are you up on your entomological studies? Follow along, and see if you really know what locusts are... Locusts
Luego de otro largo lapso, termino publicando el siguiente artículo de la serie ¡Explícate!. En este veremos cómo la teoría de juegos nos da una mano para interpretar mejor nuestros modelos de machine learning, utilizando las ideas del premio nobel de economía Lloyd Shapley. Entenderemos los conceptos detrás de
"O Christmas tree, O Christmas tree, how lovely are your branches!" The idealized image of a Christmas tree is a perfectly straight conical tree with lush branches and no bare spots. Although this ideal exists only on Christmas cards, forest researchers are always trying to develop trees that approach the
In the preceding two posts, we looked at issues around interpretability of modern black-box machine-learning models and introduced SAS® Model Studio within SAS® Visual Data Mining and Machine Learning. Now we turn our attention to programmatic interpretability.
In the first of a three-part series of posts, SAS' Funda Gunes and her colleague Ricky Tharrington summarize model-agnostic model interpretability in SAS Viya.
Diversas urgencias laborales y personales hicieron que dejara de escribir este blog con mis humildes opiniones y aportes técnicos. Superadas las mismas, aquí estoy de regreso. No les diré que muuuuuuchos seguidores al estilo de Wos pidieron a gritos mi regreso, pero he de decirles que ante mi sorpresa varias
SAS' Kris Stobbe shows how you can predict survival rates of Titanic passengers with a combination of both Python and CAS using SWAT, then see how the models performed.
I've read several articles that mentioned the north magnetic pole has been moving more in the past few decades, than in the previous few hundred years. And as a Map Guy, I knew I just had to plot this data on a map, and see it for myself! I provide
I think it's time to replace my 2008 Prius. It has served me well, been basically maintenance-free, and gotten good gas mileage ... so, why not just get a newer Prius? Well, I've got the itch to get back into an SUV for my daily driver (I had a Bronco
The ODS Graphics software, first released with SAS 9.2, supported creating graphs directly from statistical procedures. Prior to this, very few statistical procedures created graphs on their own, and in most cases creating graphs was a post process or creating the graphs from the saved data using SAS/GRAPH procedures. With
머신러닝이 마케팅 생태계 내에서 지속적으로 발전함에 따라 현대화된 알고리즘 접근법의 해석력이 중요해지고 있습니다. 지난 번 게시했던 머신러닝 해석력 관련 블로그에서 인공지능(AI)과 머신러닝을 신뢰하기 위한 필수 조건, 데이터 세트를 이해하고 해석하는 방법, 그리고 머신러닝 모델의 작동 원리에 대한 인사이트를 도출하는 변수를 표시하는 방법에 대해 설명한 바 있는데요. “우리는 머신러닝에 의해 구동되는 애플리케이션에 둘러싸여 있으며,
The data I was analyzing was about “trust.” Maybe that’s what got me thinking about Stephen Sondheim, the Broadway composer and lyricist of musicals like Sunday in the Park with George and Into the Woods and the lyricist for West Side Story. Trust is a heavy emotional topic. Developmental psychologists
Many of the most beautiful areas in the US are owned by the government, to preserve them and allow access for everyone to enjoy them. And most US schools are traditionally closed during the summer, which provides families a great opportunity to go visit state and federal lands (parks, forests,
As a resident of Northern California, I was interested in learning more about the causes of wildfires. My area has recently experienced large fires that caused many residents to evacuate their homes and some who have even lost their lives. Last October there were more than 170 fires that burned
‘국영수코(co)’라는 신조어 들어보셨나요? 국어, 영어, 수학, 코딩(coding)의 약자인데요. 교육부의 ‘2015 개정교육과정’에 따라 올해 3월부터 중학생, 내년부터는 초등학교 5, 6학년 학생의 소프트웨어(SW) 교육이 의무화되면서 코딩 교육과 관련 자격증 열풍이 불고 있습니다. 미국, 영국 등 IT 선진국들은 이미 발 빠르게 코딩 교육을 의무화하며, 4차 산업혁명 시대의 인재 확보에 나섰는데요. SAS 역시 2014년, 학습이나 비상업적인
We have updated our software for improved interpretability since this post was written. For the latest on this topic, read our new series on model-agnostic interpretability. As machine learning takes its place in many recent advances in science and technology, the interpretability of machine learning models grows in importance. We
Happy holidays to all my readers! My greeting-card to you is an image of a self-similar Christmas tree. The image (click to enlarge) was created in SAS by using two features that I blog about regularly: matrix computations and ODS statistical graphics. Self-similarity in Kronecker products I have previously shown
This article demonstrates a SAS programming technique that I call Kuhfeld's template modification technique. The technique enables you to dynamically modify an ODS template and immediately call the modified template to produce a new graph or table. By following the five steps in this article, you can implement the technique
In growing areas such as the Research Triangle Park, there is always the tough decision between developing land for business, or keeping it natural for parks and recreation. SAS is fortunate to have the great hiking trails at Umstead Park just across the road to the north - it's the
I sit here fascinated, watching a world population clock. According to the US Census, a baby is born and a person dies every 7 and 12 seconds, respectively, in the United States. In 1950 our world had about 2.5 billion people; currently our population is over 7.3 billion. Estimates predict