How can you generate data that contains outliers in a simulation study? The contaminated normal distribution is a simple but useful distribution you can use to simulate outliers. The distribution is easy to explain and understand, and it is also easy to implement in SAS. What is a contaminated normal
Search Results: simulation (462)
Imagine making $50K a day out of thin air. Did you know that NASDAQ routinely processes around 10,000,000 trades a day? What if instead of rounding cents for each transaction, market makers truncated fractions of cents in the amount they owe you? Under the assumption that each transaction, on average,
IDV (Individuelle Datenverarbeitung) ist ein Thema, das in den Banken als Teil von BCBS 239 seit Langem kritisch diskutiert wird. Ruppert Jaeschke betreut und berät seit fast 15 Jahren zahlreiche deutsche Banken im Umfeld Business Intelligence und SAS. Er hat eine klare Meinung zu diesem regulatorischen Thema. Frage: Welche Schmerzen
The 2016 INFORMS Annual Meeting will be held at the Music City Center and Omni Nashville Hotel in downtown Nashville, TN on November 13-16, with pre-conference events starting on Saturday, November 12. SAS will be a major participant in this conference. Over two dozen people from SAS will attend, with
Being an Eagle Scout, the data for good movement caught my attention. I wondered if I could apply my computer skills in a way that might help. How about showing people better ways to visualize HIV/AIDS data - that might help doctors better understand the data, and therefore better treat
In a previous blog article, Risk Data Aggregation and Reporting – Why now more than ever? – Part 1, I have discussed the requirements of the paper of the Basel Committee numbered 239 and one of the reasons the execution or application of the principles discussed therein is important for
Per tre giorni, la capitale italiana ospiterà Analytics Experience, evento in cui professionisti IT, figure accademiche, business user ed executive da tutto il mondo si riuniscono con l’obiettivo di approfondire lo stato dell’arte degli analytics. Un mix di keynote strategici, storie di successo internazionali, corsi di formazione, certificazione SAS, update
With my first open source software (OSS) experience over a decade ago, I was ecstatic. It was amazing to learn how easy it was to download the latest version on my personal computer, with no initial license fee. I was quickly able to analyse datasets using various statistical methods. Organisations
Are you ready to expand your programming skills and become a more versatile programmer? Then this new (and free!) course might be for you. SAS Programming for R Users is a free course aimed at helping R programmers who want to learn SAS. The goal is for you to be comfortable accomplishing
Although statisticians often assume normally distributed errors, there are important processes for which the error distribution has a heavy tail. A well-known heavy-tailed distribution is the t distribution, but the t distribution is unsuitable for some applications because it does not have finite moments (means, variance,...) for small parameter values.
Last week I showed how to compute nearest-neighbor distances for a set of numerical observations. Nearest-neighbor distances are used in many statistical computations, including the analysis of spatial point patterns. This article describes how the distribution of nearest-neighbor distances can help you determine whether spatial data are uniformly distributed or
Analytics Experience 2016 will be held on Sept. 12-14, 2016 at the Bellagio in Las Vegas, NV. There will be a great number of excellent talks and demonstrations at the conference, covering many aspects of SAS analytics and many practical applications. Several of these sessions deal directly with the use
A common question is "how do I compute a bootstrap confidence interval in SAS?" As a reminder, the bootstrap method consists of the following steps: Compute the statistic of interest for the original data Resample B times from the data to form B bootstrap samples. How you resample depends on
Letztens ist mir das Buch des Naturwissenschaftlers und Comedians Vince Ebert in die Hände gefallen. Es war anfangs sehr lustig und unterhaltsam, bis zu dem Kapitel, in dem es um das Thema Big Data ging. Danach führe die Analyse großer Datenmengen dank des Phänomens „Zufall“ zum Big Fail. Im Folgenden möchte
The digital disruption is creating unforeseen events, such as new competitors, products and services that threaten the performance and positioning of consolidated players. Big data and analytics prove themselves, through successful user cases, as the answer to intercept the demand, prevent churn, draw an integrated view of the customer, manage
Multi-echelon inventory optimization is ever more a requirement in this era of globalization, which is both a boon and bane for manufacturing companies. Optimizing pricing is also important. Global reach allows these companies to expand to new territories but at the same time increases the competition on their home turf.
Stellen Sie sich vor, Sie sind frühmorgens mit dem Auto „ab in den Urlaub“ gefahren. Durch vorausschauende Routenplanung sind Sie den größten Staurisiken glücklich ausgewichen und nähern sich bei geschätzten 38°C der letzten Landesgrenze vor Ihrem Urlaubsziel. Die ganze Familie sitzt mit ausgelassener Stimmung im vollgepackten Auto. Die Kinder auf
Starting in 2018, IFRS 9 will require banks around the world to change their processes for accounting of credit risk. This new impairment standard will move banks from the backward looking incurred loss model into a forward looking Expected Credit Loss (ECL) modelling approach. When talking with banks around the
"Shall we play a game?" If you’re a child of the ’80s like me, you might recognize this famous line from the movie WarGames. This innocent-sounding question comes not from one of the movie’s human stars, but from a military super-computer named Joshua, after a bored high school student, played
Financial institutions evaluating fraud management solutions face a crowded vendor landscape. Dozens of vendors claim to offer various pieces of the puzzle. With so many choices available, how will you sort through the marketing rhetoric to find the best fit for your organization? You could assemble a team of analysts
Optimization is a primary tool of computational statistics. SAS/IML software provides a suite of nonlinear optimizers that makes it easy to find an optimum for a user-defined objective function. You can perform unconstrained optimization, or define linear or nonlinear constraints for constrained optimization. Over the years I have seen many
Phase 2 von IFRS9 wird nun akut. Und damit steht jetzt auch das Thema Impairment im Mittelpunkt. Der Go-Live für Banken ist für das Jahr 2018 geplant. Bis dahin gibt es jedoch noch einiges zu tun. Viele Projekte beschäftigen sich mit der Umsetzung und mit der Auswahl der Software. Potenzielle
Last week I attended SAS Global Forum 2016 in Las Vegas. I and more than 5,000 other attendees discussed and shared tips about data analysis and statistics. Naturally, I attended many presentations that featured using SAS/IML software to implement advanced analytical algorithms. Several speakers showed impressive mastery of SAS/IML programming
Machen wir doch gemeinsam eine kleine Zeitreise in die Zukunft. Es ist Freitag, der 29. April 2016, der Tag nach der größten Konferenz für Business Intelligence im deutschsprachigen Raum, dem SAS Forum Deutschland. Und wir lauschen dem Bericht zweier Teilnehmer des SAS Forum Deutschland, die sich über die Highlights des
This year's SAS Global Forum conference will take place April 18-21 at The Venetian in Las Vegas. For SAS/OR, SAS staff will present two Super Demos and three papers:
I saw an interesting mathematical result in Wired magazine. The original article was about mathematical research into prime numbers, but the article included the following tantalizing fact: If Alice tosses a [fair]coin until she sees a head followed by a tail, and Bob tosses a coin until he sees two
SAS will have a major presence at the 2016 INFORMS Conference on Business Analytics and Operations Research, which will be held at the Hyatt Regency Grand Cypress hotel in Orlando, FL on April 10-12. Many SAS staff will participate in this conference. SAS/OR, the SAS Global Academic Program, and JMP
It is easy to generate random points that are uniformly distributed inside a rectangle. You simply generate independent random uniform values for each coordinate. However, nonrectangular regions are more complicated. An instructive example is to simulate points uniformly inside the ball with a given radius. The two-dimensional case is to
There are several ways to simulate multinomial data in SAS. In the SAS/IML matrix language, you can use the RANDMULTINOMIAL function to generate samples from the multinomial distribution. If you don't have a SAS/IML license, I have previously written about how to use the SAS DATA step or PROC SURVEYSELECT
Today is March 14th, which is annually celebrated as Pi Day. Today's date, written as 3/14/16, represents the best five-digit approximation of pi. On Pi Day, many people blog about how to approximate pi. This article uses a Monte Carlo simulation to estimate pi, in spite of the fact that