Sometimes when trying to fuzzy match names you want to fuzzy match just a portion of the name: for example, Family Name and/or Given Name. A common mistake that people make is to feed in the Family Name and Given Name columns separately into the Match Codes node instead of
Tag: data management
The phrase “business rules” is often loosely used. It can refer to things like constraints in a query, a data mapping, a data quality constraint, a data transformation, or a model. Business rules also reflect an enforced policy, a regulatory requirement and business constraints on model scores that trigger analytically-driven
Is your LASR implementation running short on memory? Since LASR tables are stored in memory, it can become scarce. So what can we do to minimize LASR table size and still get LASR’s legendary performance? Here are a few strategies on how to shrink LASR tables: Compression: When compression was
Sizing is a topic that solutions managers typically leave until the end after decisions about the application have been settled. But there are often many variables that can impact the final size requirement. We have seen across our customer base that sizing and the number of environments has been determined
I’ve spent some time over the past couple of months learning more about anonymization. This began with an interest in the technical methods used to protect sensitive personally-identifiable information in a SAS data warehouse and analytics platform we delivered for a customer. But I learned that anonymization has two rather different meanings; one in the
Solving the mystery of Malaysia Airlines Flight 370 hinges on the finding the plane's black boxes, or flight data and cockpit voice recorder. An airplane’s black box is something we hope never has to be used, but when there’s a problem, we sure are glad that it’s there. The black
In the movie, The Matrix: Reloaded, our heroes and the KeyMaker frantically navigated from world to world through a series of doors and locks trying to escape the villains. Fortunately for our heroes, the KeyMaker always had the right key on his ring, he just had to know what key
Although she’s an analyst, Anca Tilea estimates that she spends 80% of her time cleaning data. Tilea and co-author Deanna Chyn shared seven of their favorite methods for checking, cleaning and restructuring data. Attendees at MWSUG 2013 got a bonus tip: Ask SAS peers in one of the SAS Support
I'm happy to announce the SAS Data Management support community has a new look and feel! And there’s lots of additional content and resources now too. The SAS Data Management community on support.sas.com is a central hub for anyone interested in SAS data access, integration, quality and governance. Community Manager