Introducing SAS Information Management

Several of my recent posts have introduced the need for leveraging a strategic approach to information management backed by industry luminaries such as Gartner. Now it’s time to take a look at our definition of Information Management and discuss how it can be used by IT to effectively manage their information infrastructure and by the business to drive better decisions based on trusted insight.

SAS Information Management provides

unified technology solutions and strategy and implementation services

…that allow organizations to fully exploit and govern their information assets

…resulting in competitive differentiation and sustained business success.

Read More »

Post a Comment

#GartnerChat on Big Data

I participated in a lively TweetJam discussion sponsored by Gartner last Friday. In addition to learning that the iPhone twitter app is not the easiest way to read and tweet quickly, there were a number of interesting observations shared by the participants, along with a good bit of humor.

We discussed the definition and hype of Big Data, considered the role of the Data Scientist, and tweeted about the role of data quality in the context of Big Data.

Here is an idea of the tweets that were shared by 6 Gartner analysts and a number of other big data experts (click on the image for better resolution):

If you are interested in the a summary of the discussion, check out one of these blog posts

Highlights from Today's #GartnerChat on Big Data - Doug Laney, Gartner

HoardaBytes and the Big Data Lebowski - Jim Harris, OCDQ Blog

#GartnerChat's are held on Friday's at 12 noon EST.

Post a Comment

The end of tactical Data Integration

My last post focused on the business and technical aspects that are driving the need for a more strategic approach to managing data and information assets. This post discussed the natural  progression from data integration to data management.  The data integration market has gone through evolutionary changes over the last several years. This may seem like old news because many vendors have moved (or at least their marketing efforts have moved) to a unified data management approach that provides capabilities that span data integration, data quality, MDM, etc. That's all well and good, but as Gartner noted, it's not enough to address the challenges of the 21st century.

This becomes more clear when you think about your data - what exactly are you doing with your data? What is the ultimate goal of your organization?  Most organizations are not in the data business. What I mean by that is that most organizations are not driving revenue by selling their data. Most organizations are trying to turn their data into actionable information so that they can make better decisions faster than their competition. They are looking to treat their information as a strategic asset that will lead to competitive differentiation. Although data management disciplines like data integration, data quality, etc., are important, the end result of these activities is unlikely to lead to a high level of business differentiation. They are instrumental building blocks, but a more comprehensive information management approach that considers analytics and decisioning is necessary.

Let's take a look at the evolution:

Read More »

Post a Comment

"Outdated Data Management" & the 21st Century

Following on my Data Integration is Old News post, we are advising organizations to move to a more strategic way of managing their data, an approach that we refer to as Information Management. Going forward, you can expect to hear much more about this topic from SAS, but I thought I'd start the discussion with this series of blog posts:

  1. Business and technical factors that are driving a need for a strategic Information Management approach - this post
  2. Evolution from reactive Data Integration to "managed" Data Management to strategic Information Management
  3. SAS Information Management defined

So let's jump into the business and technical factors driving the need for strategic Information Management...

The rate of change in today’s business climate is staggering – competitive and economic pressures are constantly increasing. The rate, diversity and complexity of data that is coming at organizations is unrelenting. The need to transform this data to information and insight that drives business success is more important than ever. The need to view information as a strategic asset, that allows organizations to make accurate decisions within the decision window of the competition is key to success.  It’s not about the data or how much you can process, it’s about capitalizing on all the data assets that are available to organization to provide insight and drive fast and accurate decisions.


Gartner sums this up in recent research:

"Leveraging information will continue to fuel business success. But the growth in information volume, velocity, variety and complexity and the new information use cases makes information management infinitely more complex than it has been in the past. In addition to the new sources and the increased demand for multiple context delivery, shareability and reuse, practically all information assets must be available for delivery through varied, multiple and concurrent channels and mobile devices. To deal with these new demands, the IT organization needs to dramatically modernize its IT systems, transforming outdated data management infrastructure and replacing it with a more up-to-date and superior information environment able to support an entirely new set of requirements."

Source: Gartner, Information Management in the 21st Century, Regina Casonato, Anne Lapkin, Mark A. Beyer, Yvonne Genovese, Ted Friedman September 2, 2011


SAS considers the following factors since they represent the diversity or drivers as well as breadth of scope that call for a comprehensive information management approach:

Read More »

Post a Comment

EL-T Technique 1: Batch Loads with Teradata Example

This is the first post of a multi-part series on EL-T integration. Lately I’ve been inundated with requests for information about SAS’ ability to do EL-T style (Extract Load then Transform) integration with Data Integration Studio, especially in Teradata shops that use SAS, so we thought it would be useful to show how to perform EL-T style integration with SAS' Data Integration Studio.   As most data warehouse architects will tell you, a staging layer is required architectural component when dealing with data of size especially if you need to scale your integration processes. Staging is where raw data is first landed from extraction processes. These can be flat files, data sets, excel files, unstructured data etc. With EL-T style of processing we want to take advantage of the simplicity of SQL and the power of relational database technology.   You’ll be hard pressed to find folks more passionate about EL-T than our friends at Teradata.

A common load strategy that many of SAS & Teradata customers use is “Mini Batch”. With Mini Batch large or small amounts of data is appended to preexisting tables. Data is fed into staging tables, where it is structured and cleansed as it is merged with target tables, finally base views and application views are created on top.  As a best practice, users shouldn’t have direct access to tables; instead they should access data through at least one layer of base views. In Teradata all tables need to have ACCESS LOCKS before querying them in order to maintain a read-consistent view. Let’s see how we can do this in DI Studio…

Suppose I have a file of trades that lands a couple times a day, here is one way to use a mini-batch approach to populate data in Teradata.

Our file…

Step 1. Register the incoming files with Data Integration Studio.

Read More »

Post a Comment

Hurwitz on SAS & Big Data: Experience Matters

Mike Ames and I recently had an opportunity to talk to Fern Halper and Judith Herwitz from Hurwitz & Associates as they are doing a 4 part blog series on vendor views on big data and big data analytics. You can view Fern's blog post about the SAS perspective here.

Here are the highlights from our conversation:

Decision making is the key - Although the technical aspects of big data are interesting, and Hadoop is all the rage, it's really about analyzing big data so that organizations can make better decisions faster - within the decision window of their competition.

One size fits all doesn't work - We spoke about how you can't take a one size fits all approach to big data, that the approach that you leverage should be driven by your business goals translated into an analytical and technical requirements.

SAS High Performance Computing enables complete dataset support - For some scenarios, being able to factor in the complete dataset into the analytics is key. This isn't just about factoring in all forms of data into analytics like social media data, device data, etc., it's about leveraging an infrastructure that allows you to perform analysis at a very detailed level. For example, doing pricing optimization at an individual sku/store level vs. product category/region. It's also about being able to do variable selection on a massive vs. limited scale, being able to operationalize the analytics process for production, etc.

The 4th V is Relevance - Everybody recognizes volume, variety and velocity, but what is critically important in big data scenarios is to identify relevance and to understand that relevance changes over time. Building an infrastructure that can complement complete dataset support by identifying and ensuring that the relevant data is constantly available to your analytics infrastructure is becoming increasingly important. We refer to this information management style as "stream it, score it, store it", where we leverage the power of analytics on the front end of the information lifecycle to identify relevant data. And instead of using a simple filter to do this, we leverage analytics to define relevance based on extensive organizational knowledge. That results in information that is relevant to your organization or business vs. a generic Google style search. As events occur that change the relevance of data, data that was previously secondary in nature can be factored into the analytical process.

It's also great that Fern recognizes that although many vendors are jumping on the big data bandwagon, that SAS "has been growing its big data capabilities through time, all of the technologies are delivered or supported based on a common framework or platform."

And, I love her final quote: "Experience matters.  Enough said for now."

Make sure that you read Fern's complete post located here and note that there are many other blog posts on big data and Hadoop in the Information Architect blog.

Post a Comment

Data Integration is old news!!

This may be controversial coming from someone that has worked with data his entire career, someone that has been involved with software vendors focused on data integration and access for the last 6 years, and someone that is responsible for product marketing for SAS data management capability, but I’ll say it anyway… “Data Integration is just not that interesting”.

While there have been many good developments relating to data integration…

  • Moving from silo’d data integration capabilities to a unified platform that includes data integration, data quality, MDM, data governance features, etc.
  • Introduction of ELT as a mechanism to better leverage the database capabilities.
  • Expansion of data integration methods to include virtualization, event processing, etc.
  • Products that integrate cloud and on-premise resources.
  • Etc….

I find it much more interesting to consider the strategic value of information while encouraging organizations to implement strategies, capabilities and processes necessary to manage information as a strategic asset. Just as organizations moved from a data integration to a data management mindset, it’s now important to consider a more strategic, information management approach.

While I plan to address information management as a critical topic over the next several months, I thought I’d start by identifying key information management considerations.

Read More »

Post a Comment

7 keys to high performance data management for advanced analytics

Various research efforts and customer conversations clearly indicate that the data exploration and data preparation stage of the analytic lifecycle is complex and time consuming. Scarce data scientist or data analyst resources are spending the majority of their time on data preparation tasks vs. spending their time on deriving insight that can drive the business.  One aspect of addressing this challenge is to ensure that the data management processes that are used to support advanced analytics are optimized for performance and scalability. This topic will become more important as organizations tackle big data – not only big data volume, but also the variety of data sources such as social media sources.

We have sponsored a monograph report with David Stodder, from TDWI, to explore this topic in more depth. The report, Seven Keys to High Performance Data Management for Advanced Analytics, will soon be available and a webinar will be held on Wednesday, December 15th at 12 PM EST / 9 AM PST to preview the paper.

Dave will examine seven key technology trends in high performance computing that are changing the landscape for advanced analytics and enabling organizations to solve pressing data management challenges they are facing with traditional ETL and data warehousing systems.

The webinar will cover:

  • Seven steps you can take to leverage high-performance computing for advanced analytics
  • How in-database processing and ELT can increase the speed and manageability of analytics
  • Why in-memory processing is important for complex analytical query performance and how you can avoid potential problems with this approach
  • How workload management can help you gain the benefits of grid and parallel computing for advanced analytics

To register, click here.

Check back here for the link to the published report.

Post a Comment

Top 10 IT considerations for analytics in 2012

It’s hard to believe, but now that 2011 is almost over it’s time to look ahead. The technology pundits are starting to publish their 2012 predictions, and it’s not surprising to see topics like analytics, cloud, big data, mobile, social networking, virtualization, open source on these lists. Instead of creating another list of predictions, I thought I’d comment on analytic trends from an IT perspective. So here is my top 10 list of analytics considerations for IT.

1. Cloud: Think about how cloud technology should be used as part of your analytics infrastructure

2. Enterprise Architecture Approach: Leverage an enterprise architecture approach to your analytics infrastructure

3. Big Data: Start a pilot project leveraging big data possibly with unstructured text and Hadoop

4. Information Management: Start taking a strategic approach to data with information management

5. Open Source: Avoid the hype, think about your overall requirements and experience when considering open source

For the complete list of 2012 considerations and for practical advice for each consideration, please read on...

Read More »

Post a Comment

Three pillars of better business outcomes: information management, analytics, and campaign optimization

Lately, in consulting with customers about SAS solutions, I’m increasingly seeing discussions revolve around 3 inter-related but distinct segments: Information Management, Analytics, and Campaign Optimization. In the past, these discussions might have happened with 3 different audiences in 3 different meetings, but we are seeing a massive convergence across these areas, and SAS is well positioned to have that conversation.

Information Management drives the foundation of an agile, flexible, integrated, and high quality data platform. This foundation drives more accurate and effective analytics, which in turn drives better business outcomes in the form of more appropriate treatment of customers using the most appropriate channel. Making a timely, relevant offer to your customers is the best way to increase retention, improve the customer experience and increase revenue. It’s so easy to forget when you are down in the weeds with your data, for example, determining what attributes constitute your master entity for Customer, that it’s absolutely essential that IT aligns directly with the strategic goals of your company and that you govern your data with the focus of delivering business value.

As leaders in the Analytics, Customer Intelligence, Data Governance, Data Integration, and Data Quality spaces, SAS is uniquely situated to provide a comprehensive and integrated solution to meet these needs. No-one else knows better how to prepare data for analytical usage, how to apply predictive analytics to select the right campaign, or, more tactically speaking, how to embed scoring inside of an ETL or DQ job. As our CMO, Jim Davis, has said in this entertaining video: we don’t sell hardware, we don’t sell databases, we sell analytical solutions that drive better business outcomes.

What are some of the challenges you are seeing today? What are the 3 main pillars that drive business value for your organization?

Post a Comment