SAS Blogs

Data Management

Helmut PlinkeDecember 5, 2016 0

How SAS supports the four pillars of a data quality initiative

Data quality initiatives challenge organizations because the discipline encompasses so many issues, approaches and tools. Across the board, there are four main activity areas – or pillars – that underlie any successful data quality initiative. Let’s look at what each pillar means, then consider the benefits SAS Data Management brings

English

Learn SAS

BrennaDecember 2, 2016 0

5 Reasons to write your first SAS Press book

Editor's note: This series of blogs addresses the questions we are most frequently asked at SAS Press! Ever thought about writing your own SAS or JMP book? Here are a few reasons why writing a SAS Press book can be a fantastic career move! 1. Your book establishes you as

English

Data Management | Programming Tips

Chris HemedingerDecember 2, 2016 0

Reading data with the SAS JSON libname engine

JSON is the new XML. The number of SAS users who need to access JSON data has skyrocketed, thanks mainly to the proliferation of REST-based APIs and web services. Because JSON is structured data in text format, we've been able to offer simple parsing techniques that use DATA step and

English

Analytics

Rodney CarsonDecember 2, 2016 0

Data integration can alleviate citizen frustrations with government

Data integration helps a successful business make things simple and quick for customers, and keeps them coming back. While a company will have data silos, data held within one area is made available to others in order to help the customer. In most local, county and state governments that is

English

Government

Data Management

Joyce Norris-MontanariDecember 2, 2016 0

Importance of metadata – Bridging the gap (Part 1: source system)

Traditional data management includes all the disciplines required to manage data resources. More specifically, data management usually includes: Architectures that encompass data, process and infrastructure. Policies and governance surrounding data privacy, data quality and data usage. Procedures that manage a data life cycle from creation of the data to sunset

English

Data Management

小林泉December 1, 2016 0

Hadoopだからこそ必要なセルフサービス－そしてアダプティブ・データマネジメントの時代へ

2014 およそ2014年からSAS on Hadoopソリューションを本格展開してきました。時代背景的には、2014頃は依然として、業態の特性からデータが巨大になりがちで、かつそのデータを活用することそのものが競争優位の源泉となる事業を展開する企業にHadoopの活用が限られていたと思います。その頃は、すでにHadoopをお持ちのお客様に対して、SASのインメモリ・アナリティクス・エンジンをご提供するというケースが大半でした。その後、急速にHadoopのコモディティ化が進んだと感じます。 2015 2015頃になると、前述の業態以外においてもビッグデータ・アナリティクスの成熟度が上がりました。データ取得技術の発展も伴い、これまで活用していなかった種類や量のデータを競争優位性のために活用を志向するようになり、蓄積および処理手段としてのHadoopの選択が加速します。この頃になると、数年前には必ずあったHadoopそのものの検証ステップを踏まない企業が増えてきます。データ量、処理規模、拡張性、コスト効率を考えたときに妥当なテクノロジーがHadoopという結論になります。ビッグデータはデータのサイズだけの話ではありませんが、筆者の足で稼いだ統計によると、当時大体10TBくらいが、従来のテクノロジーのまま行くか、Hadoopを採用するかの分岐点として企業・組織は算段していたようです。この時期になると、従来のテクノロジーの代替手段としてのHadoopの適用パターンが見えてきました。新しいデータのための環境従来捨てていた、あるいは新たに取得可能になった新しいデータをとりあえず蓄積して、何か新しいことを始めるためのある程度独立した環境として、コスト効率を考慮してHadoopを採用するパターン既存のデータウェアハウスへ価値を付加（上の発展形であることが多い）新たなデータを使用してHadoop上で加工し、アナリティクス・ベーステーブルにカラムを追加し、アナリティクスの精度を向上 ETL処理負荷やデータ格納場所のHadoopへのオフロード BI & アナリティクスの専用基盤 SQLベースのアプリケーションだけをRDBMSに残し、その他の機械学習、ビジュアライゼーションなどSQLが不向きな処理をすべてHadoop上で実施多くは、インメモリアナリティクスエンジンと併用データレイク（筆者の意見としては）いざ新しいデータを使用しようと思ったときのスピード重視で、直近使用しないデータも含めて、全てのデータを蓄積しておく。よくあるのが、新しいデータを使用しようと思ったときには、まだデータが蓄積されておらず、利用開始までタイムラグが生じてしまうケース。その時間的損失すなわち利益の喪失を重要視し、そのような方針にしている企業が実際に当時から存在します。 2016 海外の事例等では数年前から見られましたが、2016になると、日本でも以下の傾向が見られます既存Hadoopをそのコンセプトどおりスケールアウトしていくケースグローバル・データ・プラットフォームとして、複数のHadoopクラスターを階層的に運用するケース AI、機械学習ブームにより機械学習のためのデータの蓄積環境として IoTの流れにより、ストリーミング処理（SASでいうと、SAS Event Streaming Processingという製品です）と組み合わせてまさに、Hadoopがデータプラットフォームとなる時代がやって来たと思います。その証拠に、SAS on Hadoopソリューションは、日本においても、金融、小売、通信、サービス、製造、製薬といったほぼ全ての業種において活用されています。 Hadoopの目的は、従来型のBI・レポーティングではなく、アナリティクスこのような流れの中で、Hadoopの採用には一つの確固たる特徴が浮かび上がっています。もちろん弊社が単にITシステムの導入をゴールとするのではなく、ビジネス価値創出を提供価値のゴールにしているというバイアスはあるのですが。。。 Hadoopの導入目的は、ビジネス価値を創出するアナリティクスのためであることがほとんどであるしたがって、Hadoopに格納されるデータには主にエンドユーザーがアナリティクス観点の目的志向でアクセスするケースがほとんどであるつまり、ある程度の規模のITシステムではあっても、Hadoopに格納されるデータはアナリティクスの目的ドリブンでしかアクセスされません。主たるユーザーは、分析者やデータ・サイエンティストです。彼らが、「使いたい」と思った瞬間にアクセスできる必要があるのです。このようなユーザーサイドのリクエストは、従来のBIすなわちレポーティングのような固定化された要件定義をするような依頼ではないため、その都度従来のようにIT部門と要件をすり合わせて、IT部門にお願いするという方法では成り立ちません。その数日、数週間というリードタイムが意思決定を遅らせ、企業の業績に悪影響をもたらすからです。あるいはIT部門の担当者を疲弊させてしまいます。つまり、アナリティクスにおいては、分析者・データサイエンティストが自分自身で、Hadoop上のデータにアクセスし、必要な品質で、必要な形式で、必要なスピードで取得するために自由にデータ加工できる必要があるのです。このあたりの話については、下記でも紹介していますので、是非ご覧ください。【ITmedia連載】IT部門のためのアナリティクス入門第2回やっと分かった　ビッグデータアナリティクスでHadoopを使う理由第3回データ分析で成功するためのデータマネジメントとIT部門の新たな役割【関連ブログ】アナリティクスの効果を最大化するデータマネジメント勘所これが、Hadoopにおいて、セルフサービス・データマネージメント（データ準備）ツールが不可欠な理由です。SASはアナリティクスのソフトウェアベンダーとして、このHadoop上でITスキルの高くない分析者・データサイエンティストでも自分自身で自由にデータを取得できるツールを開発し提供しています。それが、SAS Data Loader for Hadoopです。 SAS Data Loader

Japanese

Advanced Analytics

Muhammad Asif AbbasiDecember 1, 2016 0

SAS integration with Hadoop - one success story

Nearly every organization has to deal with big data, and that often means dealing with big data problems. For some organizations, especially government agencies, addressing these problems provides more than a competitive advantage, it helps them ensure public confidence in their work or meet standards mandated by law. In this

English

Data Visualization

Sanjay MatangeDecember 1, 2016 0

Mixing plots with different classification

One of the key benefits of creating graphs using GTL or SG Procedures is their support of plot layering to create complex graphs and layouts. Most simple graphs can be created by a single plot statement like a Bar Chart. Complex graphs can be created by layering appropriate plot statements to

English

Analytics

SAS PolandDecember 1, 2016 0

My Top Analytics Books (2) How-to Guides to Business Analytics

My last post described my top general business analytics books, those that would appeal to business leaders and analysts alike. This post is a bit more specific, and covers books that will help you to learn for yourself. It is therefore mainly aimed at analysts — but I still hope

English

Analytics | Data Visualization

Robert AllisonDecember 1, 2016 0

My top 10 graph blog posts of 2016!

When I was a kid, I always looked forward to Casey Kasem's American Top 40 song countdown at the end of the year. Did I listen to check whether my favorite songs had made the list, or to critique how well the people making the list had done in picking the 'right'

English

Programming Tips

Chris HemedingerNovember 30, 2016 0

Using the DATA step debugger in SAS Enterprise Guide

In my earlier post about WHERE and IF statements, I announced that the DATA step debugger has finally arrived in SAS Enterprise Guide. (I admit that I might have buried the lead in that post.) Let's use this post to talk about the new debugger and how it works. First,

English

Data Management | Programming Tips

Mary Kathryn QueenNovember 30, 2016 0

Securing sensitive data using SAS Federation Server at the data source level

Data virtualization is an agile way to provide virtual views of data from multiple sources without moving the data. Think of data virtualization as an another arrow in your quiver in terms of how you approach combining data from different sources to augment your existing Extract, Transform and Load ETL batch

English

Analytics | Data Visualization

SAS ColombiaNovember 30, 2016 0

Transformación vía analítica: 4 cosas que deben trabajar los países

En todos lados se habla de lo mismo: transformación digital. Es generalizada la necesidad de hacer las cosas diferentes, ya sea por medio de una nueva apuesta o reinventando la forma en la que hoy se hacen las cosas. Tener acceso a soluciones en todas partes gracias a la Nube

Spanish

Analytics | Machine Learning

Fabian BuchertNovember 30, 2016 0

Ein Wahlkampf mit Trumpf – mit Textanalytics verstehen was eigentlich gemeint ist

Mal ehrlich, wenn ich Sie fragen würde, worüber die Kandidaten im diesjährigen US-Wahlkampf in ihren Aufeinandertreffen debattiert haben – welche Kernthemen würden Sie mir spontan (abseits von Skandalen und Affären) nennen? Und könnten Sie diese Kernthemen den einzelnen Kandidaten zuordnen? Als ich mir diese Frage stellte, war die Antwort –

German

Previous 1 … 428 429 430 431 432 … 748 Next

Blogs

All Posts