In the first installment of this series on Hadoop, I shared a little of Hadoop's genesis, framing it within four phases of connectivity that we are moving through. I also stated my belief that Hadoop has already arrived in the mainstream, and we are currently moving from phases three of connecting people to phase four
Tag: Hadoop
So, you've heard the Hadoop hype and you are looking – or have already invested – into Hadoop. Maybe you have also realized some benefits from the Hadoop ecosystem. But now you want to maximize those benefits by using advanced analytics, or you might have heard about algorithms or machine learning libraries available
So, with the simple introduction in Understanding Hadoop security, configuring Kerberos with Hadoop alone looks relatively straightforward. Your Hadoop environment sits in isolation within a separate, independent Kerberos realm with its own Kerberos Key Distribution Center. End users can happily type commands as they log into a machine hosting the
The panel moderator looks out over the audience. It’s a large crowd. For the first time ever, Big Data, Hadoop, and the Internet of Things are appearing on stage together. The conversation has just begun, so let’s listen in for a minute. Big Data: “…and people have been trying to
In the world of IT, very few new technologies emerge that are not built on what came before, combined with a new, emerging need or idea. The history of Hadoop is no exception. To understand how Hadoop came to be, we therefore need to understand what went before Hadoop that led to its creation. To understand
A challenge for you – do a Google search for “Hadoop Security” and see what types of results you get. You’ll find a number of vendor-specific pages talking about a range of projects and products attempting to address the issue of Hadoop security. What you’ll soon learn is that security
Scalability is the key objective of high-performance software solutions. “Scaling out” is a concept which is accomplished by throwing more server machines at a solution so that multiple processes can run in dedicated environments concurrently. This blog post will briefly touch on several scalability concepts that affect SAS.
Okay, let's say your data is in Hadoop. The distributed, open source framework is configured as it should be across low-cost servers and your data is sitting in those clusters. It's been a meaningful effort to get to this point but how does it benefit your organization? If it's not doing something
Sie kennen den kleinen gelben Elefanten schon? Hadoop verändert gerade die Welt – zumindest in der IT. Es gibt Experten, die prophezeien, dass bereits in den nächsten drei Jahren mehr als die Hälfte aller Daten der Welt in Hadoop gespeichert werden. Fakt ist: Bereits heute liegen die durchschnittlichen Kosten pro
For Hadoop to be successful as part of the modern data architecture, it needs to integrate with existing tools. This integration allows you to reuse existing resources (licenses and personnel) and is typically 60% of the evaluation criteria for integration of Hadoop into the data center. One of the most
Even though it sounds like something you hear on a Montessori school playground, this theme “Share your cluster” echoes across many modern Apache Hadoop deployments. Data architects are plotting to assemble all their big data in one system – something that is now achievable thanks to the economics of modern
Perhaps you're a big data expert who is fluent in Pig, Hive, MapR and all the technologies associated with the open source big data framework. Or maybe your role hasn't yet been touched by the increasingly popular Hadoop system. It's certainly worth a few minutes of your time (or perhaps
SAS In-Memory Statistics for Hadoop is a single interactive programming environment for analytics on Hadoop that integrates analytical data preparation, exploration, modeling and deployment. It’s principle components are the IMSTAT procedure (PROC IMSTAT) and the SAS LASR Analytic Engine (or SASIOLA engine for input-output with LASR). Within the SAS In-Memory Statistics
We asked Lars George, EMEA Chief Architect at Cloudera, to share his opinion about Hadoop, Big Data and future market trends in Business Analytics. For all those who want to know more about Hadoop we recomment this TDWI whitepaper and how to apply Big Data Analytics. The last few years
Da war doch mal was, Sie erinnern sich, Hoodiejournalismus?! In dieser Diskussion über Digital gegen Print, jung gegen alteingessen, #hoodie vs. #schlipsy, über was ist Premium oder was ist hautnah dabei, über was erscheint modern, zeitgemäß und innovativ oder was bezahlt die Miete am Ende des Monats, ist ein Punkt
It was just a couple of years ago that folks were skeptical about the term "data scientist". It seemed like a simple re-branding of an established job role that carried titles such as "business analyst", "data manager", or "reporting specialist". But today, it seems that the definition of the "Data
El Big Data y la Nube se están volviendo inseparables: “se necesitan recursos en la nube para el almacenamiento y la ejecución de proyectos de big data, y el big data brinda a las compañías una buena ocasión de pasar a la nube”. Podríamos decir que el big data y
Are you wondering what to do next with your analytics program? The latest issue of sascom magazine provides a handy guide. Check it out to get help checking off must-do items like these: Establish an analytics center of excellence. Find out how SunTrust centralized all of the bank’s analytics teams –
Demand for analytics is at an all-time high. Monster.com has rated SAS as the number one skill to have to increase your salary and Harvard Business Review continues to highlight why the data scientist is the sexiest job of the 21st century. It is clear that if you want to be
Gerne erinnere ich mich noch an die Zeit, als ich mit dem Begriff Memory zunächst nur ein einfaches Kartenspiel assoziert habe. Diese frühe Form des "Gehirnjoggings" hat mich übrigens nie besonders lange begeistert. Ähnlich begrenzt wie meine Geduld damals sind heute viele Hardware-Systeme in Bezug auf die wachsenden Anforderungen. Zwar wächst
I was recently part of team discussing enterprise architecture with a chief IT architect, and we were explaining how SAS can integrate into their existing infrastructure, add business value on top it and even fit into their future planned infrastructure. This conversation was one of the reasons I blogged about
Interest in "data" is at an all-time high. The popularity of search terms like "big data," "Hadoop" and the "Internet of Things" spiked dramatically in the past year. The fact is, organizations are more interested in the potential of big data platforms and data management solutions than ever before. That’s
The "Internet of Things" is the latest buzzword characterizing the machine-generated big data that has outstripped our ability to derive value from it. Think of UPS delivering 16 million packages every day through various hubs and all the logistics and decisioning that goes into that. But how does an organization
You may still believe that Hadoop is going to solve all of the world’s problems with big data. It won’t. Hadoop is a framework for storing large-scale data processing with both pros and cons for organizations. Christopher Stevens, from Greenplum, explained that Hadoop is rapidly becoming the go-to for big
Nancy Rausch, from SAS R&D, is driving a short demonstration of how to access Hadoop via SAS Data Integration Studio. Take a look. You're probably going to want to take a look at this paper, too: What's new in SAS Data Management?
sasglobalforum2012 on livestream.com. Broadcast Live Free New this year to SAS Global Forum are Tech Talks. In this session, Chris Hemedinger is chatting with: High-Performance Data Mining Jared Dean, Director of SAS Enterprise Miner R&D Text Analytics and Sentiment Analysis: Case study of AllAnalytics.com Jim Cox, Senior Manager of
Many companies are challenged not only with analyzing big data, but with storing and accessing the data. In some cases, organizations can choose an open source storage solution to reduce costs. One popular open source solution is Hadoop. Anna Brown is talking with Paul Kent, Vice President Big Data at SAS,