Adoption of Hadoop, a low-cost open source platform used for processing and storing massive amounts of data, has exploded by almost 60 percent in the last two years alone according to Gartner. One primary use case for Hadoop is as a data lake – a vast store of raw, minimally processed data. But, in many ways, because
Tag: Hadoop
SAS and Hadoop: More and more companies are considering Hadoop as a framework for the distributed storage and processing of large volumes of data. The platform is particularly well suited to storing unstructured or semi-structured data such as social media, blogs, forums, online shops, or machine-generated sensor data. Naturally, there is often a desire,
Big data is like a bottomless pit. Once you start to engage with it, a never-ending chain of follow-on topics trails behind. In a positive sense! I want to shed light on how it interacts with the open-source technology Hadoop. Big data, as everyone knows, also needs big storage. That is the prerequisite for
The data lake is a great place to take a swim, but is the water clean? My colleague, Matthew Magne, compared big data to the Fire Swamp from The Princess Bride, and it can seem that foreboding. The questions we need to ask are: How was the data transformed and
In my last two posts, we concluded two things. First, because of the need for broadcasting data across the internal network to enable the complete execution of a JOIN query in Hadoop, there is a potential for performance degradation for JOINs on top of files distributed using HDFS. Second, there are
You have surely already heard about Hadoop and all its powerful capabilities; if not, this system is simply an open-source software framework that makes it possible to store and process large volumes of data in a distributed manner across a large number of hardware devices. In
In my last post, I pointed out that an uninformed approach to running queries on top of data stored in Hadoop HDFS may lead to unexpected performance degradation for reporting and analysis. The key issue had to do with JOINs in which all the records in one data set needed
Mobile World Congress is quickly approaching. Attendees and exhibitors are feverishly scheduling meetings, doing research, and determining their areas of focus to maximize their experience of the event. If you're hoping to learn more about big data analytics at the conference, here are some helpful insights and resources to help you
As the point person for SAS joining the new Open Data Platform (ODP) initiative, I want to make it clear why SAS is involved with ODP, and why we think it’s important to our customers, and the Hadoop and big data ecosystem as a whole. SAS is not in it to
Hadoop is increasingly being adopted as the go-to platform for large-scale data analytics. However, it is still not necessarily clear that Hadoop is always the optimal choice for traditional data warehousing for reporting and analysis, especially in its “out of the box” configuration. That is because Hadoop itself is not
Imagine choosing one application for Linux that worked on the version you currently use. You choose another program but find that it doesn’t work on that version of Linux. A third application? It works with another version of Linux. Luckily, that rarely happens. In 2001, the Linux Foundation established Linux Standard
Data Management has been the foundational building block supporting major business analytics initiatives from day one. Not only is it highly relevant, it is absolutely critical to the success of all business analytics projects. Emerging big data platforms such as Hadoop and in-memory databases are disrupting traditional data architecture in
This is the experience of Felix Liao, Data Management Solutions Manager at SAS for Australia and New Zealand. He recently traveled across Australia visiting customers to talk with them about trends in analytics and Hadoop. Besides being pleasantly surprised by
Between Christmas and New Year there is time for family, friends, and hobbies. That one of my hobbies is "TV series" may have something to do with the Christmas series that were popular in my childhood. (My favorite: Jack Holborn. Do you still remember the series?) But times change and we change with them: the ZDF production was yesterday; today, long live the
Getting universal buy-in for Hadoop needn’t be an uphill struggle. In many cases, it only takes one pilot project to realize the benefits of low-cost storage combined with powerful analytics. The Hadoop topic provoked passionate conversation at a recent roundtable discussion attended by over 25 people from a range
Everyone who uses the internet is also concerned with the question: "Is my data safe? Which personal data can I enter where?" With each newly disclosed violation of data protection and privacy, the level of uncertainty rises further. This applies to all industries, but especially
A Chief Information Officer has certain characteristics that make the role a unique one. However, owing to the challenges now involved in implementing advanced analytics projects in organizations, some myths have arisen around the role of IT. Keith Collins, Senior Vice President of
Earlier this week I managed to catch up briefly with Christoph Sporleder, Vice President Centers of Excellence for EMEA & Asia Pacific, to talk Hadoop, big data and get some of his views on where we might be headed with big data. Mark Torr: Is big data just a buzz
Data has value IF you can analyze it, said participants at a big data analytics roundtable at the Premier Business Leadership Series in Las Vegas. In attendance were executives from some of the largest communications companies in the world, including companies from the US, Canada, Turkey, Japan, Australia and the Philippines as well
Did you know: for 13% of car buyers, a new vehicle without internet access is a "no-go"! Thirteen percent! That also means 13% less revenue for the OEM. The management consultancy Bain expects these so-called connected cars to be the rule rather than the exception in just a few years. And connected cars are only the beginning: OEMs now face the challenge that their portfolio once again needs to be significantly
The global Hadoop market was valued at $1.5 billion in 2012 and is expected to grow at a compound annual growth rate of 58.2 percent, to reach $50.2 billion by 2020, according to a Hadoop Market Analysis report prepared by Allied Market Research. There is no doubt that IT teams are
I have been on a whirlwind tour here in Australia visiting existing SAS customers, where the focus of discussions has centered on SAS and Hadoop. I am happy to report that during these discussions, customers have been consistently surprised and excited about what we are doing around SAS on
Did you know: For 13 percent of car buyers a new vehicle without internet access is a no-go? Obviously, no-go means no-buy. Thirteen percent! If I have ever seen a market demand, it is this. For sure, the industry will respond to that. The management consulting company Bain even expects
All hail the data lake, destroyer of enterprise data warehouses and the solution to all our enterprise data access problems! Ok – well, maybe not. In part four of this series I want to talk about the confusion in the market I am seeing around the data lake phrase, including
In this post we dig deeper into the fourth recommended practice for securing the SAS-Hadoop environment through Kerberos authentication: When configuring SAS and Hadoop jointly in a high-performance environment, ensure that all SAS servers are recognized by Kerberos. Before explaining the complex steps in connecting to secure Hadoop within a
I recently caught up with Dr. Tom Davenport, analytics thought-leader and author of Big Data @ Work, in Dublin, where we talked about big data, the Internet of Things and Hadoop. I'll be sharing the conversation here with you in two parts. You'll find part one below, and you can check
Working out where Hadoop might fit alongside, or replace components of, existing IT architectures is a question on the mind of every organization that is being drawn towards the promises of Hadoop. That is the main focus of this blog along with discussions of some of the reasons they
In previous posts, we’ve shared the importance of understanding the fundamentals of Kerberos authentication and how we can simplify processes by placing SAS and Hadoop in the same realm. For SAS applications to interact with a secure Hadoop environment, we must address the third key practice: Ensure Kerberos prerequisites are met
SAS has been developing "secret sauce" technology for more than 38 years. Whether it has to do with being platform independent, processing in-database, running across a grid, or analyzing data in-memory like our SAS LASR Analytic Server or our High Performance Analytics offerings, secret sauce makes everything taste or, in
At most banks, data is stored in separate databases and data warehouses. Customer data is stored in marketing databases, fraud analyses are done on transactional data, and risk data is stored in risk data warehouses. Oftentimes even liquidity, credit, market, and operational risk data is stored separately as well. Bringing