Big data is like a bottomless pit. Once you start looking into it, it drags a never-ending trail of related topics behind it. In a good way! I would like to examine how it interacts with the open-source technology Hadoop. As everyone knows, big data also needs big storage. That is the prerequisite for
Tag: big data
.@philsimon looks under the hood of 'analytics.'
England’s shambolic early exit from the Cricket World Cup has stirred up a hornet’s nest about the team’s supposed over-reliance on data. In the aftermath of their defeat to Bangladesh, coach Peter Moores said: ‘We thought 275 (runs) was chaseable. We’ll have to look at the data.’ It prompted outrage from
The data lake is a great place to take a swim, but is the water clean? My colleague, Matthew Magne, compared big data to the Fire Swamp from The Princess Bride, and it can seem that foreboding. The questions we need to ask are: How was the data transformed and
The Internet of Things is coming fast and furious. We clearly know what these “things” are, and were able to see prototypes at last week’s Mobile World Congress (MWC), which hosted some 93,000 attendees. Things = connected life = cars, homes (thermostats, washers and dryers, vacuum cleaners, security systems, refrigerators, etc.),
Innovation within hospitality drives awareness, service delivery, guest engagement, and brand differentiation. SAS asked a panel of experts to comment on how innovation is shaping the hospitality industry. According to many of our experts, analytics is at the heart of innovation. Learn more in this white paper on building
Today in manufacturing there has been a lot of investment in automation, supervisory controls, quality, and execution systems. The amount of data produced and now being captured is staggering. The data captured in industry will re-define what is “big” in big data. Yet, for all this investment: Equipment still fails. Scrap
In my last two posts, we concluded two things. First, because of the need for broadcasting data across the internal network to enable the complete execution of a JOIN query in Hadoop, there is a potential for performance degradation for JOINs on top of files distributed using HDFS. Second, there are
You have probably already heard about Hadoop and all of its powerful capabilities. If not, this system is simply an open-source software framework that makes it possible to store and process large volumes of data in a distributed fashion across a large number of hardware devices. In
In The Princess Bride, one of my favorite movies, our hero Westley – in an attempt to save his love, Buttercup – has to navigate the Fire Swamp. There, Westley and Buttercup encounter fire spouts, quicksand and the dreaded Rodents of Unusual Size (R.O.U.S.'s). Each time he has a response to the
Warranties have a long and, some might say, interesting past. But the future is even brighter. New technologies and data sources are transforming our understanding of field quality, enabling deeper insights into product performance and customer preferences. These breakthroughs are accelerating the quest to reduce defects and satisfy customers.
In my last post, I pointed out that an uninformed approach to running queries on top of data stored in Hadoop HDFS may lead to unexpected performance degradation for reporting and analysis. The key issue had to do with JOINs in which all the records in one data set needed
Financial institutions are mired in large pools of historic data across multiple lines of business and systems. However, much of the recent data is being produced externally and is isolated from the decision-making and operational banking processes. The limitations of existing banking systems combined with inward-looking and confined data practices
Small data is akin to algebra; big data is like calculus.
From the pressures of a highly competitive marketplace to changing economic conditions to the evolution of the distribution network, the challenges facing the hospitality industry are many and varied. In this video, SAS asked a panel of experts to share their views on the issues that will challenge the hospitality
The electoral battlespace for the upcoming general election in the United Kingdom is starting to take shape. Campaigners are busily debating the political landscape. They want to own the high ground that dominates areas that matter most to voters – the NHS and the economy. With an ageing population and
In the movie Big, a 12-year-old boy, after being embarrassed in front of an older girl he was trying to impress by being told he was too short for a carnival ride, puts a coin into an antique arcade fortune teller machine called Zoltar Speaks, makes a wish to be big,
Despite an increase in the availability of data in the federal government over the past few years, data and analytics could be doing even more for federal agencies. A strategic approach to managing and analyzing the data is needed. And, like many technology challenges – that’s a people problem. A
As the point person for SAS joining the new Open Data Platform (ODP) initiative, I want to make it clear why SAS is involved with ODP, and why we think it’s important to our customers, and the Hadoop and big data ecosystem as a whole. SAS is not in it to
Hadoop is increasingly being adopted as the go-to platform for large-scale data analytics. However, it is still not necessarily clear that Hadoop is always the optimal choice for traditional data warehousing for reporting and analysis, especially in its “out of the box” configuration. That is because Hadoop itself is not
When asked what his movement wanted around a century ago, the iconic American labor leader Samuel Gompers famously gave a one-word answer: "More." This annoyed his opponents at the negotiating table and many in the business community. He was not demanding a specific wage increase or fighting for a distinct cause like
Omnichannel, Internet of Things and customer loyalty were just three of the terms you heard over and over again on the conference floor and in presentations at retail's biggest conference last month. If you had to miss the Retail Big Show in New York City, the article "Retail's Omnichannel, Data-Driven Revolution is
Katherine Grainger paid a heartfelt tribute to the support staff who have been a pivotal part of her success in her opening speech ahead of last month’s World Sport Science and Medicine conference. The event was hosted in the UK jointly by British Rowing and FISA, the international rowing federation. “My career
Data Management has been the foundational building block supporting major business analytics initiatives from day one. Not only is it highly relevant, it is absolutely critical to the success of all business analytics projects. Emerging big data platforms such as Hadoop and in-memory databases are disrupting traditional data architecture in
For hotel companies, it is challenging to find new ways to differentiate in an ever-evolving marketplace. There is a lot of talk in our industry about the increasing number of third-party channels and distributors that have entered the marketplace, and how that impacts the hotel company’s core business.
Business analytics is about dramatically improving the way an organization makes decisions, conducts business and successfully competes in the marketplace. At the heart of business analytics is data. Historically, the philosophy of many insurers has been to collect data, data and more data. However, even with all this data, many
It is well known that big data falls under the broad umbrella of data governance and is a critical component of an IT strategy. Trust and security are extremely important, since organizations' private data is constantly at risk of falling into
In this blog series, I am exploring if it’s wise to crowdsource data improvement, and if the power of the crowd can enable organizations to incorporate better enterprise data quality practices. In Part 1, I provided a high-level definition of crowdsourcing and explained that while it can be applied to a wide range of projects
Staying competitive in a big data world means working fast and making decisions even faster. You need to assess conditions, approve access, stop transactions and reroute activities quickly so you can seize opportunities or prevent problems. With increasing data volumes from the Internet of Things (Cisco predicts that fifty billion