Tag: Apache Hadoop
.@philsimon provides insights on whether a data strategy can result in competitive advantage.
Bigger doesn’t always mean better. And that’s often the case with big data. Your data quality (DQ) problem – no denial, please – often only magnifies when you get bigger data sets. Having more unstructured data adds another level of complexity. The need for data quality on Hadoop is shown by user
Small data is akin to algebra; big data is like calculus.
When asked what his movement wanted around a century ago, the iconic American labor leader Samuel Gompers famously gave a one-word answer: "More." This annoyed his opponents at the negotiating table and many in the business community. He was not demanding a specific wage increase or fighting for a distinct cause like
Even though it sounds like something you hear on a Montessori school playground, this theme “Share your cluster” echoes across many modern Apache Hadoop deployments. Data architects are plotting to assemble all their big data in one system – something that is now achievable thanks to the economics of modern
For some odd reason, the open-source Apache Hadoop ecosystem consists of cleverly named components that seem to have escaped from the Central Park Zoo. You may be aware that the little yellow elephant named Hadoop was actually a stuffed toy that Doug Cutting’s son owned. (Doug is the co-creator of