Blend, cleanse and prepare data for analytics, reporting or data modernization efforts
@philsimon on what we can learn about data quality from Jeff Bezos's behemoth.
As the big data era evolves, Hadoop remains the workhorse for distributed computing environments. MapReduce has been the dominant workload in Hadoop, but Spark -- due to its superior in-memory performance -- is rapidly gaining adoption. As the Hadoop ecosystem matures, users need the flexibility to use either traditional MapReduce
At a recent TDWI conference, I was strolling the exhibition floor when I noticed an interesting phenomenon. A surprising percentage of the exhibiting vendors fell into one of two product categories. One group was selling cloud-based or hosted data warehousing and/or analytics services. The other group was selling data integration products. Of