Tag: StrataSJC

Data for Good
Jill Dyché
Big data, one dog at a time

A few years ago, at the height of my workaholism, I took up a hobby. I go to sketchy neighborhoods around L.A. and hang out with dogs I don’t know. I have a long history of adopting and fostering shelter dogs, often getting them out on their “euth dates.” With

Data Management
Bill Davis
MapReduce vs. Apache Spark vs. SQL: Your questions answered here and at #StrataHadoop

As the big data era continues to evolve, Hadoop remains the workhorse for distributed computing environments. MapReduce has been the dominant workload in Hadoop, but Spark, thanks to its superior in-memory performance, is seeing rapid adoption. As the Hadoop ecosystem matures, users need the flexibility to use either traditional MapReduce