Uncategorized

Data Management
David Loshin
Big data quality with continuations

I've been doing some investigation into Apache Spark, and I'm particularly intrigued by the concept of the resilient distributed dataset, or RDD. According to the Apache Spark website, an RDD is “a fault-tolerant collection of elements that can be operated on in parallel.” Two aspects of the RDD are particularly

Data Management
David Loshin
Planning the strategic data road map

One of the challenges my clients struggle with is figuring out how to execute against a proposed data strategy. The visionaries are always happy to participate in the process of assessing the current state and proposing a vision for the future. And adding business justifications and quantifiable metrics for success
