On the way to data lakes and Hadoop – Part 1
Jim Harris advocates addressing data quality and governance issues on the way to data lakes and Hadoop.
Historically, before data was managed it was moved to a central location. For a long time that central location was the staging area for an enterprise data warehouse (EDW). While EDWs and their staging areas are still in use – especially for structured, transactional and internally generated data – big
Start with the end in mind: wise words that apply to everything. In the world of big data, they mean we have to change the way we look at managing the data we have. There was a time when we managed data quality, and the main goal was