The other day I was in a meeting with a client and there was an argument about who owns the data. Those arguing were IT people. In this scenario, the assumption was that data from source systems would flow into and integrate with a data warehouse.
I found the discussion very interesting. Here are some of the assumptions I noted during the debate, followed by a few observations.
- IT is the group that extracts or subscribes to the data from the source systems. IT also designs the extract process and the target data stores.
- IT developers are the ones who integrate, merge, check quality and propagate the data to the layers in the data warehouse.
- IT and infrastructure set up the development, test and production platforms.
- IT, with business consumers, set up the testing scenarios.
- Production support (based on IT specifications) sets up and meets any security requirements for the data.
- IT manages the quality of the data from the source systems.
- Assumption is there's a process in place to continually correct the quality of the source system data – that must be an ongoing task. Data does NOT get better unless we help it.
- IT is responsible for the consumption and use of technical and business metadata.
- From the IT perspective, they own almost every aspect of data movement, integration and set up.
- Funding, in many companies, is by project and not necessarily set up for ongoing maintenance of the data or the tasks associated with monitoring.
- IT usually gets blamed if the data is not exactly the way the business users want to see and use it.
- Why are business consumers not involved in the entire process? Soup to Nuts!
Find out how to get the right people on your big data bus.