Is it wise to crowdsource data governance?


Data governance doesn't lack for definitions and never has. Here is one that's as good as any:

The process of creating and agreeing to standards and requirements for the collection, identification, storage and use of data.

Clocking in at 19 words, this definition works just fine. To boot, it certainly jibes with the efforts of most traditional data governance efforts. That is, it emphasizes an inward focus predicated on boards, councils and stewards.

Data governance entities have generally focused on questions such as:

  • Are we allowed to do [insert task]with our data?
  • Should we do [insert task]with our data?
  • What data should constitute master data?
  • How many and what data sources exist for each type of master data?
  • Who is allowed access rights to which data type? And what actions can these individuals and groups perform?

group of people represent crowdsourcing data governanceAre these still important questions? Sure, but as I wrote recently, organizations need to start viewing data governance from external points of view. To me, it's downright silly to ignore the benefits of thinking about data-related issues in different and unexpected ways.

Still, you can go too far with this type of expanded, externally based mind-set. Yes, there are those who advocate crowdsourcing data governance. (You read right.) Imagine an organization taking a Wikipedia-type approach to critical issues surrounding user, customer, product and employee data.

This to me sounds like anarchy. The idea seems terrible in theory and I've yet to read a single case study on its successful implementation. Organizations are ultimately responsible for what they do with enterprise data, not "the crowd." (Along these lines, there's a famous axiom in the open-source software world: "Think free speech, not free beer.")

Simon says: Find a middle ground with data governance.

To be sure, crowds can certainly improve data quality. Groups can spot errors that we often overlook. They ask different questions. By virtue of distance, they often avoid many of the biases that plague individual decision-making. Forget proper crowds, though. We individuals often stand to benefit from a single, separate pair of eyes. (Pair programming exists for a reason.)

But fully crowdsourcing something as essential as data governance? I just don't buy it. My biggest hangup with the concept: it necessitates a loss of control that could very well cripple organizations. Are individual users or groups likely to accept responsibility for high-profile privacy breaches? Not unless they are affiliated with Anonymous.

Maybe, like Amazon's infamous empty chair, organizations should formalize the presence of those groups historically not at the table.


What say you?

Download The Intersection of Big Data, Data Governance and MDM


About Author

Phil Simon

Author, Speaker, and Professor

Phil Simon is a keynote speaker and recognized technology expert. He is the award-winning author of eight management books, most recently Analytics: The Agile Way. His ninth will be Slack For Dummies (April, 2020, Wiley) He consults organizations on matters related to strategy, data, analytics, and technology. His contributions have appeared in The Harvard Business Review, CNN, Wired, The New York Times, and many other sites. He teaches information systems and analytics at Arizona State University's W. P. Carey School of Business.

Related Posts

Leave A Reply

Back to Top