This post is a nod to one of my favourite plays, The Importance of Being Earnest by Oscar Wilde. As the title ‘Data Scientist’ becomes more common, what can we gain about the importance of titles and labelling from this century old play?
For those that haven't read it, the story revolves around a man who goes by the name Earnest and has a reputation for being earnest (i.e. truthful and trustworthy). He is much loved by a lady for having the name Earnest - she has always wanted to marry a man with that name, believing men named Earnest are earnest (deep breath).
Well it turns out his name isn't really Earnest (irony 1 - he's not actually earnest despite his name) and the lady considers dumping him, but by a comic twist of fate it turns out that it actually is Earnest (irony 2 - he actually was earnest even though he didn’t think he was).
So what is the importance of being <insert a name or title>? As another wit once said "a rose by any other name would smell as sweet". But is that true? Our experiences have probably told us "No". Despite whatever skills we may have, a title comes with a reputation and expectations. Whether that’s someone named Earnest actually being earnest … or a Data Scientist being a magician with big data.
There have been many attempts at explaining what a data scientist is since the term was first coined in 2008 – Wikipedia, HBR, KDnuggets, Marketing Distillery – but the general definition is someone who encompasses equally high skill levels in:
a. Statistics.
b. “Hacker” programming.
c. Communication.
d. Business.
The number of people that actually satisfy this definition is a popular subject in discussion forums and papers, and it’s interesting to also ask from what perspective the attributes are judged (good communication skills from a marketer are expected to be different than good from a programmer). But what everyone agrees is that many Data Analysts, Data Miners, Statisticians, Econometricians and the myriad of other titles over the last 50 years, all have these attributes, but in varying proportions.
I've met many analytics practitioners over the years from different parts of the world. Some quantitative analysts have chosen to change their titles to Data Scientist to make them more attractive to employers as the Statistician and Data Miner titles go out of favour. Some, who are very close to the purist definition of data scientist, may not title themselves as such, adamantly sticking with the title they have had for many years.
On the other hand, in many cases, employers who advertise for Data Scientists are actually looking for:
- Quantitative analysts with innate curiosity to learn and innovate – a trait of most people from mathematics, sciences, engineering and economics disciplines.
- Candidates who meet some minimum criteria in the four attributes – many of which can be taught.
- Those who are strong in a subset of prioritized attributes to suit a function within a team.
Over the years, given the right drivers, these partially defined Data Scientists could become strictly defined Data Scientists – but in a collaborative team environment, you will likely find that having a whole team of these individuals is not important. The two realities are that there are far fewer examples of organizations looking for the latter than the former, and these tend to be for commercial research and development arms; and individuals that embody all the attributes of a Data Scientist in the “right” amounts are rare.
Therefore, for those looking to hire Data Scientists, my advice is:
- It’s much more important to first start with understanding the functions and expectations of the team within an organisation.
- Then create roles that fit the needs of that team and “be much more specific about the type of worker you want to be or hire” (Tom Davenport, Wall Street Journal).
- Be realistic of current skills in the market and tertiary education programs available.
For those looking for a role as a Data Scientist:
- Start developing the attributes you are weakest at through classroom and self-service training because all round skills are always sought after.
- Keep developing the attributes you already excel at as the big data analytics market is constantly evolving.
- Stay curious of new techniques and worldwide trends.
So, the importance of being a Data Scientist is to be more attractive to employers, but that what employers are usually looking for is some flavour thereof, rather than a strictly defined criteria. Even though a prospect may not be the purest definition of a Data Scientist, he or she may turn out to be just what an organization needs. Therefore, be sure of what you want to be and what you’re looking for and don't judge prospects and opportunities based on a title… lest you end up dumping your Earnest before he becomes an Earnest!
Learn more. Stay curious.
http://support.sas.com/training/index.html
http://www.sas.com/en_us/news/sascom/2012q4/data-scientist.html
2 Comments
Great post from Anne on the importance of skills and not title when it comes to being a practitioner of data science!
Thanks Felix. Most people develop skills over a number of years, generally through necessity (personal, professional, market drivers). No matter how much we feel skilled-up at one point in time, necessities change over the years and we end up prioritizing our efforts - this is why it's very hard to find individuals with all the required skills.