Why build models?

2

why2We are all modelers.  Whenever you plan, you are building a model.  Whenever you imagine, you are building a model. When you create, write, paint or speak, you first build in your head a model of what you want to accomplish, and then fill in the details with words, movements or other actions in order to realize that model.

Models work via a three-part structure:  Input, Mechanism, and Output.  When we use models, we are generally confident in only two of the three stages, and we use the process to determine the unknown stage.

The most familiar construction is where we know, or have confidence in, our inputs and our mechanism (the mechanism being the rules or algorithm that generates output from inputs), which we call “prediction”.  We use this structure to predict, or forecast, a wide variety of outputs, from tomorrow’s weather to next month’s sales to next year’s election.

But prediction is not the only available model structure.  Operating in reverse, when we are confident in our outputs and our mechanism, but not our inputs, is “retro-diction” or “post-diction”.    Instead of asking ‘what will happen?’, this process asks, ‘what sort of inputs could have generated the observed output?’  We use this formulation regularly when we do root cause or failure analysis.  In the sciences this structure is commonly employed to discover the possible causes of diseases, or to understand the conditions that might have led to the last ice age or extinction event.

The final model configuration, where we have confidence in both our inputs and outputs, but not in our intervening mechanism, is called “explanation”.  What possible mechanism could get us from state A to state B?  In the sciences we are always asking what-if questions probing into the structure of plausible mechanisms.

“Explanation” is the model structure we generally have in mind when we ask “why”?  Why is the sky blue? (what is the mechanism by which sunlight is differentially scattered?).  Why are only certain people resistant to a particular disease?  In the social sciences we might ask what mechanisms impact scholastic achievement?  In business we might first ask why a certain demographic has different commercial outcomes from another, so that we can later use that more accurate mechanism for predictive purposes.

Modeling

Each of these modes, however, has its limitations.  Forecasts are only as good as the model they are built on. If you are a Weather Channel junkie you know that they use several different models to predict the paths and severity of winter storms and summer hurricanes, knowing that some models will work better than others in certain situations. And your forecast is of course dependent on the quality of the input data used.  The field of chaos theory famously grew out of an unintended truncation error in an early weather forecasting model, where dropping the third digit after the decimal point resulted in drastically different results in a matter of just a few days of forecast simulation.

I have already hinted at the primary limitations surrounding retro-diction and explanation – that there can be, and often are, more than one set of inputs, or more than one model, that could give rise to any specified output.  The difference between “necessary” and “sufficient”.

Is that all you can do with models?  Far from it.  There are many other valuable consequences of modeling beyond the Big Three:

  • Models can guide data selection, letting you know what data to collect, what data is important to your project.
  • Models can suggest useful analogies between and among disparate domains. John von Neumann, the polymath who was at the heart of everything important going on in physics, computer science and mathematics during the 1940s and 50s, was known to memorize those formulas at the end of textbooks, because he found that relationships (i.e. explanatory models) in one field often held the answer to a conundrum in another.
  • Discover new questions to ask.
  • Demonstrate trade-offs or suggest efficiencies. Building a model of how you do things today can often illuminate how and where to make improvements.

Most importantly, models can be the key to developing an analytic, data-driven culture.  When people see that their assumptions must be made explicit and will be questioned, tested and challenged, you are taking a necessary step in instilling the discipline required to become both a more agile and more creative organization.  It’s not any particular model or technique, it’s about the entire modeling enterprise and the analytic habits it encourages.

While it is true that we are all modelers, we no longer have to go it alone.  We have at our disposal a set of powerful analytic tools to augment the model building of our minds, from data quality to data transformation and management, to rapid model prototyping and deployment, to the visualization of output for insight.  Every part of the process but one can now reside outside the confines of our limited cranial capacity – the analytic, model-building discipline, and the imagination to apply it, is ours alone to nourish and grow.

Share

About Author

Leo Sadovy

Marketing Director

Leo Sadovy currently manages the Analytics Thought Leadership Program at SAS, enabling SAS’ thought leaders in being a catalyst for conversation and in sharing a vision and opinions that matter via excellence in storytelling that address our clients’ business issues. Previously at SAS Leo handled marketing for Analytic Business Solutions such as performance management, manufacturing and supply chain. Before joining SAS, he spent seven years as Vice-President of Finance for a North American division of Fujitsu, managing a team focused on commercial operations, alliance partnerships, and strategic planning. Prior to Fujitsu, Leo was with Digital Equipment Corporation for eight years in financial management and sales. He started his management career in laser optics fabrication for Spectra-Physics and later moved into a finance position at the General Dynamics F-16 fighter plant in Fort Worth, Texas. He has a Masters in Analytics, an MBA in Finance, a Bachelor’s in Marketing, and is a SAS Certified Data Scientist and Certified AI and Machine Learning Professional. He and his wife Ellen live in North Carolina with their engineering graduate children, and among his unique life experiences he can count a singing performance at Carnegie Hall.

Back to Top