20 encounters of the information management kind – calculating in reports or ETL

1

I am constantly faced with decisions on whether to do calculations in reports or store the calculated value in the data warehouse. So, I thought, why not share these thoughts with you?

Pros and cons:

  1. If I calculate a value in the report, I could be putting undue stress and complexity on my reporting software and hardware platform. I also have to configure the report to do this process.
  2. If I pre-calculate and store the value in the data warehouse, the ETL (extract, transform, load) must work a bit harder, and carry the burden of data validation and profiling with our data quality toolset.
  3. Changing the calculation in the report is probably easier than changing the ETL.
  4. ETL is meant to work hard with our processes and can usually capture metadata.
  5. ETL can interact with data quality processes and capture any data anomalies.

How I decide:

  1. If the calculated value is used by any other corporate analysis or reporting tool, then I need to pre-calculate and store in the table.
  2. If the calculated value requires input from the business user through prompts, then I usually calculate in the report.

No matter which way you choose, you must create metadata to capture this information for future use. I like pre-calculating in the table because I can capture that information in the data model. I also like making the ETL the workhorse as it interacts with my data quality software and captures all that metadata.

What is your preference?

Tags ETL
Share

About Author

Joyce Norris-Montanari

President of DBTech Solutions, Inc

Joyce Norris-Montanari, CBIP-CDMP, is president of DBTech Solutions, Inc. Joyce advises clients on all aspects of architectural integration, business intelligence and data management. Joyce advises clients about technology, including tools like ETL, profiling, database, quality and metadata. Joyce speaks frequently at data warehouse conferences and is a contributor to several trade publications. She co-authored Data Warehousing and E-Business (Wiley & Sons) with William H. Inmon and others. Joyce has managed and implemented data integrations, data warehouses and operational data stores in industries like education, pharmaceutical, restaurants, telecommunications, government, health care, financial, oil and gas, insurance, research and development and retail. She can be reached at jmontanari@earthlink.net.

1 Comment

  1. My preference is to pre-calculate and store in the table. Primarily, to reduce any ambiguity if multiple versions of the calculation are created by multiple users in various reports. It provides more of a single source of truth, especially when stored with associated business metadata.

Leave A Reply

Back to Top