20 encounters of the information management kind – highly summarized data warehouse tables vs. detailed records

0

Design of the data warehouse for staging, the lowest level of granularity (history), and the data marts has to be tuned based on how these levels are used. Let’s evaluate some characteristics about each level.

STAGING – Staging areas are used as a place to land data for propagation and integration into the data warehouse. Sometimes we just land the data and integrate on the way to the data warehouse, and sometimes we integrate within the staging area. My preference is to integrate and check the data quality on the way to the data warehouse. Tuning in this area requires us to understand the ETL integration process and index accordingly. We can also consider creating indexes JUST FOR A PROCESS and then drop them. Some staging areas have persistent data.

DATA WAREHOUSE – The data warehouse has integrated data, modeled as close to third normal form as efficiency will allow. History is prevalent in the data warehouse. Tuning in the data warehouse requires us to build and drop indexes for propagating information to the data warehouse.

DATA MARTS – I like to create low-level detail star schemas and highly summarized tables to be used with dashboards. This would require the dashboard technology to allow us to drill into another table for the detail. Tuning in the data mart requires us to index based on how the data is queried. With so many indexes in the data marts area, truncating and reloading data is difficult for most database management systems.

I create highly summarized tables when the requirements dictate fast response.

Share

About Author

Joyce Norris-Montanari

President of DBTech Solutions, Inc

Joyce Norris-Montanari, CBIP-CDMP, is president of DBTech Solutions, Inc. Joyce advises clients on all aspects of architectural integration, business intelligence and data management. Joyce advises clients about technology, including tools like ETL, profiling, database, quality and metadata. Joyce speaks frequently at data warehouse conferences and is a contributor to several trade publications. She co-authored Data Warehousing and E-Business (Wiley & Sons) with William H. Inmon and others. Joyce has managed and implemented data integrations, data warehouses and operational data stores in industries like education, pharmaceutical, restaurants, telecommunications, government, health care, financial, oil and gas, insurance, research and development and retail. She can be reached at jmontanari@earthlink.net.

Leave A Reply

Back to Top