Your company is migrating a 10-year-old snowflake-style data warehouse to BigQuery. Analysts run ad-hoc dashboards that typically aggregate daily sales by customer attributes and product hierarchy. Query latency must be under two seconds without maintaining complex BI caches, and data is loaded hourly from Cloud Storage. Which data modeling approach in BigQuery best satisfies the latency objective while keeping ETL logic and operating costs low?
Retain the existing schema unchanged and enable BigQuery BI Engine to cache dashboard queries, avoiding any changes to the underlying tables.
Maintain a star schema but create materialized views that pre-join the fact and dimension tables for every common aggregation used in dashboards.
Denormalize the warehouse by embedding dimension attributes as nested and repeated fields inside the fact table, producing one wide, partitioned table that analysts query directly.
Keep the existing snowflake schema and simply partition the fact table on the load-date column, relying on BigQuery's distributed joins for interactive performance.
BigQuery generally delivers its best interactive performance when queries avoid joins. Flattening the model by embedding the relatively small, slowly changing dimension records directly in the fact table as nested or repeated STRUCT fields eliminates the joins that each dashboard query would otherwise execute. Because the data arrives hourly, the ETL pipeline can populate a single wide, partitioned table without additional caching layers or ongoing materialized-view maintenance. The other options fall short: relying on distributed joins, BI Engine, or numerous pre-joined materialized views would still introduce join or refresh overhead and higher cost, so they do not meet the stated latency and maintenance goals as effectively.
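The flattening step described above can be sketched in a few lines of Python. This is a minimal illustration, not a production pipeline: the dimension lookups, column names, and sample values are all hypothetical. It shows how an hourly ETL job might embed customer and product attributes in each sales record as nested objects and write them out as newline-delimited JSON, a format BigQuery load jobs accept, with nested objects mapping to RECORD (STRUCT) columns.

```python
import json

# Hypothetical dimension lookups keyed by surrogate key (illustrative data only).
customers = {
    101: {"segment": "retail", "region": "EMEA"},
}
products = {
    7: {"category": "audio", "subcategory": "headphones"},
}

# Hypothetical fact rows as they might arrive in an hourly file.
sales = [
    {"sale_date": "2024-01-15", "customer_id": 101, "product_id": 7, "amount": 59.0},
]


def denormalize(sale, customers, products):
    """Return one wide record with dimension attributes embedded as nested objects."""
    return {
        "sale_date": sale["sale_date"],  # candidate partitioning column
        "amount": sale["amount"],
        "customer": {"id": sale["customer_id"], **customers[sale["customer_id"]]},
        "product": {"id": sale["product_id"], **products[sale["product_id"]]},
    }


def to_ndjson(rows):
    """Serialize records as newline-delimited JSON, one record per line."""
    return "\n".join(json.dumps(row, sort_keys=True) for row in rows)


wide_rows = [denormalize(sale, customers, products) for sale in sales]
ndjson = to_ndjson(wide_rows)
```

With records shaped this way, a dashboard query can group by fields such as `customer.region` or `product.category` directly on the wide table, with no join against separate dimension tables.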
GCP Professional Data Engineer
Storing the data