🔥 40% Off Crucial Exams Memberships — Deal ends today!

11 minutes, 57 seconds remaining!

GCP Professional Data Engineer Practice Question

Your team supports a Looker Studio dashboard that reads from the 2-TB BigQuery table retail.fact_sales, which is partitioned by the sale_timestamp column (type: DATE). In the dashboard's SQL you see the predicate WHERE DATE(sale_timestamp) BETWEEN DATE_SUB(CURRENT_DATE(), INTERVAL 7 DAY) AND CURRENT_DATE(). In Query Execution Details, the scan stage reads 65 GB, far more than the ~5 GB stored in the last seven partitions, making every chart slow. Without changing the dashboard's filters or buying more slots, which action is most likely to reduce the scanned bytes and speed up the queries?

  • Enable automatic query caching so repeated dashboard queries are served entirely from cached results instead of scanning the table.

  • Convert fact_sales from a partitioned table to a table clustered on sale_timestamp, because clustering is more efficient than partitioning for date filters.

  • Purchase additional BigQuery slots so the query can run with more parallel workers and finish sooner despite the larger scan.

  • Rewrite the filter to compare the sale_timestamp column directly to DATE_SUB(CURRENT_DATE(), INTERVAL 7 DAY) and CURRENT_DATE(), eliminating the DATE() wrapper so partition pruning can occur.

GCP Professional Data Engineer
Preparing and using data for analysis
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot