GCP Professional Data Engineer Practice Question

Your team stores 2 TB of clickstream data in a BigQuery table partitioned on the event_date column. Analysts frequently query the most recent 90 days for a single user_id, yet each query still scans entire daily partitions, driving up on-demand query costs. You need to reduce the bytes read without changing existing SQL, scheduling extra maintenance jobs, or incurring additional storage cost. Which approach best meets the requirement?

  • Add a clustering specification on user_id for the existing partitioned table.

  • Export the last 90 days of data to Cloud Storage and query it through external tables.

  • Create materialized daily tables per user_id and have analysts query only their specific table.

  • Re-partition the table using an integer-range partition on user_id instead of event_date.

GCP Professional Data Engineer
Storing the data
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot