GCP Professional Data Engineer Practice Question

Your retail analytics team must deliver a near-real-time dashboard that charts the rolling five-minute average number of product page clicks per product category. Clickstream events are published to Pub/Sub with an event-time attribute. Static product metadata is stored in a BigQuery dimension table that is refreshed once per hour. Aggregated results have to be queryable in BigQuery within one minute after every window closes, and the solution should minimise operational overhead. Which design should you implement to meet these requirements?

  • Deploy a streaming Dataflow pipeline that uses the BigQuery dimension table as an hourly refreshed side input to enrich the aggregates before writing them to BigQuery.

  • Stream events directly into BigQuery and rely on a materialized view to compute the rolling five-minute averages and join with the dimension table, eliminating the need for Dataflow.

  • Build a streaming Dataflow pipeline that reads from Pub/Sub, assigns event-time timestamps, applies a five-minute hopping window advancing every minute to compute category-level averages, and writes the aggregated results to BigQuery via streaming inserts; then expose a BigQuery view that joins this fact table with the product dimension table.

  • Schedule a Cloud Data Fusion batch ETL pipeline every five minutes that pulls recent events from Pub/Sub, performs the aggregation and join, and loads the result into BigQuery.
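
For readers unfamiliar with the term, the "five-minute hopping window advancing every minute" mentioned in one of the options can be illustrated with a small plain-Python sketch. This is not Dataflow or Apache Beam code; the function names `assign_windows` and `clicks_per_minute` are hypothetical, timestamps are assumed to be integer seconds of event time, and "rolling average" is assumed here to mean clicks per minute over each five-minute window.

```python
from collections import defaultdict

WINDOW_SIZE = 300   # window length in seconds (five minutes)
WINDOW_PERIOD = 60  # a new window starts every 60 seconds


def assign_windows(event_ts, size=WINDOW_SIZE, period=WINDOW_PERIOD):
    """Return every [start, end) window that contains event_ts."""
    # Latest window start at or before the event, aligned to the period.
    start = event_ts - (event_ts % period)
    windows = []
    # Walk backwards while the window still covers the event.
    while start > event_ts - size:
        windows.append((start, start + size))
        start -= period
    return windows


def clicks_per_minute(events, size=WINDOW_SIZE, period=WINDOW_PERIOD):
    """Average clicks per minute for each (window, category) pair.

    `events` is an iterable of (event_ts_seconds, category) tuples.
    """
    counts = defaultdict(int)
    for ts, category in events:
        # With a hopping window, one event contributes to several windows.
        for window in assign_windows(ts, size, period):
            counts[(window, category)] += 1
    minutes = size / 60
    return {key: n / minutes for key, n in counts.items()}


if __name__ == "__main__":
    sample = [(10, "shoes"), (70, "shoes"), (130, "hats")]
    for (window, category), avg in sorted(clicks_per_minute(sample).items()):
        print(window, category, avg)
```

Because the period (60 s) is shorter than the window size (300 s), each event lands in five overlapping windows, which is why a result becomes available every minute rather than every five minutes.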

Exam domain: Ingesting and processing the data