🔥 40% Off Crucial Exams Memberships — Deal ends today!

44 minutes, 43 seconds remaining!

GCP Professional Data Engineer Practice Question

Your media company ingests millions of click events per second into Pub/Sub. Design a pipeline that:

  1. Groups events into user sessions separated by 30 minutes of inactivity and writes the enriched session records to BigQuery with sub-minute latency.
  2. After each UTC day ends, executes a dependent SQL job that aggregates the previous day's sessions into a summary table. You want minimal infrastructure management and a clear separation between the real-time transformation and the daily batch step. Which solution meets these requirements?
  • Use Cloud Data Fusion in batch mode to read from Pub/Sub, apply sessionization with Wrangler transformations, and load sessions into Cloud SQL; call a Workflows orchestration each night to aggregate into BigQuery.

  • Invoke Cloud Functions for every Pub/Sub event to write raw clicks to Bigtable; configure a daily BigQuery Data Transfer from Bigtable at 00:00 UTC and rely on materialized views for real-time sessionization.

  • Create a streaming Apache Beam pipeline on Dataflow that applies 30-minute session windows and writes to BigQuery via the Storage Write API; orchestrate a Cloud Composer DAG that, once the Dataflow watermark passes midnight UTC, triggers a BigQuery SQL job to build the daily summary table.

  • Run a long-lived Spark Streaming job on Dataproc to perform sessionization and write results to Cloud Storage; use Cloud Scheduler to invoke a Cloud Function at 00:10 UTC that runs a BigQuery aggregation query.

GCP Professional Data Engineer
Ingesting and processing the data
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot