GCP Professional Data Engineer Practice Question

Your e-commerce company ingests tens of thousands of click events per second into Pub/Sub. Data engineers must build a pipeline that consumes the stream in real time, performs sliding-window aggregations, and writes the results to BigQuery. When business logic changes, the same code must be rerun in batch mode to reprocess a full day of raw event files stored in Cloud Storage. The team wants a fully managed, auto-scaling service that lets them implement the pipeline once in Python without creating or managing clusters. Which Google Cloud service best satisfies these requirements?

  • Managed Spark Structured Streaming jobs on Cloud Dataproc clusters

  • A Cloud Composer DAG orchestrating BigQuery SQL transformation jobs

  • Cloud Dataflow with the Apache Beam SDK

  • A Cloud Data Fusion pipeline triggered by Pub/Sub and writing to BigQuery
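
For readers unfamiliar with the term "sliding-window aggregation" used in the question: a sliding window of a given size that advances by a shorter period places each event in several overlapping windows. The following is a minimal plain-Python sketch of that idea only. It does not use the API of any Google Cloud service, and the function names, the 60-second size, the 30-second period, and the sample events are all hypothetical.

```python
from collections import defaultdict


def sliding_windows(ts, size, period):
    """Return the start times of every window [start, start + size) that contains ts.

    Windows start at multiples of `period` (an assumption made for this sketch).
    """
    start = ts - (ts % period)  # latest window start at or before ts
    starts = []
    while start > ts - size:    # window still covers ts
        starts.append(start)
        start -= period
    return sorted(starts)


def count_clicks(events, size=60, period=30):
    """Count events per (window_start, page) across overlapping windows."""
    counts = defaultdict(int)
    for ts, page in events:
        for start in sliding_windows(ts, size, period):
            counts[(start, page)] += 1
    return dict(counts)


# Hypothetical click events: (timestamp in seconds, page)
events = [(5, "/home"), (35, "/home"), (40, "/cart"), (65, "/home")]
result = count_clicks(events)
```

With a 60-second window advancing every 30 seconds, each event lands in two windows, so the click at second 35 is counted in both the window starting at 0 and the window starting at 30.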

GCP Professional Data Engineer
Ingesting and processing the data