🔥 40% Off Crucial Exams Memberships — Deal ends today!

11 minutes, 58 seconds remaining!

GCP Professional Data Engineer Practice Question

Your team must plan a pipeline for a global e-commerce platform. Mobile apps produce 50 000 events/sec that must power millisecond-latency personalization and be available for ad-hoc analytics in BigQuery. Compliance demands customer-managed encryption for stored data, no public IP addresses on processing workers, and a perimeter that blocks data exfiltration. Which architecture meets every requirement while minimising operational overhead?

  • Publish events to Pub/Sub; process them with streaming Dataflow jobs that use CMEK, disable public IPs, and run inside a VPC Service Controls perimeter; have the pipeline write in parallel to Bigtable for low-latency serving and BigQuery for analytics.

  • Use Cloud Data Fusion real-time pipelines to read from Pub/Sub and write to Spanner for both analytics and serving; rely on Cloud NAT for egress and Cloud DLP for data encryption.

  • Stream events to a self-managed Apache Kafka cluster on Compute Engine; use Spark Streaming to land data in Cloud Storage; query it through BigQuery external tables.

  • Send events to Cloud Storage with default encryption; schedule Dataproc Spark batch jobs on preemptible VMs via Cloud NAT; load the results into BigQuery and Cloud SQL for serving.

GCP Professional Data Engineer
Ingesting and processing the data
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot