🔥 40% Off Crucial Exams Memberships — Deal ends today!

11 minutes, 5 seconds remaining!

GCP Professional Data Engineer Practice Question

Your company ingests 2 TB/day of raw IoT sensor data as daily Avro files in a Cloud Storage landing bucket cataloged by Dataplex. Each night a job must 1) mask customer PII, 2) aggregate readings by device-hour, and 3) write compressed Parquet files to a curated bucket. Volume can triple without notice. Ops requires a fully managed, pay-as-you-go service that auto-scales and needs no cluster maintenance. The same code must support back-fill runs via Cloud Scheduler with a date parameter. Which solution meets these needs?

  • Create a Spark SQL job in Cloud Dataproc, run it nightly on a static cluster sized for peak load, and orchestrate executions with Cloud Composer.

  • Load the Avro files into BigQuery staging tables with a scheduled load job, run SQL to mask and aggregate, then export the results as CSV files to Cloud Storage via a scheduled extract job.

  • Attach Cloud Functions to the landing bucket's object-finalize events; each function parses its file, removes PII, performs in-memory aggregation across all files, and writes a Parquet file back to the curated bucket.

  • Implement an Apache Beam pipeline, package it as a Dataflow Flex Template that reads Avro files from Cloud Storage, masks PII, aggregates by device-hour, writes compressed Parquet to the curated bucket, and trigger the template nightly from Cloud Scheduler with a date parameter.

GCP Professional Data Engineer
Storing the data
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot