🔥 40% Off Crucial Exams Memberships — Deal ends today!

1 hour, 53 minutes remaining!

GCP Professional Data Engineer Practice Question

Your analytics team runs a Spark ETL job every night. The job processes 3 TB of data, writes the cleansed result to a BigQuery table, and must finish within a two-hour window. Today it executes on a three-node Dataproc cluster that is kept running 24 × 7 and stores temporary files in HDFS on the cluster's persistent disks. The cluster is idle the rest of the day, and monthly Compute Engine and persistent-disk charges have become the largest cost item in the project. You have been asked to redesign the solution to cut operating costs while still meeting the existing SLA and without rewriting the Spark code. Which approach best meets these requirements?

  • Keep the current cluster but enable Dataproc autoscaling and resize the cluster to zero workers after the job finishes; restart the same cluster before the next run.

  • Migrate the Spark job to a scheduled BigQuery stored procedure that rewrites the ETL logic in SQL and leverages BigQuery's on-demand pricing.

  • Add local SSDs to the existing persistent cluster for faster I/O and purchase Committed Use Discounts on the VM instances to lower hourly costs.

  • Replace the persistent cluster with an ephemeral Dataproc workflow that spins up a job-scoped cluster each night, uses Cloud Storage instead of HDFS, and optionally adds pre-emptible secondary workers for extra capacity at lower cost.

GCP Professional Data Engineer
Maintaining and automating data workloads
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot