GCP Professional Data Engineer Practice Question

Your team operates a Dataproc cluster with three masters and 30 workers to run a two-hour Spark ETL job every night. For the remaining 22 hours the cluster is idle, yet the Spark application requires the same custom image and machine type for every run. After a 40 % budget cut, you must dramatically reduce compute spend while keeping job performance and configuration isolation intact. What should you do?

Use a Dataproc workflow template (or Cloud Composer DAG) that creates a job-scoped Dataproc cluster with the required image and machine type, runs the Spark job, and automatically deletes the cluster when the job completes.
Rewrite the Spark application for Cloud Dataflow and schedule it nightly with a template launch.
Convert the existing cluster to an autoscaling persistent cluster with a minimum of zero workers and only preemptible secondary workers.
Enable Dataproc High Availability and schedule the cluster to hibernate during idle hours with Cloud Scheduler scripts.

GCP Professional Data Engineer

Maintaining and automating data workloads

Your Score:

Bash, the Crucial Exams Chat Bot

AI Bot

GCP Professional Data Engineer Practice Question

Answer Description

Ask Bash

What is a Dataproc workflow template?

Why would using Cloud Dataflow require code changes?

What is the difference between an ephemeral Dataproc cluster and a persistent cluster?

What is a Dataproc workflow template?

Why is an ephemeral Dataproc cluster cost-effective?

What is the difference between Dataproc and Cloud Dataflow?

Monthly

$19.99

Billed monthly,
Cancel any time.

3 Month Pass

$44.99

One time purchase of $44.99,
Does not auto-renew.

Annual Pass

$119.99

One time purchase of $119.99,
Does not auto-renew.

Lifetime Pass

$189.99

One time purchase,
Good for life.

All Exams

Unlimited Tests

Unlimited Questions

AI Tutor

Track scores

Report Cards

Voucher Discounts

Advanced PBQs

Included Exams

GCP Professional Data Engineer Practice Question

Report Issue

Answer Description

Ask Bash

What is a Dataproc workflow template?

Why would using Cloud Dataflow require code changes?

What is the difference between an ephemeral Dataproc cluster and a persistent cluster?

What is a Dataproc workflow template?

Why is an ephemeral Dataproc cluster cost-effective?

What is the difference between Dataproc and Cloud Dataflow?

Report Issue