GCP Professional Data Engineer Practice Question

Your analytics team executes four large Spark batch ETL jobs every night, each with different libraries and executor memory requirements. During business hours, data scientists occasionally run short interactive Hive queries that must return within minutes. You want to minimize Dataproc costs without sacrificing performance or isolating the nightly jobs from one another. Which strategy best meets these goals?

Submit each nightly batch job to its own ephemeral Dataproc cluster and delete the cluster on completion; maintain a small persistent cluster for interactive queries.
Run both the batch jobs and interactive queries on a single persistent Dataproc cluster with autoscaling disabled to avoid provisioning delays.
Provision a separate always-on persistent Dataproc cluster for each nightly batch job to guarantee resource isolation, and shut them down in the morning.
Keep an always-on persistent cluster sized for the nightly batch peak, and launch short-lived job-based clusters only for interactive queries.

GCP Professional Data Engineer

Maintaining and automating data workloads

Your Score:

Bash, the Crucial Exams Chat Bot

AI Bot

GCP Professional Data Engineer Practice Question

Answer Description

Ask Bash

What is an ephemeral Dataproc cluster?

What is the difference between ephemeral and persistent Dataproc clusters?

Why use an ephemeral cluster for nightly batch jobs?

Monthly

$19.99

Billed monthly,
Cancel any time.

3 Month Pass

$44.99

One time purchase of $44.99,
Does not auto-renew.

Annual Pass

$119.99

One time purchase of $119.99,
Does not auto-renew.

Lifetime Pass

$189.99

One time purchase,
Good for life.

All Exams

Unlimited Tests

Unlimited Questions

AI Tutor

Track scores

Report Cards

Voucher Discounts

Advanced PBQs

Included Exams

GCP Professional Data Engineer Practice Question

Report Issue

Answer Description

Ask Bash

What is an ephemeral Dataproc cluster?

What is the difference between ephemeral and persistent Dataproc clusters?

Why use an ephemeral cluster for nightly batch jobs?

Report Issue