You deployed a streaming Dataflow pipeline in us-central1-a that ingests Pub/Sub events and writes to BigQuery. A zonal outage in us-central1-a terminated all worker VMs, and the job remained in a FAILED state until an engineer manually relaunched it, breaking the two-minute processingālag SLA. You must redesign the deployment so that it can survive a future zone outage without human intervention while requiring the least possible code change. What should you do?
Enable FlexRS on the existing Dataflow job so interrupted workers are restarted on preemptible VMs in the same zone.
Redeploy the pipeline to the regional endpoint us-central1, letting Dataflow place and restart workers in healthy zones automatically.
Migrate the job to a Dataproc cluster with high-availability masters spread across zones and run it with Spark Structured Streaming.
Keep the current setup but configure the Cloud Composer task to retry the Dataflow job up to three times with exponential backoff.
Running the pipeline against the regional endpoint us-central1 (instead of the single-zone endpoint us-central1-a) allows Dataflow to distribute and, if necessary, recreate workers in any healthy zone within the region. Because the service automatically restarts failed workers and maintains the job's state, a zonal outage is transparently mitigated with no manual action. The other options either do not address cross-zone resilience (Composer retries, FlexRS), are not supported for streaming jobs (FlexRS), or require a significantly larger re-engineering effort (migrating to Dataproc).
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is the difference between a regional and a zonal endpoint in GCP Dataflow?
Open an interactive chat with Bash
How does Dataflow handle worker failures during pipeline execution?
Open an interactive chat with Bash
What are the advantages of using Pub/Sub and BigQuery with Dataflow pipelines?
Open an interactive chat with Bash
What is the difference between a zonal and regional endpoint in GCP Dataflow?
Open an interactive chat with Bash
How does using a regional endpoint improve fault tolerance in Dataflow?
Open an interactive chat with Bash
Why wouldn’t enabling FlexRS be a good solution for zone outages in streaming Dataflow jobs?
Open an interactive chat with Bash
GCP Professional Data Engineer
Maintaining and automating data workloads
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .