Your company operates multiple Cloud Dataflow streaming pipelines across dev, staging, and prod projects. The SRE team wants to receive a PagerDuty notification when any production pipeline accumulates more than 10 minutes of system lag for at least 5 consecutive minutes, but they do not want alerts to fire after a job is drained or cancelled. Which Cloud Monitoring alerting configuration best satisfies these requirements?
Create a log-based metric that counts ERROR entries from Dataflow job logs and configure an uptime check alert when the metric count is non-zero for 5 minutes.
Set an alert on pubsub.googleapis.com/subscription/num_undelivered_messages aggregated across all streams and trigger when backlog exceeds the equivalent of 10 minutes of messages.
Create an alerting policy on metric dataflow.googleapis.com/job/current_system_lag for resource type dataflow_job, filter on label job_state="JOB_STATE_RUNNING", and fire when the maximum value exceeds 600 seconds for 5 minutes.
Alert on metric dataflow.googleapis.com/job/watermark with a condition that the minimum value is older than 600 seconds for 5 minutes; no additional label filters are required.
The metric that directly measures end-to-end delay in a streaming Dataflow job is dataflow.googleapis.com/job/current_system_lag. By creating an alerting policy on this metric with resource type dataflow_job, adding a filter that keeps only samples where the job_state label equals JOB_STATE_RUNNING, and setting a threshold of >600 seconds sustained for 5 minutes, the policy triggers only while the job is active and suppresses notifications once the job stops reporting. Using watermark metrics or Pub/Sub backlog does not capture Dataflow system lag directly, and relying on log-based or generic uptime checks would either miss the condition or generate false positives when a job ends.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is Cloud Dataflow's current_system_lag metric?
Open an interactive chat with Bash
What is the significance of filtering by job_state='JOB_STATE_RUNNING' in the alert configuration?
Open an interactive chat with Bash
Why is current_system_lag preferred over other metrics like watermark or pubsub undelivered messages?
Open an interactive chat with Bash
What is system lag in Cloud Dataflow?
Open an interactive chat with Bash
Why is the 'job_state="JOB_STATE_RUNNING"' filter important?
Open an interactive chat with Bash
What is the dataflow.googleapis.com/job/current_system_lag metric?
Open an interactive chat with Bash
GCP Professional Data Engineer
Maintaining and automating data workloads
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .