Your analytics team is building a batch pipeline that: (1) triggers when 1000+ CSV files land in a Cloud Storage bucket, (2) launches a Dataflow job to cleanse and join the files, (3) waits until the job finishes, (4) runs a series of BigQuery SQL transformations, and (5) emails stakeholders if any step misses its SLA. The solution must capture task lineage, allow retry policies, enable backfills, and support future integration with on-prem Hadoop jobs. Which Cloud service should you adopt to orchestrate this pipeline?
App Engine cron jobs invoking Cloud Functions for each step
Cloud Composer (managed Apache Airflow)
Dataflow Flex templates orchestrated through Pub/Sub notifications
Cloud Composer is a managed Apache Airflow service. Airflow DAGs let you sequence heterogeneous tasks-including Cloud Storage sensors, Dataflow operators, and BigQuery operators-while providing retries, SLA monitoring, e-mail alerting, and historical backfill execution. Composer therefore covers every listed requirement and can later incorporate additional operators (for example, SSH or Spark on-prem) without re-architecting. Workflows excels at orchestrating short-lived API calls but lacks built-in scheduling, dependency-aware backfills, and the rich operator ecosystem needed for Dataflow or Hadoop integration. Dataflow Flex templates and Cloud Functions can run isolated jobs but do not coordinate multi-stage dependencies or provide scheduling and lineage out-of-the-box. App Engine's cron service can invoke functions on a schedule but offers no workflow graph, retries, or task-level monitoring.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is Apache Airflow and how does it work?
Open an interactive chat with Bash
What is SLA monitoring in Cloud Composer?
Open an interactive chat with Bash
How does Cloud Composer support integration with on-prem Hadoop jobs?
Open an interactive chat with Bash
What is Cloud Composer used for?
Open an interactive chat with Bash
What is the importance of lineage tracking in pipelines?
Open an interactive chat with Bash
Why can't App Engine cron jobs or Workflows be used for this pipeline?
Open an interactive chat with Bash
GCP Professional Data Engineer
Ingesting and processing the data
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99 $11.99
$11.99/mo
Billed monthly, Cancel any time.
$19.99 after promotion ends
3 Month Pass
$44.99 $26.99
$8.99/mo
One time purchase of $26.99, Does not auto-renew.
$44.99 after promotion ends
Save $18!
MOST POPULAR
Annual Pass
$119.99 $71.99
$5.99/mo
One time purchase of $71.99, Does not auto-renew.
$119.99 after promotion ends
Save $48!
BEST DEAL
Lifetime Pass
$189.99 $113.99
One time purchase, Good for life.
Save $76!
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .