GCP Professional Data Engineer Practice Question

Your team ingests 15 TB of compressed application logs into Cloud Storage every night and immediately loads the data into a BigQuery staging table. A batch Dataflow pipeline then executes a series of SQL-like joins, filters, and aggregations before writing the daily results into BigQuery reporting tables. The Dataflow job's worker and shuffle costs have grown significantly, and the team wants to reduce operational overhead while keeping the transformation logic in ANSI-compatible SQL under version control. What should you recommend?

  • Re-implement the pipeline with Dataflow SQL templates and trigger them nightly with Cloud Scheduler.

  • Move the transformation logic into BigQuery by creating version-controlled SQL files managed with Dataform or scheduled queries, and drop the Dataflow job.

  • Keep the Dataflow pipeline but orchestrate it with Cloud Data Fusion to simplify management.

  • Replace the Dataflow job with a Dataproc cluster that runs Spark SQL notebooks scheduled by Cloud Composer.
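To make the BigQuery-native approach concrete, here is a minimal sketch of what a version-controlled Dataform definition for this workload might look like. The file name, dataset names, and column names (`event_timestamp`, `service`, `latency_ms`) are hypothetical placeholders, not part of the scenario above:

```sql
-- definitions/daily_report.sqlx (hypothetical file, kept in Git alongside the rest of the repo)
config {
  type: "table",
  schema: "reporting",  -- assumed target dataset for the daily results
  description: "Daily aggregates derived from the nightly staging load"
}

SELECT
  DATE(event_timestamp) AS report_date,   -- assumed timestamp column in the staging table
  service,                                -- assumed grouping dimension
  COUNT(*) AS event_count,
  AVG(latency_ms) AS avg_latency_ms       -- assumed metric column
FROM ${ref("staging_logs")}               -- ref() resolves the staging table and records lineage
GROUP BY report_date, service
```

Because the transformation runs entirely inside BigQuery, there are no Dataflow workers or shuffle costs to pay for, and the SQL lives in ordinary files that can be reviewed and versioned like any other code.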

Ingesting and processing the data