GCP Professional Data Engineer Practice Question

Your team builds an ELT workflow in Dataform that lands raw click-stream data in BigQuery and publishes cleaned tables for analysts. Compliance requires the nightly job to stop immediately whenever the current load introduces duplicate primary keys or orphaned foreign keys. Which Dataform construct should you use to add these data-quality gates so that the pipeline run automatically fails when the rule-checking query returns rows?

  • Configure the tables as incremental in Dataform and filter out problematic records with a WHERE clause referencing the latest updated_at timestamp.

  • Create separate .sqlx files defined with type: "assertion", each containing a query that returns rows when the quality rule is violated.

  • Rely on BigQuery's built-in NOT NULL and UNIQUE table constraints to reject bad data during the load step.

  • Attach postOperations blocks to the target tables to delete duplicates and unresolved foreign keys after the load finishes.

GCP Professional Data Engineer
Ingesting and processing the data
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot