GCP Professional Data Engineer Practice Question

Your analytics platform stores several years of click-stream data in Parquet files on Cloud Storage. Data scientists query the most recent partitions interactively through BigQuery, but new compliance rules require user-specific row-level filters to be enforced on both the historical data in Cloud Storage and the fact tables already ingested into BigQuery. Engineering stipulates that you must:

  • Avoid copying or re-loading the Parquet data into new BigQuery tables.
  • Maintain a single, consistent security policy that governs data in both the warehouse and the lake.
  • Preserve the ability for future Spark jobs on Dataproc to read the same Parquet files directly.

Which approach best meets these requirements while keeping operational overhead low?

  • Provide signed URLs protected by VPC Service Controls and instruct analysts to query the Parquet files with federated queries.

  • Create BigLake tables on the Parquet files and attach BigQuery row-level access policies so the same security model applies to both BigLake and existing native tables.

  • Load the Parquet partitions into new BigQuery managed tables and apply dataset-level IAM roles to enforce access controls.

  • Define BigQuery external tables on the Parquet objects and rely on Cloud Storage bucket-level IAM to restrict access to sensitive rows.
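For reference, here is a minimal sketch of how a BigLake table plus a BigQuery row-level access policy could be wired up over existing Parquet objects. All names (project, dataset, connection, bucket, group, and the `user_id` column) are hypothetical placeholders, not details from the question, and the DDL is issued through the standard google-cloud-bigquery client:

```python
from google.cloud import bigquery

client = bigquery.Client(project="my-project")  # hypothetical project ID

# Define a BigLake table over the Parquet files in place (no copy or re-load),
# using a Cloud Resource connection so BigQuery governs access to the objects.
ddl_biglake = """
CREATE EXTERNAL TABLE `my-project.analytics.clickstream_biglake`
WITH CONNECTION `my-project.us.lake-connection`
OPTIONS (
  format = 'PARQUET',
  uris = ['gs://my-clickstream-bucket/events/*.parquet']
)
"""

# Attach a row-level access policy to the BigLake table; the same statement
# works unchanged on existing native fact tables, giving one security model.
ddl_policy = """
CREATE ROW ACCESS POLICY analyst_filter
ON `my-project.analytics.clickstream_biglake`
GRANT TO ('group:analysts@example.com')
FILTER USING (user_id = SESSION_USER())
"""

for ddl in (ddl_biglake, ddl_policy):
    client.query(ddl).result()  # run each DDL statement and wait for it to finish
```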
