Your company's data platform team must provide a self-service environment where data engineers across multiple projects can discover, profile, and govern files stored in Cloud Storage and tables in BigQuery. The solution should automatically scan new and existing assets to harvest technical metadata, generate data profiles that include statistics such as null counts and cardinality, and surface the assets through a unified catalog that supports fine-grained access controls. The team wants to minimize custom code and avoid deploying third-party software. Which design best satisfies these requirements?
Create Dataplex lakes and governed zones that reference the Cloud Storage buckets and BigQuery datasets, enable automated discovery, data profiling, and quality scans in each zone, and use the Dataplex catalog for cross-project search and access control.
Use Cloud Asset Inventory to index storage objects and datasets, and schedule BigQuery Data Transfer Service jobs to load audit logs that analysts can query for metadata and quality metrics.
Centralize all data by copying it into a single BigQuery dataset with BigQuery Omni, then rely on INFORMATION_SCHEMA views and custom Cloud Composer DAGs to generate profiling reports.
Register every bucket and dataset in standalone Data Catalog entry groups and trigger Cloud Functions that launch Dataflow jobs to calculate statistics and update metadata tables.
Dataplex natively unifies governance for Cloud Storage and BigQuery by grouping assets into lakes and zones. When you attach a bucket or dataset to a zone, Dataplex automatically runs discovery jobs that register the assets in the Dataplex (Data Catalog) catalog, creates data profiles with statistics such as null counts, and can run built-in data quality scans. Access to the cataloged assets is controlled through IAM roles on Dataplex resources, giving fine-grained governance without custom pipelines. The other options either rely on separate services that do not provide automated profiling (BigQuery INFORMATION_SCHEMA), require custom functions and pipelines to keep metadata up to date, or use services (Cloud Asset Inventory, BigQuery Data Transfer Service) that are not intended for end-user data discovery and quality management.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is Dataplex and how does it support data governance in this solution?
Open an interactive chat with Bash
What are data profiles and how does Dataplex generate them?
Open an interactive chat with Bash
How does Dataplex provide fine-grained access control, and why is it beneficial?
Open an interactive chat with Bash
What is Dataplex in GCP?
Open an interactive chat with Bash
How does Dataplex perform automated discovery and profiling?
Open an interactive chat with Bash
Why is Dataplex better than standalone Data Catalog or other services?
Open an interactive chat with Bash
GCP Professional Data Engineer
Designing data processing systems
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99 $11.99
$11.99/mo
Billed monthly, Cancel any time.
$19.99 after promotion ends
3 Month Pass
$44.99 $26.99
$8.99/mo
One time purchase of $26.99, Does not auto-renew.
$44.99 after promotion ends
Save $18!
MOST POPULAR
Annual Pass
$119.99 $71.99
$5.99/mo
One time purchase of $71.99, Does not auto-renew.
$119.99 after promotion ends
Save $48!
BEST DEAL
Lifetime Pass
$189.99 $113.99
One time purchase, Good for life.
Save $76!
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .