A global media company stores raw logs in several Cloud Storage buckets across multiple regions and ingests curated data into multiple BigQuery projects that are owned by different business units. Data scientists complain that they cannot easily discover which tables contain video-stream metrics or which buckets store ad-impression logs without asking individual teams. The chief data officer wants a single place where all Cloud Storage objects and BigQuery tables are automatically indexed, enriched with business metadata, and made searchable through a common API while still enforcing existing IAM policies. As the lead data engineer, which design should you implement to satisfy these requirements with minimal custom development and maximum portability across present and future Google Cloud projects?
Use Cloud Asset Inventory to list resources across projects, write Cloud Functions that parse the export into Pub/Sub, and build a Looker dashboard for interactive search.
Enable BigQuery Data Catalog in every project, export catalog entries nightly to a central Cloud SQL instance, and build a custom front-end that merges the exports for search.
Create a Dataplex lake, attach each Cloud Storage bucket and BigQuery dataset as governed assets, define zones and business tags, and rely on the Dataplex Universal Catalog (searchable through Data Catalog APIs) for discovery.
Install an open-source metadata repository such as DataHub on Google Kubernetes Engine, build custom crawlers for Cloud Storage and BigQuery, and expose search through a REST endpoint.
Dataplex can attach Cloud Storage buckets and BigQuery datasets from any project into logical data lakes and zones. When an asset is attached, Dataplex's built-in metadata service automatically crawls the underlying storage, extracts technical metadata (schemas, locations, partitions), and surfaces it, together with user-defined business tags, in the Dataplex Universal Catalog. The catalog entries reside in Data Catalog, so analysts can use the Data Catalog search APIs to look for business terms such as "video-stream metrics" or "ad-impressions" without knowing the physical location of the data. IAM policies applied at the project, dataset, or bucket level are respected because Dataplex manages only references and does not copy data.
The other options all require custom development. Running a third-party catalog or exporting metadata to another system adds operational overhead and gives up Dataplex's tight integration with Google Cloud services and IAM, which reduces portability and increases maintenance effort. Therefore, enabling Dataplex, organizing assets into lakes and zones, and relying on the Universal Catalog best meets the stated goals.
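To illustrate the discovery step described above, here is a minimal Python sketch of a catalog search across several projects. It assumes the `google-cloud-datacatalog` client library and application default credentials are available. The helper names `build_search_request` and `find_resources`, the project IDs, and the search keyword are hypothetical and chosen only for illustration.

```python
from typing import Iterable, Iterator


def build_search_request(project_ids: Iterable[str], keyword: str, system: str = "") -> dict:
    """Build the request dict passed to DataCatalogClient.search_catalog.

    `keyword` is a free-text term; `system` optionally narrows results with a
    qualified predicate such as system=bigquery.
    """
    query = f"{keyword} system={system}" if system else keyword
    return {
        # Scope limits the search to the listed projects.
        "scope": {"include_project_ids": list(project_ids)},
        "query": query,
    }


def find_resources(request: dict) -> Iterator[str]:
    """Run the search and yield the linked resource name of each result."""
    # Imported lazily so the request builder works without the client library.
    from google.cloud import datacatalog_v1

    client = datacatalog_v1.DataCatalogClient()
    for result in client.search_catalog(request=request):
        yield result.linked_resource


if __name__ == "__main__":
    # Hypothetical project IDs; replace with real ones before running.
    request = build_search_request(["media-logs-prod", "ads-analytics"], "impressions", system="bigquery")
    for resource in find_resources(request):
        print(resource)
```

The caller's own credentials are used for the search, so which results come back depends on the permissions that identity already holds.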
GCP Professional Data Engineer
Designing data processing systems