GCP Professional Cloud Architect Practice Question
Your retail analytics team is migrating an on-premises computer-vision workflow to Google Cloud. Millions of product photos are already stored as JPEG objects in a regional Cloud Storage bucket. You need to design the data-ingestion step of a Vertex AI Pipeline that will:
Trigger human annotation jobs for any newly added images.
Maintain an auditable record of which images and labels fed each AutoML Vision training run so experiments are reproducible.
Allow future pipelines in the same project to reuse the labeled data without copying the objects. Which approach best meets these requirements?
Pass the Cloud Storage path directly to each AutoML Vision training component and store the list of processed object names in Artifact Registry for audit purposes.
Mount the Cloud Storage bucket into every training container with gcsfuse and track the exact file set in a Git repository alongside pipeline code.
Import the Cloud Storage URIs into a Vertex AI Managed Dataset and use that dataset as the source for labeling tasks and every training component.
Convert the images to base64 strings, load them into a BigQuery table, and point AutoML Tables at the table for training.
Creating a Vertex AI Managed Dataset that simply references the Cloud Storage object URIs gives the pipeline a first-class, versionable resource that stores metadata and annotations while leaving the image files in place. Vertex AI datasets integrate directly with Data Labeling so new items can be queued for human labeling, and the same dataset can be reused by any subsequent AutoML or custom-training step without duplicating data. Passing raw bucket paths, storing lists in Artifact Registry, or using gcsfuse + Git lacks tight integration with Vertex AI labeling and lineage tracking. Importing images into BigQuery and training with AutoML Tables is inappropriate because Tables expects structured tabular data, not images.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is a Vertex AI Managed Dataset?
Open an interactive chat with Bash
How does Vertex AI integrate with Data Labeling?
Open an interactive chat with Bash
Why is using Cloud Storage URIs in Vertex AI better than storing base64 strings or paths elsewhere?
Open an interactive chat with Bash
What is Vertex AI Managed Dataset?
Open an interactive chat with Bash
How does Cloud Storage integrate with Vertex AI pipelines?
Open an interactive chat with Bash
Why is it beneficial to use Vertex AI datasets for labeling tasks?
Open an interactive chat with Bash
GCP Professional Cloud Architect
Managing and provisioning a solution infrastructure
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99 $11.99
$11.99/mo
Billed monthly, Cancel any time.
$19.99 after promotion ends
3 Month Pass
$44.99 $26.99
$8.99/mo
One time purchase of $26.99, Does not auto-renew.
$44.99 after promotion ends
Save $18!
MOST POPULAR
Annual Pass
$119.99 $71.99
$5.99/mo
One time purchase of $71.99, Does not auto-renew.
$119.99 after promotion ends
Save $48!
BEST DEAL
Lifetime Pass
$189.99 $113.99
One time purchase, Good for life.
Save $76!
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .