Your team is designing the first landing layer for a new pipeline that ingests about 10 TB per day of compressed JSON and image files through Pub/Sub and Dataflow. Compliance requires that the raw, immutable data be retained for at least seven years at the lowest possible cost. The data will later be transformed in Dataproc and occasionally queried in BigQuery, so it must remain accessible to multiple analytics engines and be able to transition automatically to colder, cheaper tiers as it ages. Which Google Cloud sink best satisfies these requirements?
A Cloud Bigtable instance with a wide-row schema storing each file as a cell value
A BigQuery dataset that relies on BigQuery long-term storage pricing after 90 days
A Cloud Storage bucket configured with lifecycle rules to serve as the organization's data lake
A regional Cloud Spanner database using a BLOB column for each ingested object
Cloud Storage is the best fit. It is Google Cloud's durable, low-cost object store and the usual landing zone for a data lake. It stores any object format a Dataflow pipeline writes, including compressed JSON and images, and is engine-agnostic (schema-on-read): Dataproc can read the objects directly, and BigQuery can load them or query them through external tables. Lifecycle management rules automatically move objects to the Nearline, Coldline, or Archive classes as they age, lowering cost over the seven-year retention period. BigQuery long-term storage pricing is cheaper than active storage, but the data stays inside the data warehouse and BigQuery is not designed to hold binary objects such as images. Cloud Bigtable and Spanner are operational databases optimized for low-latency reads and writes, not for inexpensive archival of petabytes of unstructured files (10 TB per day over seven years is roughly 25 PB).
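To make the lifecycle-rule idea concrete, here is a minimal Python sketch that builds a lifecycle configuration in the JSON shape Cloud Storage accepts (a top-level `rule` list whose entries pair a `SetStorageClass` action with an `age` condition). The age thresholds and the helper name `build_lifecycle_config` are assumptions chosen for illustration; they do not come from the question and should be tuned to real access patterns.

```python
import json

# Hypothetical age thresholds, in days since object creation.
# These values are illustrative assumptions, not part of the scenario.
TRANSITIONS = [
    (30, "NEARLINE"),
    (90, "COLDLINE"),
    (365, "ARCHIVE"),
]


def build_lifecycle_config(transitions):
    """Return a dict in the Cloud Storage lifecycle configuration JSON shape."""
    rules = []
    # Sort by age so the rules read from warmest to coldest class.
    for age_days, storage_class in sorted(transitions):
        rules.append({
            "action": {"type": "SetStorageClass", "storageClass": storage_class},
            "condition": {"age": age_days},
        })
    return {"rule": rules}


lifecycle_config = build_lifecycle_config(TRANSITIONS)

if __name__ == "__main__":
    # Print the JSON so it can be saved to a file such as lifecycle.json.
    print(json.dumps(lifecycle_config, indent=2))
```

The printed JSON could be saved to a file and applied to a bucket, for example with `gcloud storage buckets update gs://BUCKET_NAME --lifecycle-file=lifecycle.json`, where `BUCKET_NAME` is a placeholder. The sketch deliberately omits a `Delete` rule, because the scenario only states a minimum retention of seven years.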
GCP Professional Data Engineer
Ingesting and processing the data