Your analytics team processes 3 PB of application logs that arrive via Pub/Sub, and a Dataflow pipeline writes the raw events as Avro files. Compliance rules mandate that each file be retained unaltered for seven years, yet engineers access the logs only a few times after the first week. Analysts must be able to run occasional ad-hoc SQL queries from BigQuery without first loading the data into native tables. Which Google Cloud storage approach will minimize total cost while satisfying durability, retention, and query needs?
Write the logs to an AlloyDB for PostgreSQL instance using compressed columnar storage and grant BigQuery federated access.
Keep the Avro files in a Cloud Storage bucket configured with the Archive storage class via lifecycle rules, and define BigQuery external tables to query them in place.
Persist the logs in a regional Cloud Bigtable cluster and export snapshots to Cloud Storage once per quarter.
Load the Avro files into partitioned BigQuery tables and rely on BigQuery's long-term storage pricing to reduce costs.
Storing the Avro files in a Cloud Storage bucket under the Archive storage class minimizes cost for data that is rarely accessed and will be kept far longer than Archive's 365-day minimum storage duration. Archive offers the same durability and millisecond-level access latency as the other Cloud Storage classes at the lowest per-GB storage price; reads do incur retrieval fees, which is acceptable here because queries are only occasional. A lifecycle rule can automatically move objects from Standard (for the initial week of frequent use) to Archive, and a bucket retention policy can enforce the seven-year requirement. BigQuery can define an external table over the Avro objects in Cloud Storage, so analysts can query the data in place without loading it into native tables or duplicating storage. Keeping the data in BigQuery tables, a Bigtable cluster, or an AlloyDB instance would incur significantly higher ongoing storage costs and provide capabilities (high-performance scans or transactions) that the workload does not require.
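As a rough illustration of the explanation above, the following Python sketch builds the two pieces of configuration involved: a Cloud Storage lifecycle configuration that moves Standard objects to Archive after seven days, and a BigQuery DDL statement that defines an external table over the Avro files. It only generates the JSON and SQL text and does not call any Google Cloud API. The bucket name, dataset, table name, and the seven-day threshold are hypothetical values chosen for the example, not part of the original question.

```python
import json


def archive_lifecycle_config(age_days: int = 7) -> dict:
    """Build a lifecycle configuration that transitions objects to Archive.

    The 7-day default is an assumption based on the "first week" of
    frequent access mentioned in the scenario.
    """
    return {
        "rule": [
            {
                "action": {"type": "SetStorageClass", "storageClass": "ARCHIVE"},
                # Only objects still in Standard and at least age_days old match.
                "condition": {"age": age_days, "matchesStorageClass": ["STANDARD"]},
            }
        ]
    }


def external_table_ddl(table: str, uri_pattern: str) -> str:
    """Build a BigQuery DDL statement for an external table over Avro files.

    Avro files carry their own schema, so no column list is given here.
    """
    return (
        f"CREATE EXTERNAL TABLE `{table}`\n"
        "OPTIONS (\n"
        "  format = 'AVRO',\n"
        f"  uris = ['{uri_pattern}']\n"
        ");"
    )


if __name__ == "__main__":
    # Hypothetical names, used for illustration only.
    print(json.dumps(archive_lifecycle_config(), indent=2))
    print(external_table_ddl(
        "logs_dataset.raw_events",
        "gs://example-log-bucket/events/*.avro",
    ))
```

The JSON output could be saved to a file and applied to a bucket with the Cloud Storage tooling, and the SQL could be run in BigQuery; both steps are left out so the sketch stays self-contained.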
GCP Professional Data Engineer
Storing the data