Your team must design a storage layout for an e-commerce platform that streams about 50 000 clickstream events per second (≈5 TB/day). The recommendation engine needs single-digit millisecond access to the most recent seven days of events, while data scientists expect to run large ad-hoc SQL analyses on the full event history every morning. You want a fully managed solution that minimizes operational overhead and storage cost. Which approach best satisfies these requirements?
Persist events in Cloud SQL for PostgreSQL with logical replication into BigQuery; query Cloud SQL for the latest seven days and BigQuery for historical analysis.
Stream events into Cloud Bigtable for operational access; use a Dataflow job to write each event to a partitioned BigQuery table and delete data in Bigtable after seven days.
Ingest events directly into BigQuery partitioned tables and build materialized views on the last seven days for the recommendation engine.
Write all events to Cloud Storage Nearline; define a BigQuery external table for analytics and use Memorystore to cache the most recent events for recommendations.
Cloud Bigtable is optimized for very high-throughput, sub-10 ms reads and writes, making it well suited for serving the most recent seven days of clickstream events to the recommendation engine. A streaming Dataflow pipeline can simultaneously persist each event to BigQuery, where partitioned tables support cost-effective, serverless SQL analytics on the full history. Periodically expiring data older than seven days in Bigtable keeps operational storage costs low.
Cloud SQL cannot sustain tens of thousands of writes per second or provide millisecond reads at this scale, and its replication to BigQuery would still leave performance and scalability gaps. Memorystore is an in-memory cache that is not intended to store terabytes of data and lacks persistence. Storing all data only in BigQuery would not meet the sub-10 ms latency requirement for online recommendations. Therefore, the Bigtable plus Dataflow-to-BigQuery pattern is the most appropriate choice.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
Why is Cloud Bigtable suited for high-throughput, low-latency workloads?
Open an interactive chat with Bash
How does Dataflow assist in the data pipeline for this architecture?
Open an interactive chat with Bash
Why use BigQuery partitioned tables for historical analysis?
Open an interactive chat with Bash
What is Cloud Bigtable and why is it suited for high-throughput data?
Open an interactive chat with Bash
How does Dataflow work in transferring data between Bigtable and BigQuery?
Open an interactive chat with Bash
Why is BigQuery partitioning necessary for cost-effective analytics?
Open an interactive chat with Bash
Why is Cloud Bigtable better for serving recent events compared to other options?
Open an interactive chat with Bash
What is the role of partitioned tables in BigQuery?
Open an interactive chat with Bash
How does Dataflow integrate with Cloud Bigtable and BigQuery?
Open an interactive chat with Bash
GCP Professional Data Engineer
Storing the data
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99 $11.99
$11.99/mo
Billed monthly, Cancel any time.
$19.99 after promotion ends
3 Month Pass
$44.99 $26.99
$8.99/mo
One time purchase of $26.99, Does not auto-renew.
$44.99 after promotion ends
Save $18!
MOST POPULAR
Annual Pass
$119.99 $71.99
$5.99/mo
One time purchase of $71.99, Does not auto-renew.
$119.99 after promotion ends
Save $48!
BEST DEAL
Lifetime Pass
$189.99 $113.99
One time purchase, Good for life.
Save $76!
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .