AWS Certified Data Engineer Associate DEA-C01 Practice Question
A retail company ingests about 1 TB of semi-structured clickstream data into AWS every day. The data must be retained for 6 months and be available for ad-hoc SQL queries by analysts who run a few dozen queries per day that typically return less than 2 GB of results. The company needs the lowest-cost solution while keeping query latency under 10 seconds. Which data storage approach should a data engineer implement to meet these requirements?
Load the data into an Amazon Redshift RA3 cluster with concurrency scaling enabled.
Persist the data in Amazon RDS for PostgreSQL on gp3 storage and use Amazon Redshift federated queries for reporting.
Ingest the data into an Amazon DynamoDB table configured for on-demand capacity and query it with PartiQL.
Store the data as compressed Parquet files in Amazon S3 and run interactive queries with Amazon Athena.
Storing the data in Amazon S3 and querying it with Amazon Athena is the most economical choice for infrequent, interactive analytics. S3 provides virtually unlimited, low-cost object storage, and Athena is a serverless query service that charges only for the amount of data scanned, so costs scale with actual analyst usage. Compressing the data into columnar Parquet files further reduces the number of bytes scanned, lowering both query cost and latency to well within the 10-second target.
Using an Amazon Redshift RA3 cluster or Amazon RDS PostgreSQL would require continuously running database instances, producing higher fixed costs for compute and storage. DynamoDB on-demand capacity is optimized for key-value access patterns; scanning multiple terabytes for SQL analytics would be expensive and could not meet the latency target.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
Why are Parquet files preferred for storing data in this solution?
Open an interactive chat with Bash
How does Amazon Athena achieve low-cost interactive querying?
Open an interactive chat with Bash
What factors influence query latency in Amazon Athena?
Open an interactive chat with Bash
Why are Parquet files preferred for storage on Amazon S3 for analytics?
Open an interactive chat with Bash
How does Amazon Athena charge for queries, and why is it cost-effective?
Open an interactive chat with Bash
What makes Amazon S3 suitable for storing large datasets like clickstream data?
Open an interactive chat with Bash
AWS Certified Data Engineer Associate DEA-C01
Data Store Management
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .