AWS Certified Data Engineer Associate DEA-C01 Practice Question
A data engineering team ingests 10 TB of semi-structured clickstream events through Amazon Kinesis Data Firehose into an Amazon S3 data lake. Analysts will run frequent Amazon Athena queries that usually reference only a few columns. The team wants to minimize both query latency and the amount of data scanned to control cost. Which file format should the team configure Kinesis Data Firehose to deliver to S3 to best meet these requirements?
Amazon Athena charges by the amount of data it scans in Amazon S3. Columnar, splittable formats such as Apache Parquet store the values of each column contiguously within a row group and keep per-column statistics, such as minimum and maximum values, in the file metadata. When a query references only a subset of columns, Athena reads just those column chunks, so far less data is scanned, which lowers query cost and improves performance. The statistics also enable predicate pushdown: Athena can skip row groups whose value ranges cannot match a filter. Row-oriented text formats such as CSV and JSON, even when compressed, require Athena to read every field of every row and offer no predicate pushdown. Therefore, having Kinesis Data Firehose deliver the data as Parquet with a compression codec such as Snappy is the most cost-effective and performant choice for the described workload.
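The reasoning above can be illustrated with a rough Python sketch. It is not a definitive implementation: the price of $5 per TB scanned, the 10% column fraction, and the 4x compression ratio are assumptions chosen for illustration, and the helper names, Glue database and table names, and role ARN are hypothetical. The configuration helper only builds a plain dictionary shaped like the Firehose record format conversion settings and does not call AWS.

```python
# Illustrative sketch only: prices, ratios, and resource names are assumptions.

TB = 1024 ** 4  # bytes per terabyte as used in this estimate


def estimated_scan_cost(bytes_scanned: float, usd_per_tb: float = 5.0) -> float:
    """Estimate the Athena charge for one query from the bytes it scans.

    usd_per_tb is an assumed price; check current pricing for your Region.
    """
    return bytes_scanned / TB * usd_per_tb


def parquet_bytes_scanned(raw_bytes: float, column_fraction: float,
                          compression_ratio: float) -> float:
    """Rough bytes read when a query touches only some columns of Parquet data.

    column_fraction: assumed share of the data held by the referenced columns.
    compression_ratio: assumed size reduction from Parquet encoding plus Snappy.
    """
    return raw_bytes * column_fraction / compression_ratio


def parquet_conversion_config(database: str, table: str,
                              role_arn: str, region: str) -> dict:
    """Build a dict shaped like Firehose's DataFormatConversionConfiguration.

    The schema comes from an AWS Glue table; all argument values are
    placeholders supplied by the caller.
    """
    return {
        "Enabled": True,
        "SchemaConfiguration": {
            "DatabaseName": database,
            "TableName": table,
            "RoleARN": role_arn,
            "Region": region,
        },
        # Incoming records are assumed to be JSON.
        "InputFormatConfiguration": {"Deserializer": {"OpenXJsonSerDe": {}}},
        # Deliver Parquet compressed with Snappy.
        "OutputFormatConfiguration": {
            "Serializer": {"ParquetSerDe": {"Compression": "SNAPPY"}}
        },
    }


if __name__ == "__main__":
    raw = 10 * TB
    full_scan = estimated_scan_cost(raw)
    columnar = estimated_scan_cost(parquet_bytes_scanned(raw, 0.10, 4.0))
    print(f"Full scan of row-oriented data: ${full_scan:.2f}")
    print(f"Parquet, few columns (assumed): ${columnar:.2f}")
```

Under these assumed inputs, a full scan of 10 TB comes to $50.00 per query, while the Parquet estimate is about $1.25. With boto3, a dictionary like the one returned by `parquet_conversion_config` would be supplied as `DataFormatConversionConfiguration` inside `ExtendedS3DestinationConfiguration` when calling `create_delivery_stream`.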
Data Store Management