AWS Certified Data Engineer Associate DEA-C01 Practice Question
A company ingests web server logs compressed with Gzip into Amazon S3 at the path s3://logs/yyyy/mm/dd/. Analysts frequently query a single day of data with Amazon Athena. The data engineer must minimize scan cost and avoid any jobs that add partitions when new log files arrive. Which solution meets these requirements?
Use an AWS Glue ETL job to convert the logs to Parquet and store them in an unpartitioned destination bucket queried by Athena.
Configure an S3 event notification to trigger an AWS Lambda function that runs ALTER TABLE ADD PARTITION for each newly written log object.
Define an AWS Glue table that uses partition projection for year, month, and day, and instruct analysts to include date predicates in their Athena queries.
Schedule a nightly MSCK REPAIR TABLE command in AWS Glue to discover and add the previous day's partitions to the catalog.
Partition projection allows Athena to infer Hive-style partitions (year, month, day) at query time without storing every partition in the Data Catalog. By enabling partition projection on the Glue table and requiring analysts to include date predicates, Athena reads only the relevant S3 prefixes, minimizing the amount of data scanned. The Lambda and MSCK REPAIR TABLE approaches both add operational overhead to maintain partitions, while converting to unpartitioned Parquet still causes Athena to scan the entire dataset for each query.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is partition projection in Athena?
Open an interactive chat with Bash
Why is partition projection better than using MSCK REPAIR TABLE?
Open an interactive chat with Bash
What are the advantages of using partitioned data in Athena queries?
Open an interactive chat with Bash
AWS Certified Data Engineer Associate DEA-C01
Data Operations and Support
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .