AWS Certified Data Engineer Associate DEA-C01 Practice Question

A company ingests web server logs compressed with Gzip into Amazon S3 at the path s3://logs/yyyy/mm/dd/. Analysts frequently query a single day of data with Amazon Athena. The data engineer must minimize scan cost and avoid any jobs that add partitions when new log files arrive. Which solution meets these requirements?

  • Use an AWS Glue ETL job to convert the logs to Parquet and store them in an unpartitioned destination bucket queried by Athena.

  • Configure an S3 event notification to trigger an AWS Lambda function that runs ALTER TABLE ADD PARTITION for each newly written log object.

  • Define an AWS Glue table that uses partition projection for year, month, and day, and instruct analysts to include date predicates in their Athena queries.

  • Schedule a nightly MSCK REPAIR TABLE command in AWS Glue to discover and add the previous day's partitions to the catalog.

AWS Certified Data Engineer Associate DEA-C01
Data Operations and Support
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot