AWS Certified Data Engineer Associate DEA-C01 Practice Question

A data engineering team runs an AWS Glue Spark job that reads raw data from Amazon S3, applies transformations, and writes the output back to S3. The team must capture all driver, executor, and custom application logs, keep them searchable for 30 days, and minimize the operational effort required to manage the logging solution. Which approach best meets these requirements?

  • Turn on AWS CloudTrail data events for the source and target S3 buckets and query the resulting logs with Amazon Athena.

  • Enable execution logging for the AWS Glue job so that all stdout and stderr output is sent to an Amazon CloudWatch Logs group, configure a 30-day retention policy on the group, and use CloudWatch Logs Insights for log searches.

  • Stream AWS Glue logs through Amazon Kinesis Data Streams into an Amazon OpenSearch Service domain that rolls over indices every 30 days.

  • Modify the job script to write log files to a separate S3 bucket and schedule a daily Athena query to analyze the logs.
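
For reference, the CloudWatch-based option above could be configured roughly as follows. This is a minimal sketch, not a definitive setup: it assumes the "execution logging" in that option maps to AWS Glue's continuous-logging job arguments, and the log group name `/aws-glue/jobs/example-etl` is a hypothetical placeholder. The helper functions only build request parameters; `apply_retention` is the one function that calls AWS, and it needs `boto3` plus valid credentials.

```python
# Sketch only: the log group name and helper names are hypothetical placeholders.

EXAMPLE_LOG_GROUP = "/aws-glue/jobs/example-etl"  # assumed name, not from the question


def glue_logging_arguments(log_group: str) -> dict:
    """Glue job arguments that send driver/executor output to CloudWatch Logs."""
    return {
        "--enable-continuous-cloudwatch-log": "true",
        "--continuous-log-logGroup": log_group,
    }


def retention_request(log_group: str, days: int = 30) -> dict:
    """Keyword arguments for the CloudWatch Logs PutRetentionPolicy call."""
    return {"logGroupName": log_group, "retentionInDays": days}


# A CloudWatch Logs Insights query that surfaces recent error lines.
INSIGHTS_QUERY = (
    "fields @timestamp, @message "
    "| filter @message like /ERROR/ "
    "| sort @timestamp desc "
    "| limit 50"
)


def apply_retention(log_group: str, days: int = 30) -> None:
    """Apply the retention policy; requires boto3 and AWS credentials."""
    import boto3  # imported lazily so the helpers above work without it

    logs = boto3.client("logs")
    logs.put_retention_policy(**retention_request(log_group, days))
```

With retention set on the log group, CloudWatch Logs expires events older than 30 days on its own, so no cleanup job or extra infrastructure is needed.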

Data Operations and Support