AWS Certified Data Engineer Associate DEA-C01 Practice Question

Your data engineering team stores daily AWS Glue Apache Spark job logs as compressed JSON files in an Amazon S3 bucket. Analysts must run ad hoc SQL queries to find long-running stages and join the results with an existing reference dataset that also resides in S3. The logs must become queryable within minutes of delivery, and the solution must require no servers to manage and minimize operational effort. Which solution best meets these requirements?

  • Run an AWS Glue crawler on the log prefix to update the Data Catalog and query both log and reference tables in Amazon Athena.

  • Stream the log files from S3 into Amazon CloudWatch Logs and analyze them with CloudWatch Logs Insights queries.

  • Deliver the logs to Amazon OpenSearch Service with Amazon Kinesis Data Firehose and query them alongside the reference data using OpenSearch Dashboards.

  • Launch an on-demand Amazon EMR cluster with Trino, mount the S3 buckets, and submit SQL queries through the Trino coordinator.
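
To make the analysts' requirement concrete, the following is a minimal sketch of the query shape the scenario describes: filter log rows for long-running stages, then join them to a reference table. It uses an in-memory SQLite database purely as a local stand-in and is not tied to any of the answer options. The table names, column names, sample rows, and the 600-second threshold are all hypothetical, since the question does not specify a schema.

```python
import sqlite3

# Local stand-in only: SQLite is used here to show the query shape,
# not to represent any AWS service from the answer options.
conn = sqlite3.connect(":memory:")

# Hypothetical schemas and sample rows; the question gives no column names.
conn.executescript("""
    CREATE TABLE job_logs (job_name TEXT, stage_id INTEGER, duration_seconds REAL);
    CREATE TABLE job_reference (job_name TEXT, owner_team TEXT);
    INSERT INTO job_logs VALUES
        ('orders_etl', 1, 42.0),
        ('orders_etl', 2, 930.5),
        ('clicks_etl', 1, 15.2),
        ('clicks_etl', 2, 1210.0);
    INSERT INTO job_reference VALUES
        ('orders_etl', 'sales'),
        ('clicks_etl', 'marketing');
""")

# Assumed cutoff for what counts as a "long-running" stage.
LONG_RUNNING_THRESHOLD_SECONDS = 600

# Filter the log table, then join each surviving row to the reference data.
query = """
    SELECT l.job_name, l.stage_id, l.duration_seconds, r.owner_team
    FROM job_logs AS l
    JOIN job_reference AS r ON r.job_name = l.job_name
    WHERE l.duration_seconds > ?
    ORDER BY l.duration_seconds DESC
"""
long_running_stages = conn.execute(
    query, (LONG_RUNNING_THRESHOLD_SECONDS,)
).fetchall()

for row in long_running_stages:
    print(row)
```

Whichever option is chosen must support this kind of filter-and-join across both datasets where they already reside in S3.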

Domain: Data Operations and Support