AWS Certified Data Engineer Associate DEA-C01 Practice Question
Your data engineering team stores daily AWS Glue Apache Spark job logs as compressed JSON files in an Amazon S3 bucket. Analysts must run ad-hoc SQL to find long-running stages and join the results with an existing reference dataset that also resides in S3. New logs must become queryable within minutes of delivery, and the solution must require no servers to manage and minimize operational effort. Which solution best meets these requirements?
Run an AWS Glue crawler on the log prefix to update the Data Catalog and query both log and reference tables in Amazon Athena.
Stream the log files from S3 into Amazon CloudWatch Logs and analyze them with CloudWatch Logs Insights queries.
Deliver the logs to Amazon OpenSearch Service with Amazon Kinesis Data Firehose and query them alongside the reference data using OpenSearch Dashboards.
Launch an on-demand Amazon EMR cluster with Trino, mount the S3 buckets, and submit SQL queries through the Trino coordinator.
Running a Glue crawler to catalog the new log files and letting analysts query them with Amazon Athena is the only option that is fully serverless: there is no cluster or domain to manage, and the data becomes available for SQL queries shortly after it lands in S3, once the crawler has run (on a schedule or triggered by S3 events). Athena reads directly from S3, and the Glue Data Catalog supplies the table schemas, so analysts can join the log table to the reference table in the same query.
The other options fall short. An EMR cluster with Trino would work functionally but introduces nodes to provision, scale, and patch. Delivering the logs to Amazon OpenSearch Service requires an OpenSearch domain that the team must size and maintain, and its native query language is not standard SQL. CloudWatch Logs Insights cannot natively query objects that are already in S3; the logs would first have to be pushed to CloudWatch Logs, adding delay and extra steps.
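The crawler-plus-Athena flow described above can be sketched in Python. This is a minimal illustration rather than a reference implementation: the table names, the columns (`job_name`, `stage_id`, `duration_ms`, `owner`), and the helper function names are all assumptions made for the example, since the question does not specify a log schema. The client methods used (`start_crawler`, `start_query_execution`, `get_query_execution`) are the boto3 Glue and Athena calls; the clients are passed in as arguments so the logic can be exercised without AWS access.

```python
import time


def build_long_stage_query(log_table, ref_table, min_duration_ms):
    """Build SQL that joins long-running stages to a reference table.

    All column names here are hypothetical; adjust them to the schema
    the crawler actually infers from the log files.
    """
    return (
        "SELECT l.job_name, l.stage_id, l.duration_ms, r.owner\n"
        f"FROM {log_table} AS l\n"
        f"JOIN {ref_table} AS r ON l.job_name = r.job_name\n"
        f"WHERE l.duration_ms >= {int(min_duration_ms)}\n"
        "ORDER BY l.duration_ms DESC"
    )


def refresh_catalog(glue, crawler_name):
    """Start a crawler run so newly delivered log files get cataloged."""
    glue.start_crawler(Name=crawler_name)


def run_athena_query(athena, sql, database, output_location, poll_seconds=1.0):
    """Submit a query to Athena and poll until it reaches a terminal state.

    Returns a (query_execution_id, final_state) tuple.
    """
    started = athena.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        ResultConfiguration={"OutputLocation": output_location},
    )
    query_id = started["QueryExecutionId"]
    while True:
        status = athena.get_query_execution(QueryExecutionId=query_id)
        state = status["QueryExecution"]["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            return query_id, state
        time.sleep(poll_seconds)
```

With boto3 installed and credentials configured, `boto3.client("glue")` and `boto3.client("athena")` could be passed as the `glue` and `athena` arguments. In practice the crawler would more likely run on a schedule or an S3 event trigger than be started by hand, which keeps the operational effort low.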
AWS Certified Data Engineer Associate DEA-C01
Data Operations and Support