AWS Certified Data Engineer Associate DEA-C01 Practice Question
A data engineer must enable analysts to run ad hoc SQL queries from Amazon Athena, Amazon Redshift Spectrum, and Amazon EMR Presto against semi-structured JSON files stored in an S3 data lake. The solution must avoid duplicating table definitions and should automatically detect new daily partitions that land in the same S3 prefix. Which approach meets these requirements with minimal operational overhead?
Embed the JSON schema in every Spark job and instruct analysts to load the data into temporary views before running SQL queries.
Configure an AWS Glue crawler on the S3 prefix to populate an AWS Glue Data Catalog table and have all query engines reference that catalog.
Store Avro schema definition files alongside the data in S3 and rely on each engine's SerDe to discover new partitions at query time.
Create separate external tables with identical names in Athena, Redshift Spectrum, and the EMR Hive metastore, updating each table manually when partitions arrive.
AWS Glue Data Catalog provides a centralized Hive-compatible metastore that is natively supported by Athena, Redshift Spectrum, and EMR. Creating an AWS Glue crawler on the S3 prefix automatically infers the schema and adds or updates partitions on a schedule, so all three query engines can immediately consume the new data without additional DDL. Defining external tables separately, embedding schemas in application code, or storing inline schema files would require manual updates for every new partition and would not give the services a shared catalog.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is AWS Glue and how do crawlers work?
Open an interactive chat with Bash
What is the AWS Glue Data Catalog and why is it important?
Open an interactive chat with Bash
How does partition detection work with AWS Glue crawlers?
Open an interactive chat with Bash
What is an AWS Glue Data Catalog?
Open an interactive chat with Bash
How do AWS Glue crawlers work?
Open an interactive chat with Bash
Why is a shared catalog like AWS Glue important for query engines?
Open an interactive chat with Bash
AWS Certified Data Engineer Associate DEA-C01
Data Store Management
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .