AWS Certified Data Engineer Associate DEA-C01 Practice Question
A data engineering team processes log files stored in Amazon S3. Nightly AWS Glue ETL jobs write curated data back to S3, while analysts run ad-hoc queries with Amazon Athena and Apache Spark on Amazon EMR. Maintaining separate metastores for each service has resulted in schema drift and extra administration. The team needs a single, serverless data catalog that all three services can reference directly, with the least operational overhead. Which approach satisfies these requirements?
Use the AWS Glue Data Catalog as the unified metastore and configure both Athena and EMR to reference it.
Create external schemas in Amazon Redshift and have Athena and EMR issue federated queries against them.
Run an Apache Hive metastore on the EMR primary node and connect Athena to it with AWS Glue connectors.
Store table metadata in an Amazon DynamoDB table and update Athena and EMR Spark jobs to read from it using custom code.
The AWS Glue Data Catalog is a fully managed, serverless metastore. Amazon Athena uses it by default, and Amazon EMR can be configured to use it as its Hive metastore. Pointing Athena and EMR at the same Glue Data Catalog that the Glue ETL jobs already use gives all three services a consistent view of table definitions without running or maintaining additional infrastructure. The other options fall short. External schemas in Amazon Redshift do not act as a central Hive metastore. A self-managed Hive metastore on the EMR primary node introduces operational overhead, and Athena cannot use it natively; it would need an additional connector to reach it. Storing metadata in DynamoDB would require custom integration logic.
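To illustrate the EMR side of the correct answer, here is a minimal sketch of the configuration classifications that point Hive and Spark on an EMR cluster at the Glue Data Catalog. It is not part of the original question. The helper name `glue_catalog_configurations` is hypothetical; the property name and factory class are the ones AWS documents for this purpose, but check them against the documentation for your EMR release before relying on them.

```python
import json

# Client factory class that makes Hive/Spark on EMR use the Glue Data Catalog
# as the metastore (taken from AWS documentation; verify for your EMR release).
GLUE_FACTORY = (
    "com.amazonaws.glue.catalog.metastore.AWSGlueDataCatalogHiveClientFactory"
)


def glue_catalog_configurations():
    """Build EMR configuration classifications for Hive and Spark.

    Hypothetical helper: it only assembles plain dictionaries and makes no
    AWS calls. The result is shaped like the Configurations list accepted
    when creating an EMR cluster.
    """
    props = {"hive.metastore.client.factory.class": GLUE_FACTORY}
    return [
        # Applies to Hive running on the cluster.
        {"Classification": "hive-site", "Properties": dict(props)},
        # Applies to Spark SQL running on the cluster.
        {"Classification": "spark-hive-site", "Properties": dict(props)},
    ]


if __name__ == "__main__":
    # Print the JSON so it can be saved to a file and supplied at cluster creation.
    print(json.dumps(glue_catalog_configurations(), indent=2))
```

The printed JSON could be supplied as the cluster's configurations when the cluster is created. Athena needs no equivalent step, since it reads the Glue Data Catalog by default.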
AWS Certified Data Engineer Associate DEA-C01
Data Store Management