AWS Certified Data Engineer Associate DEA-C01 Practice Question
A company runs two Amazon EMR on EC2 clusters in separate VPCs that process the same Parquet data stored in an Amazon S3 data lake. The analytics team also uses Amazon Athena for ad-hoc queries. The team wants a single place to store table definitions, automatically track schema changes, and avoid managing its own Hive metastore infrastructure. Which approach meets these requirements with minimal operational overhead?
Deploy a dedicated MySQL Hive metastore in each VPC, schedule nightly metadata exports to an S3 bucket, and have Athena load the exports before every query.
Configure each EMR cluster to use the AWS Glue Data Catalog as its external Hive metastore and grant Athena IAM permissions to the catalog.
Store Avro schema files in an S3 location and configure Spark jobs and Athena to reference the files directly at runtime.
Enable AWS Lake Formation on the S3 data lake and rely on the default settings without changing the EMR metastore configuration.
Configuring each EMR cluster to use the AWS Glue Data Catalog as its external Hive metastore provides a centralized, serverless repository for table and partition metadata. The Data Catalog automatically tracks schema versions, can be accessed concurrently by multiple EMR clusters, and is natively used by Athena without additional setup. Creating independent Hive metastores or maintaining schema files in S3 requires more administration and does not give Athena direct access. Lake Formation catalogs data but still relies on the underlying Glue Data Catalog; enabling it alone does not configure EMR to use the catalog as a metastore.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is the AWS Glue Data Catalog?
Open an interactive chat with Bash
How do you configure Amazon EMR to use the AWS Glue Data Catalog?
Open an interactive chat with Bash
What are the advantages of using AWS Glue Data Catalog with Amazon Athena?
Open an interactive chat with Bash
AWS Certified Data Engineer Associate DEA-C01
Data Store Management
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .