AWS Certified Data Engineer Associate DEA-C01 Practice Question

An organization stores Parquet data in Amazon S3 and exposes it through external tables tracked in an on-premises Apache Hive metastore. During a phased migration, new Amazon EMR clusters and Amazon Athena must read and update the same catalog while the existing Hadoop cluster keeps running. The team wants a fully managed solution and to avoid manual schema synchronization. Which approach meets these goals with minimal operations?

Export the current metastore database, import it into an Amazon RDS MySQL instance, and point all future EMR clusters to that RDS endpoint while the on-premises cluster retains its original metastore.
Schedule an AWS Glue crawler to scan the S3 prefixes hourly to recreate the tables, and direct EMR and Athena to the resulting Glue databases while the on-premises cluster continues to use its local metastore.
Create Lake Formation resource links for each table and grant cross-account permissions, then query the data from EMR through Redshift Spectrum and from the on-premises cluster through its existing metastore.
Use the AWS Glue metastore-import utility to migrate the existing Hive schema into the AWS Glue Data Catalog, then configure both new EMR clusters and the on-premises Hadoop cluster to use the Glue Data Catalog as their Hive metastore.

AWS Certified Data Engineer Associate DEA-C01

Data Store Management

Your Score:

Bash, the Crucial Exams Chat Bot

AI Bot

AWS Certified Data Engineer Associate DEA-C01 Practice Question

Answer Description

Ask Bash

What is the AWS Glue metastore-import utility?

How do EMR clusters use the AWS Glue Data Catalog as a Hive metastore?

How does the Glue Data Catalog integrate with on-premises Hadoop clusters?

Monthly

$19.99

Billed monthly,
Cancel any time.

3 Month Pass

$44.99

One time purchase of $44.99,
Does not auto-renew.

Annual Pass

$119.99

One time purchase of $119.99,
Does not auto-renew.

Lifetime Pass

$189.99

One time purchase,
Good for life.

All Exams

Unlimited Tests

Unlimited Questions

AI Tutor

Track scores

Report Cards

Voucher Discounts

Advanced PBQs

Included Exams

AWS Certified Data Engineer Associate DEA-C01 Practice Question

Report Issue

Answer Description

Ask Bash

What is the AWS Glue metastore-import utility?

How do EMR clusters use the AWS Glue Data Catalog as a Hive metastore?

How does the Glue Data Catalog integrate with on-premises Hadoop clusters?

Report Issue