AWS Certified Data Engineer Associate DEA-C01 Practice Question

An analytics team runs an Amazon EMR cluster that finishes a nightly Spark batch job at 02:00 UTC. The job writes partitioned Parquet files to HDFS under /data/events/date=YYYY-MM-DD. The new files must be ingested into an Amazon S3 data lake by 03:00 UTC. The solution must minimize operational effort, avoid opening inbound ports on the cluster, and control costs. Which approach meets these requirements?

Install AWS DataSync agents on the EMR core nodes and configure a nightly task to copy the HDFS folder to Amazon S3.
Create an AWS Glue JDBC connection to the Hive metastore on the EMR master node and have an AWS Glue job read the HDFS location each night.
Add a nightly Amazon EMR step that runs DistCp from HDFS to an S3 bucket, orchestrated by AWS Step Functions.
Reconfigure the Spark job to write its output directly to an Amazon S3 prefix by using EMRFS, then schedule an AWS Glue crawler on that prefix to catalog the daily partition.

AWS Certified Data Engineer Associate DEA-C01

Data Ingestion and Transformation

Your Score:

Bash, the Crucial Exams Chat Bot

AI Bot

AWS Certified Data Engineer Associate DEA-C01 Practice Question

Answer Description

Ask Bash

What is EMRFS in AWS?

What does an AWS Glue crawler do?

How does writing directly to Amazon S3 minimize costs in this scenario?

Monthly

$19.99 $11.99

Billed monthly,
Cancel any time.

3 Month Pass

$44.99 $26.99

One time purchase of $26.99,
Does not auto-renew.

Annual Pass

$119.99 $71.99

One time purchase of $71.99,
Does not auto-renew.

Lifetime Pass

$189.99 $113.99

One time purchase,
Good for life.

All Exams

Unlimited Tests

Unlimited Questions

AI Tutor

Track scores

Report Cards

Voucher Discounts

Advanced PBQs

Included Exams

AWS Certified Data Engineer Associate DEA-C01 Practice Question

Report Issue

Answer Description

Ask Bash

What is EMRFS in AWS?

What does an AWS Glue crawler do?

How does writing directly to Amazon S3 minimize costs in this scenario?

Report Issue