AWS Certified Data Engineer Associate DEA-C01 Practice Question
You run an AWS Glue 3.0 Spark job written in Python that reads 50,000 gzip-compressed JSON files (about 100 KB each) from one Amazon S3 prefix, transforms the data, and writes Parquet files back to S3. The job uses the default 10 G.1X DPUs and currently completes in eight hours while average CPU utilization stays under 30 percent. Which modification will most improve performance without increasing cost?
Write the Parquet output with the Zstandard compression codec to shrink the file sizes.
Add --conf spark.executor.memory=16g to the job parameters to increase executor heap size.
Enable AWS Glue job bookmarking so previously processed files are skipped.
Use create_dynamic_frame_from_options with connection_options {"groupFiles": "inPartition", "groupSize": "134217728"} so Glue combines many small objects before processing.
Grouping the small files on read is the correct choice. When a Spark job must open and schedule tens of thousands of very small objects, task-startup overhead, network calls, and driver pressure dominate the run time even though CPU usage is low. AWS Glue lets you reduce that overhead by grouping files as they are read. Setting the S3 connection option "groupFiles" to "inPartition" and specifying an appropriate "groupSize" causes Glue to combine many small objects into larger logical partitions before they reach the executors, decreasing the number of tasks that must be scheduled and letting each task do more useful work. Because this change does not request additional DPUs, cost remains the same.
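As a minimal sketch, the grouped read might look like the following in a Glue script. The bucket path is hypothetical, and the helper that builds the options is purely illustrative; the key detail is that "groupSize" is passed as a string of bytes (134217728 bytes = 128 MiB, the value from the correct option).

```python
# Build the S3 connection_options that tell AWS Glue to coalesce many
# small objects into larger logical groups before tasks are scheduled.
# Hypothetical helper for illustration; groupSize is a byte count as a string.

def grouping_options(group_size_mib: int = 128) -> dict:
    """Return connection_options for a grouped S3 read in AWS Glue."""
    return {
        "paths": ["s3://my-bucket/json-input/"],  # hypothetical prefix
        "groupFiles": "inPartition",
        "groupSize": str(group_size_mib * 1024 * 1024),  # Glue expects a string
    }

# In the Glue 3.0 job, these options would be passed to the reader, e.g.:
#   dyf = glueContext.create_dynamic_frame_from_options(
#       connection_type="s3",
#       connection_options=grouping_options(),
#       format="json",
#   )

print(grouping_options()["groupSize"])
```

With the default of 128 MiB, the printed value is "134217728", matching the groupSize in the answer choice.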
Increasing executor memory does not address the task-scheduling overhead that is the primary bottleneck, and AWS Glue sets executor memory according to the worker type (G.1X, G.2X, and so on) rather than honoring an arbitrary --conf override.
Changing the Parquet compression codec affects the write phase, not the excessive read-side task creation.
Job bookmarking only helps skip files that were processed in earlier runs; it does not speed up processing of the current data set.
Exam domain: Data Ingestion and Transformation