AWS Certified Data Engineer Associate DEA-C01 Practice Question

An e-commerce company stores 5 GB of JSON clickstream records in an Amazon S3 prefix each day. The analytics team must convert the data to compressed Parquet, partition the output by event_date, and automatically adapt when new fields appear. The transformation must finish within one hour and require the least ongoing operational effort. Which solution meets these requirements?

  • Spin up an on-demand Amazon EMR cluster daily, run a PySpark script to convert the JSON to Parquet partitions, and terminate the cluster when the job completes.

  • Configure an Amazon S3 event to invoke an AWS Lambda function that processes each object and writes the transformed data to Parquet partitions in another bucket.

  • Create an AWS Glue Spark ETL job that reads the JSON data into a DynamicFrame and writes compressed, event_date-partitioned Parquet files back to Amazon S3.

  • Define an external table on the JSON files in Amazon Redshift Spectrum and schedule an hourly CTAS query that writes Parquet partitions to a different S3 prefix.
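For context, a minimal sketch of the AWS Glue DynamicFrame approach described in the third option is shown below. The bucket names, prefixes, and the choice of Snappy compression are illustrative assumptions, not details given in the question; DynamicFrames infer the schema at read time, which is what lets the job tolerate newly appearing fields without code changes.

```python
import sys

from awsglue.context import GlueContext
from awsglue.job import Job
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext

# Standard Glue job bootstrap; JOB_NAME is supplied by the Glue service.
args = getResolvedOptions(sys.argv, ["JOB_NAME"])
glue_context = GlueContext(SparkContext())
job = Job(glue_context)
job.init(args["JOB_NAME"], args)

# Read the raw JSON clickstream records into a DynamicFrame.
# Schema is inferred on read, so new fields are picked up automatically.
# The S3 paths here are hypothetical placeholders.
clicks = glue_context.create_dynamic_frame.from_options(
    connection_type="s3",
    connection_options={"paths": ["s3://example-clickstream-raw/daily/"]},
    format="json",
)

# Write compressed Parquet back to S3, partitioned by event_date.
glue_context.write_dynamic_frame.from_options(
    frame=clicks,
    connection_type="s3",
    connection_options={
        "path": "s3://example-clickstream-curated/parquet/",
        "partitionKeys": ["event_date"],
    },
    format="parquet",
    format_options={"compression": "snappy"},
)

job.commit()
```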
