AWS Certified Data Engineer Associate DEA-C01 Practice Question

A data engineering team receives a 5-TB JSON file in an S3 bucket each day. They must flatten nested objects, convert the data to partitioned Parquet, and make it queryable in Athena within two hours. The team wants a fully managed, serverless solution and prefers to avoid provisioning persistent clusters. Which approach meets these requirements most cost-effectively?

  • Build an Amazon Kinesis Data Analytics for Apache Flink application that uses the Amazon S3 connector to process the file and output Parquet data to S3.

  • Run an Amazon Athena CTAS statement that reads the JSON file and writes the result as partitioned Parquet objects to a separate S3 location.

  • Spin up an on-demand Amazon EMR cluster with Apache Spark each day, run a Spark transformation job, and terminate the cluster after the job finishes.

  • Create an AWS Glue Spark ETL job with job bookmarks enabled that reads the JSON file, flattens the data, writes partitioned Parquet back to S3, and updates the Glue Data Catalog.
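The core transformation the question describes, flattening nested JSON objects into top-level columns before writing Parquet, can be sketched in plain Python. This is only an illustration of the idea; at 5-TB scale a Glue Spark ETL job would perform the equivalent work in a distributed fashion (the `event` record below is a made-up example):

```python
# Minimal sketch of the "flatten nested objects" step in plain Python.
# A Glue Spark job would do this at scale; the core idea is the same:
# recursively hoist nested keys into dotted top-level columns, which
# then map cleanly onto Parquet's columnar layout.
def flatten(record, parent_key="", sep="."):
    """Flatten a nested dict into a single-level dict with dotted keys."""
    items = {}
    for key, value in record.items():
        new_key = f"{parent_key}{sep}{key}" if parent_key else key
        if isinstance(value, dict):
            # Recurse into nested objects, carrying the key prefix along.
            items.update(flatten(value, new_key, sep=sep))
        else:
            items[new_key] = value
    return items

# Hypothetical input record for illustration.
event = {"user": {"id": 7, "geo": {"country": "DE"}}, "amount": 12.5}
print(flatten(event))
# → {'user.id': 7, 'user.geo.country': 'DE', 'amount': 12.5}
```

After flattening, each dotted key becomes a Parquet column, and a partition key (for example, an event date) determines the S3 prefix the row is written under.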

Domain: Data Ingestion and Transformation