AWS Certified Data Engineer Associate DEA-C01 Practice Question

A retailer captures clickstream events in an existing Amazon Kinesis Data Stream. A data engineer must continuously land these events in an S3-based data lake as Apache Parquet files that are partitioned by event_date. The solution must require the least custom code, tolerate evolving event schemas without breaking downstream queries, and provide at-least-once processing guarantees. Which approach meets these requirements?

Create an AWS Glue streaming ETL job that reads from the Kinesis data stream, enables job bookmarks, and writes DynamicFrames in Parquet format partitioned by event_date to Amazon S3.
Configure an Amazon Kinesis Data Firehose delivery stream with an AWS Lambda transformation that converts records to Parquet and uses dynamic partitioning to deliver data to Amazon S3.
Attach an AWS Lambda function to the Kinesis stream that batches 100 records, converts them to Parquet, and uploads each batch to Amazon S3.
Develop a Spark Structured Streaming application on an always-on Amazon EMR cluster that consumes the Kinesis stream and writes partitioned Parquet files to Amazon S3.

AWS Certified Data Engineer Associate DEA-C01

Data Ingestion and Transformation

Your Score:

Bash, the Crucial Exams Chat Bot

AI Bot

AWS Certified Data Engineer Associate DEA-C01 Practice Question

Answer Description

Ask Bash

What are AWS Glue DynamicFrames and how do they handle schema evolution?

How does enabling job bookmarks in AWS Glue ensure at-least-once processing?

Why is Apache Parquet a preferred format for storing data in S3-based data lakes?

Monthly

$19.99 $11.99

Billed monthly,
Cancel any time.

3 Month Pass

$44.99 $26.99

One time purchase of $26.99,
Does not auto-renew.

Annual Pass

$119.99 $71.99

One time purchase of $71.99,
Does not auto-renew.

Lifetime Pass

$189.99 $113.99

One time purchase,
Good for life.

All Exams

Unlimited Tests

Unlimited Questions

AI Tutor

Track scores

Report Cards

Voucher Discounts

Advanced PBQs

Included Exams

AWS Certified Data Engineer Associate DEA-C01 Practice Question

Report Issue

Answer Description

Ask Bash

What are AWS Glue DynamicFrames and how do they handle schema evolution?

How does enabling job bookmarks in AWS Glue ensure at-least-once processing?

Why is Apache Parquet a preferred format for storing data in S3-based data lakes?

Report Issue