AWS Certified Data Engineer Associate DEA-C01 Practice Question

An e-commerce company ingests clickstream events into an Amazon Kinesis data stream. A Lambda function, invoked through an event source mapping, converts each record from JSON to Parquet and immediately writes it to Amazon S3. After one day the bucket holds thousands of Parquet files smaller than 100 KB, which inflates Amazon Athena query costs. The team needs exactly one Parquet file per shard every 15 minutes while keeping the solution fully serverless and low-cost. Which approach meets these requirements?

  • Replace the event source mapping with an EventBridge rule that invokes the Lambda function every 15 minutes. In the function, use GetShardIterator and GetRecords to read each shard, aggregate the records, write one Parquet file per shard to S3, and store the last sequence number in DynamoDB for checkpointing.

  • Increase the event source mapping batch size to the maximum and use S3 multipart upload so each invocation appends new data to the same object key.

  • Create an AWS Glue streaming ETL job that reads from the Kinesis stream and writes partitioned Parquet files to S3 every 15 minutes.

  • Send the stream to an Amazon Kinesis Data Firehose delivery stream with Parquet format conversion enabled and a 15-minute buffering interval, eliminating the Lambda function.

Domain: Data Ingestion and Transformation
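
For illustration only, here is a minimal sketch of the scheduled polling-and-checkpoint pattern described in the first option. The stream name, bucket, DynamoDB table, and the to_parquet helper are all hypothetical placeholders, not part of the question; this is one plausible shape, not a reference implementation.

```python
# Hypothetical sketch: a Lambda handler triggered by an EventBridge schedule
# (e.g. rate(15 minutes)) that drains each shard once per run and checkpoints
# the last sequence number per shard in DynamoDB.
import boto3

STREAM = "clickstream"           # assumed stream name
CHECKPOINT_TABLE = "shard-checkpoints"  # assumed table, partition key: shard_id
BUCKET = "clickstream-parquet"   # assumed destination bucket

kinesis = boto3.client("kinesis")
s3 = boto3.client("s3")
checkpoints = boto3.resource("dynamodb").Table(CHECKPOINT_TABLE)

def handler(event, context):
    for shard in kinesis.list_shards(StreamName=STREAM)["Shards"]:
        shard_id = shard["ShardId"]
        item = checkpoints.get_item(Key={"shard_id": shard_id}).get("Item")
        if item:
            # Resume just past the last record written in the previous run.
            iterator = kinesis.get_shard_iterator(
                StreamName=STREAM, ShardId=shard_id,
                ShardIteratorType="AFTER_SEQUENCE_NUMBER",
                StartingSequenceNumber=item["sequence_number"],
            )["ShardIterator"]
        else:
            iterator = kinesis.get_shard_iterator(
                StreamName=STREAM, ShardId=shard_id,
                ShardIteratorType="TRIM_HORIZON",
            )["ShardIterator"]

        records, last_seq = [], None
        while iterator:
            resp = kinesis.get_records(ShardIterator=iterator, Limit=10000)
            records.extend(resp["Records"])
            if resp["Records"]:
                last_seq = resp["Records"][-1]["SequenceNumber"]
            if resp["MillisBehindLatest"] == 0:
                break  # caught up to the tip of the shard
            iterator = resp.get("NextShardIterator")

        if records:
            # to_parquet is an assumed helper that serializes the whole batch
            # into a single Parquet blob (e.g. via pyarrow).
            s3.put_object(
                Bucket=BUCKET,
                Key=f"{shard_id}/{context.aws_request_id}.parquet",
                Body=to_parquet(records),
            )
            checkpoints.put_item(
                Item={"shard_id": shard_id, "sequence_number": last_seq}
            )
```

Each run of this sketch emits at most one object per shard, which is the batching behavior the option describes.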
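Similarly, the buffering described in the last option is configured on the delivery stream itself. A hedged sketch follows; the stream name, ARNs, and the Glue Data Catalog database/table are placeholders.

```python
# Hypothetical sketch: a Firehose delivery stream that reads from the Kinesis
# stream, converts JSON to Parquet via a Glue schema, and flushes on the
# maximum 900-second (15-minute) buffering interval.
import boto3

firehose = boto3.client("firehose")
firehose.create_delivery_stream(
    DeliveryStreamName="clickstream-to-parquet",
    DeliveryStreamType="KinesisStreamAsSource",
    KinesisStreamSourceConfiguration={
        "KinesisStreamARN": "arn:aws:kinesis:us-east-1:111122223333:stream/clickstream",
        "RoleARN": "arn:aws:iam::111122223333:role/firehose-role",
    },
    ExtendedS3DestinationConfiguration={
        "RoleARN": "arn:aws:iam::111122223333:role/firehose-role",
        "BucketARN": "arn:aws:s3:::clickstream-parquet",
        # 900 s is the maximum interval; with format conversion enabled the
        # buffer size must be at least 64 MB.
        "BufferingHints": {"IntervalInSeconds": 900, "SizeInMBs": 128},
        "DataFormatConversionConfiguration": {
            "Enabled": True,
            "InputFormatConfiguration": {"Deserializer": {"OpenXJsonSerDe": {}}},
            "OutputFormatConfiguration": {"Serializer": {"ParquetSerDe": {}}},
            "SchemaConfiguration": {
                "DatabaseName": "clickstream_db",  # assumed Glue database
                "TableName": "events",             # assumed Glue table
                "RoleARN": "arn:aws:iam::111122223333:role/firehose-role",
            },
        },
    },
)
```

Note that Firehose flushes one object per buffer per delivery stream, not per source shard, which is worth weighing against the question's "one file per shard" requirement.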