AWS Certified Data Engineer Associate DEA-C01 Practice Question

An engineering team ingests sensor readings at 5 MB/s using Amazon Data Firehose, which converts JSON to Parquet and writes hourly objects to an Amazon S3 landing bucket. A recent defect required raw JSON from the last 72 hours, but those events were unavailable. The team must add a 7-day replay capability at the lowest ongoing cost and with minimal changes. Which solution meets these requirements?

  • Turn on S3 object versioning for the landing bucket so older Parquet files can be recovered during reprocessing.

  • Insert an Amazon SQS queue between Firehose and Amazon EMR, set the queue retention to 7 days, and configure EMR to poll the queue.

  • Replace Firehose with Amazon Kinesis Data Streams, configure 7-day retention, and create a Firehose stream that delivers the data to the same S3 bucket.

  • Enable Source record backup on the existing Firehose delivery stream to store all incoming records in a separate S3 prefix, and set a lifecycle policy to delete objects older than 7 days.

AWS Certified Data Engineer Associate DEA-C01
Data Ingestion and Transformation
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot