AWS Certified Data Engineer Associate DEA-C01 Practice Question

An e-commerce company ingests about 800 GB of product images and related JSON metadata each day. The data must be stored with 11 nines durability, read by Spark jobs on Amazon EMR, and later queried using Amazon Athena. The solution should scale automatically, require minimal administration, and cut storage costs because the images are seldom accessed after the first few days. Which AWS storage option best meets these requirements?

  • Store the images and metadata in an Amazon S3 bucket and apply an S3 Lifecycle rule that transitions objects to S3 Glacier Instant Retrieval after 30 days.

  • Save the images as binary attributes in an Amazon DynamoDB table and scan the table from Amazon EMR.

  • Mount an Amazon EFS One Zone-IA file system on the EMR cluster and place the images and metadata there.

  • Load the images and metadata into an Amazon Redshift RA3 cluster and query the data with Redshift Spectrum.

AWS Certified Data Engineer Associate DEA-C01
Data Store Management
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot