AWS Certified Data Engineer Associate DEA-C01 Practice Question
An e-commerce company ingests about 800 GB of product images and related JSON metadata each day. The data must be stored with 11 nines durability, read by Spark jobs on Amazon EMR, and later queried using Amazon Athena. The solution should scale automatically, require minimal administration, and cut storage costs because the images are seldom accessed after the first few days. Which AWS storage option best meets these requirements?
Store the images and metadata in an Amazon S3 bucket and apply an S3 Lifecycle rule that transitions objects to S3 Glacier Instant Retrieval after 30 days.
Save the images as binary attributes in an Amazon DynamoDB table and scan the table from Amazon EMR.
Mount an Amazon EFS One Zone-IA file system on the EMR cluster and place the images and metadata there.
Load the images and metadata into an Amazon Redshift RA3 cluster and query the data with Redshift Spectrum.
Amazon S3 is an object store designed for 99.999999999% (11 nines) durability. It scales without user intervention and is the native storage layer for both Amazon EMR (through EMRFS) and Amazon Athena. Because the images are seldom accessed after the first few days, an S3 Lifecycle rule can transition objects to a lower-cost storage class such as S3 Glacier Instant Retrieval, which reduces storage cost while still allowing millisecond retrieval for occasional access.
Amazon EFS provides POSIX file storage, but Athena cannot query it directly, and it is generally more expensive than S3 for large, rarely accessed datasets. Loading binary images into Amazon Redshift is inefficient and costly because Redshift is optimized for structured, columnar data, not large unstructured files; Redshift Spectrum itself queries data that resides in S3. DynamoDB limits each item to 400 KB, so it cannot hold multi-megabyte images, and the table would still require another service before Athena could query it. Therefore, storing the data in Amazon S3 with an appropriate Lifecycle policy is the most suitable choice.
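As an illustration of the Lifecycle rule described in the correct option, the following is a minimal sketch of the rule as a Python dictionary, in the shape that boto3's put_bucket_lifecycle_configuration accepts. The rule ID, the empty prefix (meaning the rule covers every object in the bucket), and the bucket name in the comment are assumptions made for illustration only; they do not come from the question.

```python
import json


def build_lifecycle_config(days=30, prefix=""):
    """Return an S3 Lifecycle configuration that moves objects to
    S3 Glacier Instant Retrieval after `days` days.

    The rule ID and the default empty prefix (match all objects)
    are hypothetical choices for this sketch.
    """
    return {
        "Rules": [
            {
                "ID": "transition-to-glacier-ir",  # hypothetical rule name
                "Status": "Enabled",
                "Filter": {"Prefix": prefix},
                "Transitions": [
                    # GLACIER_IR is the storage class value for
                    # S3 Glacier Instant Retrieval.
                    {"Days": days, "StorageClass": "GLACIER_IR"}
                ],
            }
        ]
    }


if __name__ == "__main__":
    config = build_lifecycle_config()
    print(json.dumps(config, indent=2))

    # Applying the rule needs boto3 and AWS credentials; the bucket
    # name below is a placeholder, not a real resource:
    #
    #   import boto3
    #   boto3.client("s3").put_bucket_lifecycle_configuration(
    #       Bucket="example-product-images",
    #       LifecycleConfiguration=config,
    #   )
```

The EMR Spark jobs and Athena queries keep using the same S3 paths after the transition, since a Lifecycle transition changes only the storage class of an object, not its key.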
Exam: AWS Certified Data Engineer Associate DEA-C01
Domain: Data Store Management