AWS Certified Data Engineer Associate DEA-C01 Practice Question
A startup collects 500 GB per day of IoT sensor readings in nested JSON and keeps relational product metadata in Amazon Redshift. Cost control rules out continuously loading the raw sensor files into Redshift. Analysts need to join the sensor data with the product tables and run ad-hoc queries that must stay performant as the sensor schema evolves. Which approach best models these structured and semi-structured datasets to meet the requirements?
Store sensor data as compressed Parquet files in Amazon S3, register the files with the AWS Glue Data Catalog, and query them from Redshift using Spectrum while keeping product metadata in native Redshift tables.
Keep the sensor files as raw JSON objects in S3, query them with Amazon Athena, and export nightly query results to Redshift for joins with the product tables.
Load both the JSON sensor data and the product metadata into Redshift tables that use the SUPER data type and late-binding views to handle nested attributes.
Persist the sensor data in Amazon DynamoDB and access it from Redshift with federated queries while maintaining the product metadata in DynamoDB global tables.
Keeping the product metadata as normal Redshift tables provides fast, optimized storage for structured data. Converting the sensor JSON to columnar Parquet on Amazon S3, cataloging it with AWS Glue, and exposing it to the cluster with Redshift Spectrum lets analysts join the external sensor data with internal tables without ingesting the raw files. Parquet's columnar layout, compression, and predicate pushdown lower scan costs and improve performance, while Spectrum automatically picks up new columns that are added to the Parquet files, satisfying the schema-evolution requirement. The other options either force all data into Redshift (raising cost and reducing flexibility), rely on services that cannot be directly joined from Redshift, or introduce extra extract-load steps that delay analysts and increase operational overhead.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is Redshift Spectrum and how does it work?
Open an interactive chat with Bash
Why is Parquet preferred for storing sensor data in this solution?
Open an interactive chat with Bash
What role does AWS Glue play in the proposed solution?
Open an interactive chat with Bash
How does Redshift Spectrum work?
Open an interactive chat with Bash
Why is Parquet format preferred for storing sensor data?
Open an interactive chat with Bash
What is schema evolution and how does Spectrum handle it?
Open an interactive chat with Bash
AWS Certified Data Engineer Associate DEA-C01
Data Store Management
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .