AWS Certified Data Engineer Associate DEA-C01 Practice Question

A startup collects 500 GB per day of IoT sensor readings in nested JSON and keeps relational product metadata in Amazon Redshift. Cost control rules out continuously loading the raw sensor files into Redshift. Analysts need to join the sensor data with the product tables and run ad-hoc queries that must stay performant as the sensor schema evolves. Which approach best models these structured and semi-structured datasets to meet the requirements?

Persist the sensor data in Amazon DynamoDB and access it from Redshift with federated queries while maintaining the product metadata in DynamoDB global tables.
Keep the sensor files as raw JSON objects in S3, query them with Amazon Athena, and export nightly query results to Redshift for joins with the product tables.
Store sensor data as compressed Parquet files in Amazon S3, register the files with the AWS Glue Data Catalog, and query them from Redshift using Spectrum while keeping product metadata in native Redshift tables.
Load both the JSON sensor data and the product metadata into Redshift tables that use the SUPER data type and late-binding views to handle nested attributes.

AWS Certified Data Engineer Associate DEA-C01

Data Store Management

Your Score:

Bash, the Crucial Exams Chat Bot

AI Bot

AWS Certified Data Engineer Associate DEA-C01 Practice Question

Answer Description

Ask Bash

What is Redshift Spectrum and how does it work?

Why is Parquet preferred for storing sensor data in this solution?

What role does AWS Glue play in the proposed solution?

How does Redshift Spectrum work?

Why is Parquet format preferred for storing sensor data?

What is schema evolution and how does Spectrum handle it?

Monthly

$19.99

Billed monthly,
Cancel any time.

3 Month Pass

$44.99

One time purchase of $44.99,
Does not auto-renew.

Annual Pass

$119.99

One time purchase of $119.99,
Does not auto-renew.

Lifetime Pass

$189.99

One time purchase,
Good for life.

All Exams

Unlimited Tests

Unlimited Questions

AI Tutor

Track scores

Report Cards

Voucher Discounts

Advanced PBQs

Included Exams

AWS Certified Data Engineer Associate DEA-C01 Practice Question

Report Issue

Answer Description

Ask Bash

What is Redshift Spectrum and how does it work?

Why is Parquet preferred for storing sensor data in this solution?

What role does AWS Glue play in the proposed solution?

How does Redshift Spectrum work?

Why is Parquet format preferred for storing sensor data?

What is schema evolution and how does Spectrum handle it?

Report Issue