AWS Certified Data Engineer Associate DEA-C01 Practice Question

A company loads 2 TB of time-series sensor events into an Amazon Redshift table every day by appending new rows. Business intelligence dashboards filter on event_date for the most recent 7 days and aggregate results by device_id, joining to a small device metadata table. The current heap table causes long scan times. Which schema change will most effectively reduce dashboard latency without adding load-time complexity?

Convert the table to a compound sort key of event_date, device_id and use device_id as the DISTKEY.
Leave the heap layout unchanged but schedule VACUUM and ANALYZE to run after each daily load.
Change the table to EVEN distribution and define an INTERLEAVED sort key on device_id, event_date.
Create an external Redshift Spectrum table partitioned by event_date and load new partitions daily.

AWS Certified Data Engineer Associate DEA-C01

Data Store Management

Your Score:

Bash, the Crucial Exams Chat Bot

AI Bot

AWS Certified Data Engineer Associate DEA-C01 Practice Question

Answer Description

Ask Bash

Why is a compound sort key better for range filtering compared to an interleaved sort key in Amazon Redshift?

What is the role of a DISTKEY in Amazon Redshift, and why is device_id suitable here?

Why can't Redshift tables be partitioned like other databases, and what alternatives are provided?

Monthly

$19.99 $11.99

Billed monthly,
Cancel any time.

3 Month Pass

$44.99 $26.99

One time purchase of $26.99,
Does not auto-renew.

Annual Pass

$119.99 $71.99

One time purchase of $71.99,
Does not auto-renew.

Lifetime Pass

$189.99 $113.99

One time purchase,
Good for life.

All Exams

Unlimited Tests

Unlimited Questions

AI Tutor

Track scores

Report Cards

Voucher Discounts

Advanced PBQs

Included Exams

AWS Certified Data Engineer Associate DEA-C01 Practice Question

Report Issue

Answer Description

Ask Bash

Why is a compound sort key better for range filtering compared to an interleaved sort key in Amazon Redshift?

What is the role of a DISTKEY in Amazon Redshift, and why is device_id suitable here?

Why can't Redshift tables be partitioned like other databases, and what alternatives are provided?

Report Issue