AWS Certified Data Engineer Associate DEA-C01 Practice Question

A company loads 2 TB of time-series sensor events into an Amazon Redshift table every day by appending new rows. Business intelligence dashboards filter on event_date for the most recent 7 days and aggregate results by device_id, joining to a small device metadata table. The current heap table causes long scan times. Which schema change will most effectively reduce dashboard latency without adding load-time complexity?

  • Create an external Redshift Spectrum table partitioned by event_date and load new partitions daily.

  • Convert the table to a compound sort key of event_date, device_id and use device_id as the DISTKEY.

  • Leave the heap layout unchanged but schedule VACUUM and ANALYZE to run after each daily load.

  • Change the table to EVEN distribution and define an INTERLEAVED sort key on device_id, event_date.

AWS Certified Data Engineer Associate DEA-C01
Data Store Management
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot