AWS Certified Data Engineer Associate DEA-C01 Practice Question
A retail company is migrating its on-premises data warehouse to Amazon Redshift. Analysts typically run star-schema queries that filter on the last 90 days of sales and join a 3-billion-row fact table with small calendar and store dimension tables. The fact table is bulk-loaded nightly with about 5 million new rows. Which table design best balances query performance and load efficiency in Amazon Redshift?
Define the fact table with an interleaved sort key on sale_date and store_id and DISTSTYLE EVEN; keep all dimension tables at DISTSTYLE EVEN.
Define the fact table with a compound sort key (sale_date, store_id) and DISTKEY store_id; set the calendar and store dimensions to DISTSTYLE ALL.
Define the fact table without a sort key and use DISTSTYLE AUTO; keep the dimension tables at their default distribution style.
Define the fact table with a compound sort key (store_id, sale_date) and DISTKEY sale_date; set the dimension tables to DISTSTYLE KEY on sale_date.
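The second option is correct: a compound sort key (sale_date, store_id) with DISTKEY store_id on the fact table, and DISTSTYLE ALL on the calendar and store dimensions. Frequent queries restrict the fact table by sale_date and join it to the store and calendar dimensions on store_id and date surrogate keys. Making sale_date the leading column of a compound sort key lets Redshift use its per-block minimum and maximum values (zone maps) to skip blocks older than the 90-day window, which speeds up the time-bound scans. Distributing on store_id keeps all rows for a given store on the same slice, so joins and aggregations on store_id need no redistribution. Because the dimension tables are small, DISTSTYLE ALL copies them to every node, so joins to them run locally with no data movement. The nightly COPY only appends recent data, which arrives roughly in sale_date order, so the load largely preserves the sort order and is not slowed by this design. The other options either distribute every table evenly so that joins require data movement, give up sort-key pruning on sale_date (no sort key, or store_id as the leading sort column), or distribute on sale_date, which sends each night's load to a single slice and leaves the store dimension without a usable distribution column.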
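The design described above can be sketched as Redshift DDL. This is a minimal illustration only: the table names (sales_fact, store_dim, calendar_dim) and every column other than sale_date and store_id are hypothetical, since the question does not give the actual schema.

```sql
-- Small dimensions: a full copy on every node, so joins to them stay local.
-- Table and column names here are assumptions made for illustration.
CREATE TABLE store_dim (
    store_id    INTEGER      NOT NULL,
    store_name  VARCHAR(100)
)
DISTSTYLE ALL;

CREATE TABLE calendar_dim (
    calendar_date  DATE      NOT NULL,
    fiscal_year    SMALLINT,
    fiscal_quarter SMALLINT
)
DISTSTYLE ALL;

-- Fact table: rows are hashed across slices by store_id and stored
-- in (sale_date, store_id) order so date-range filters can skip blocks.
CREATE TABLE sales_fact (
    sale_id    BIGINT        NOT NULL,
    sale_date  DATE          NOT NULL,
    store_id   INTEGER       NOT NULL,
    amount     DECIMAL(12,2)
)
DISTSTYLE KEY
DISTKEY (store_id)
COMPOUND SORTKEY (sale_date, store_id);

-- A typical analyst query: the sale_date predicate on the leading
-- sort column limits the scan to roughly the last 90 days of blocks.
SELECT s.store_name,
       SUM(f.amount) AS total_sales
FROM sales_fact f
JOIN store_dim s ON s.store_id = f.store_id
WHERE f.sale_date >= DATEADD(day, -90, CURRENT_DATE)
GROUP BY s.store_name;
```

Putting sale_date first in the compound sort key matters here because a compound key prunes best on its leading column; with store_id first, the same date filter would touch blocks for every store.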
Data Store Management