AWS Certified Data Engineer Associate DEA-C01 Practice Question

Your analytics team queries clickstream events that are written as Parquet files to an Amazon S3 data lake. An AWS Glue Data Catalog table and an Amazon Redshift Spectrum external table reference the dataset, which is partitioned by year, month, and day. A new business requirement adds the string column user_country to every new event record; historical Parquet files will not be backfilled. You must expose the new column to analysts without interrupting existing workloads, and older partitions should continue to return NULL for the column. Which action meets these requirements with the least disruption?

  • Run ALTER TABLE <external_table> REPLACE COLUMNS (...) specifying the full updated column list so that the table definition is replaced.

  • Create a new Glue table and Spectrum external table that include the user_country column, and instruct analysts to switch their queries to the new tables.

  • Issue ALTER TABLE <external_table> ADD COLUMN user_country varchar(2); from Amazon Redshift, allowing Redshift Spectrum to update the Glue table and return NULL for the column in older partitions.

  • Drop and recreate the Glue and Redshift Spectrum tables with the new schema after all historical Parquet files are backfilled.
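For reference, the ADD COLUMN syntax mentioned in the options can be sketched as follows (the external schema and table names are hypothetical placeholders; substitute your own):

```sql
-- Hypothetical names: spectrum_schema is an external schema mapped to the
-- Glue Data Catalog, clickstream_events is the Spectrum external table.
-- Adding the column updates the table definition in the Glue Data Catalog;
-- Parquet files in older partitions that lack the column return NULL for it.
ALTER TABLE spectrum_schema.clickstream_events
    ADD COLUMN user_country VARCHAR(2);
```

Because Parquet resolves columns by name, files written before the schema change simply have no user_country field, and Redshift Spectrum surfaces NULL for those rows without any rewrite of historical data.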

Domain: Data Store Management