AWS Certified Data Engineer Associate DEA-C01 Practice Question
An Amazon Redshift cluster holds several 4 TB fact tables. Each night the data team must copy only the rows inserted in the last 24 hours to Amazon S3 as partitioned Parquet files for analytics. They want minimal custom code, no full-table exports, and a cluster that stays free for daytime queries. Which solution delivers the required nightly extract with the least operational effort?
Invoke an AWS Lambda function nightly from Amazon EventBridge; the function runs UNLOAD commands with the filter WHERE last_update_time > current_date - 1 and copies the resulting files to S3.
Run an AWS Glue crawler each night against the Redshift cluster and configure it to export newly discovered partitions as Parquet files in S3.
Create an AWS Glue ETL job that reads the Redshift tables with a JDBC connection, enable job bookmarks on the last_update_time column, write the output to Amazon S3 as Parquet, and schedule the job nightly with Amazon EventBridge.
Use AWS Database Migration Service to perform a full load and continuous replication from Redshift to an S3 target endpoint configured for Parquet output.
The AWS Glue ETL job with job bookmarks is the correct choice. A Glue ETL job can read from Amazon Redshift through a JDBC connection and write directly to Amazon S3 in Parquet format. When job bookmarks are enabled, Glue tracks the highest value it has processed in the bookmark key column, which defaults to the primary key or can be a user-specified column such as last_update_time, provided its values increase or decrease monotonically. On subsequent scheduled runs, Glue automatically filters the query so that Redshift returns only rows added since the previous extraction, which reduces cluster load and network traffic. Scheduling the job with Amazon EventBridge keeps the solution serverless and avoids custom scheduling or state-tracking code.
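The bookmark behavior described above amounts to a high-water-mark pattern. The sketch below is not Glue's implementation; it only simulates the idea in plain Python, with an in-memory SQLite table standing in for Redshift. The table name `fact_sales`, the `state` dictionary, and the `extract_new_rows` helper are all hypothetical names chosen for illustration.

```python
import sqlite3


def extract_new_rows(conn, state, table="fact_sales", key="last_update_time"):
    """Return rows whose key exceeds the stored high-water mark, then advance it."""
    mark = state.get(table)  # None on the very first run
    if mark is None:
        # First run: nothing has been processed yet, so read everything.
        rows = conn.execute(
            f"SELECT id, {key} FROM {table} ORDER BY {key}"
        ).fetchall()
    else:
        # Later runs: push the filter down so only newer rows are returned.
        rows = conn.execute(
            f"SELECT id, {key} FROM {table} WHERE {key} > ? ORDER BY {key}",
            (mark,),
        ).fetchall()
    if rows:
        state[table] = rows[-1][1]  # highest key value seen so far
    return rows


# Stand-in for the Redshift fact table (assumption: ISO-8601 text timestamps).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE fact_sales (id INTEGER, last_update_time TEXT)")
conn.executemany(
    "INSERT INTO fact_sales VALUES (?, ?)",
    [(1, "2024-01-01T10:00:00"), (2, "2024-01-01T18:30:00")],
)

state = {}  # stand-in for the bookmark state that Glue persists between runs
first_run = extract_new_rows(conn, state)

conn.execute("INSERT INTO fact_sales VALUES (3, '2024-01-02T09:15:00')")
second_run = extract_new_rows(conn, state)  # only the row added after run 1
third_run = extract_new_rows(conn, state)   # nothing new, so empty
```

In a real Glue job this bookkeeping is handled by the service once bookmarks are enabled, which is why the option needs so little custom code.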
AWS DMS cannot perform ongoing replication (CDC) from Redshift, so it cannot provide incremental nightly extracts, and the full load that option calls for conflicts with the requirement to avoid full-table exports. A Lambda function that issues UNLOAD statements still requires custom code and manual tracking of what has already been exported. A Glue crawler builds catalog metadata but does not transfer data to S3 by itself.
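To make the "custom code and manual state tracking" point concrete, here is a rough sketch of the statement a Lambda function would have to build for the UNLOAD option. The helper name `build_unload_sql`, the table, bucket, role ARN, and the `event_date` partition column are hypothetical; the function only builds the SQL text and does not connect to Redshift.

```python
from datetime import datetime


def build_unload_sql(table, since, s3_prefix, iam_role_arn, partition_col):
    """Build a Redshift UNLOAD statement for rows updated after `since`."""
    ts = since.strftime("%Y-%m-%d %H:%M:%S")
    # Single quotes inside the quoted UNLOAD query text must be doubled.
    query = f"SELECT * FROM {table} WHERE last_update_time > ''{ts}''"
    return (
        f"UNLOAD ('{query}') "
        f"TO '{s3_prefix}' "
        f"IAM_ROLE '{iam_role_arn}' "
        f"FORMAT AS PARQUET "
        f"PARTITION BY ({partition_col})"
    )


# The caller must load and save `since` itself between runs (for example in a
# small state store); that is the manual state tracking the explanation means.
sql = build_unload_sql(
    table="fact_sales",                                   # hypothetical table
    since=datetime(2024, 1, 1),                           # last exported point
    s3_prefix="s3://example-bucket/fact_sales/",          # hypothetical bucket
    iam_role_arn="arn:aws:iam::123456789012:role/ExampleUnloadRole",
    partition_col="event_date",                           # hypothetical column
)
```

Even in this small form, the function owns the timestamp filter, the quoting rules, and the persistence of the last exported point, all of which the Glue bookmark option avoids.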
Domain: Data Ingestion and Transformation