AWS Certified Data Engineer Associate DEA-C01 Practice Question
A retail company is moving its 20 TB on-premises data warehouse to AWS. For the next three months, analysts must continue running ad-hoc SQL reports while historical data is transferred in batches over a low-bandwidth VPN. Management wants to minimize cost during the migration and avoid rewriting existing SQL queries. Which data storage approach best meets the migration requirements?
Replicate the tables into Amazon DynamoDB and use PartiQL to run the analysts' existing SQL reports.
Spin up an Amazon EMR cluster, copy the data to HDFS, and expose the data through Apache Hive for analysts to query.
Load the data into Amazon Aurora PostgreSQL Serverless v2 and use Amazon Athena federated queries to analyze it until cutover.
Store exported tables in Amazon S3 as compressed Parquet files and create external tables in Amazon Redshift Spectrum so a small Redshift cluster can query the data during the migration.
The best fit is to store the exported tables in Amazon S3 as compressed Parquet files and query them through Amazon Redshift Spectrum. Staging the exports in S3 keeps storage costs low and accommodates incremental batch uploads over a constrained network. Writing the data as compressed Parquet further reduces transfer size and, because Parquet is a columnar format, lets queries scan only the columns they need. With external tables defined, analysts run familiar SQL through Redshift Spectrum while the data itself stays in S3. A minimally sized Redshift cluster needs to keep only metadata rather than the 20 TB of data, so compute cost stays limited until the final cutover. The alternatives fall short: DynamoDB is a non-analytic store that lacks full SQL support, Aurora would incur higher storage and instance costs for 20 TB, and maintaining an EMR cluster for several months adds unnecessary operational and infrastructure expense.
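To make the external-table step concrete, here is a minimal sketch in Python that only builds the two Redshift DDL statements involved: one that maps an external schema to a data catalog database, and one that defines an external table over Parquet files in S3. The schema name, catalog database, IAM role ARN, bucket path, and column list are all hypothetical placeholders, not values from the question, and the helper functions are illustrative rather than part of any AWS SDK.

```python
from typing import List, Tuple


def external_schema_ddl(schema: str, catalog_db: str, iam_role_arn: str) -> str:
    """Build a CREATE EXTERNAL SCHEMA statement backed by a data catalog database."""
    return (
        f"CREATE EXTERNAL SCHEMA IF NOT EXISTS {schema}\n"
        f"FROM DATA CATALOG\n"
        f"DATABASE '{catalog_db}'\n"
        f"IAM_ROLE '{iam_role_arn}'\n"
        f"CREATE EXTERNAL DATABASE IF NOT EXISTS;"
    )


def external_table_ddl(
    schema: str,
    table: str,
    columns: List[Tuple[str, str]],
    s3_location: str,
) -> str:
    """Build a CREATE EXTERNAL TABLE statement for Parquet files under an S3 prefix."""
    column_lines = ",\n".join(f"    {name} {dtype}" for name, dtype in columns)
    return (
        f"CREATE EXTERNAL TABLE {schema}.{table} (\n"
        f"{column_lines}\n"
        f")\n"
        f"STORED AS PARQUET\n"
        f"LOCATION '{s3_location}';"
    )


if __name__ == "__main__":
    # All identifiers below are hypothetical examples.
    print(external_schema_ddl(
        "migration_ext",
        "warehouse_staging",
        "arn:aws:iam::123456789012:role/ExampleSpectrumRole",
    ))
    print()
    print(external_table_ddl(
        "migration_ext",
        "sales",
        [("sale_id", "BIGINT"), ("sale_date", "DATE"), ("amount", "DECIMAL(10,2)")],
        "s3://example-migration-bucket/sales/",
    ))
```

As each batch of Parquet files lands under the table's S3 prefix, the same external table can pick it up without reloading data into the cluster, which is why this pattern suits an incremental transfer. The generated statements would still need to be run against the cluster by whatever SQL client the team already uses.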
Exam domain: Data Store Management