AWS Certified Data Engineer Associate DEA-C01 Practice Question
Your company stores clickstream and purchase data as partitioned Parquet files in separate Amazon S3 buckets. Analysts run dozens of ad-hoc Athena queries that join these datasets to report on daily revenue by campaign. You must reduce query costs and simplify the SQL for analysts, while using only the Athena service. Which approach meets these requirements?
Create an Athena materialized view that performs the join and daily aggregation, and schedule a daily REFRESH MATERIALIZED VIEW command.
Schedule a daily CREATE TABLE AS SELECT (CTAS) query to create a new table with the joined and aggregated data, stored in a partitioned, columnar format.
Create a standard Athena view over the two tables so analysts only need to query the view.
Run a nightly CTAS query that writes the joined results as a single unpartitioned CSV file, then have analysts query that file.
Because Amazon Athena is priced based on data scanned, the most effective way to reduce costs for repeated queries is to pre-compute the results. Since Athena does not support materialized views, the correct pattern is to use a 'CREATE TABLE AS SELECT' (CTAS) query to perform the join and aggregation once. Scheduling this CTAS query to run daily (for example, with Amazon EventBridge and AWS Lambda) and writing the output to a new table in a partitioned, columnar format like Parquet dramatically reduces the data scanned by subsequent analyst queries. A standard view simplifies SQL but re-runs the expensive join each time, scanning the full source tables and not reducing cost. Using CTAS to create a single, unpartitioned CSV file is inefficient because CSV is not columnar and a full table scan would be required for every query. Loading the data into Amazon Redshift violates the constraint to remain within the Athena service.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is a CTAS query in Athena?
Open an interactive chat with Bash
Why is Parquet preferred for storing data in Athena?
Open an interactive chat with Bash
How does scheduling a CTAS query improve efficiency in Athena?
Open an interactive chat with Bash
AWS Certified Data Engineer Associate DEA-C01
Data Operations and Support
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .