AWS Certified Data Engineer Associate DEA-C01 Practice Question

An analytics team runs ad-hoc queries from an Amazon Redshift cluster against 15 TB of application logs in Amazon S3 by using Redshift Spectrum. The logs are gzip-compressed CSV files stored under a single prefix. Queries are slow and incur high data-scanned charges. The team cannot load the data into Redshift but can transform the S3 data once. Which change will most effectively improve performance and reduce Spectrum cost?

  • Re-compress the CSV files with bzip2 to achieve a higher compression ratio before running Spectrum queries.

  • Enable Amazon Redshift Concurrency Scaling and increase the number of query slots in the WLM configuration.

  • Create an Amazon Redshift materialized view that references the existing CSV files through a Spectrum manifest file.

  • Convert the log files to Parquet and partition the dataset (for example by date), then recreate the external table to reference the new location.
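For illustration only, the date-partitioned layout described in the last option could be sketched as follows. The bucket name, table name, and the partition column `dt` are hypothetical and do not come from the question; the helper only builds strings (a Hive-style S3 prefix and an `ALTER TABLE ... ADD PARTITION` statement for an external table) and does not connect to AWS.

```python
from datetime import date


def partition_prefix(base: str, day: date) -> str:
    """Return a Hive-style S3 prefix for one day's files (dt=YYYY-MM-DD)."""
    return f"{base.rstrip('/')}/dt={day.isoformat()}/"


def add_partition_sql(table: str, base: str, day: date) -> str:
    """Build the statement that registers one day's prefix as a partition.

    The table and column names are placeholders chosen for this sketch.
    """
    return (
        f"ALTER TABLE {table} ADD IF NOT EXISTS "
        f"PARTITION (dt='{day.isoformat()}') "
        f"LOCATION '{partition_prefix(base, day)}';"
    )


if __name__ == "__main__":
    # Hypothetical bucket and external table, used only to show the output shape.
    print(add_partition_sql(
        "spectrum_schema.app_logs",
        "s3://example-bucket/logs_parquet",
        date(2024, 1, 15),
    ))
```

With a layout like this, a query that filters on `dt` only needs to read the matching prefixes, and a columnar format such as Parquet lets the engine read only the columns a query references.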

Exam domain: Data Store Management