🔥 40% Off Crucial Exams Memberships — This Week Only

3 days, 7 hours remaining!

AWS Certified Data Engineer Associate DEA-C01 Practice Question

A company lands 2 TB of comma-separated log files in an Amazon S3 landing prefix every night at 01:00. Analysts query the data with Amazon Athena and want the new records available within 30 minutes in a curated S3 prefix, stored as Apache Parquet and partitioned by ingestion date. The data engineering team wants the lowest operational overhead and to minimize compute costs when the nightly workload is idle. Which approach meets these requirements?

  • Spin up a long-running Amazon EMR cluster with Apache Spark. Schedule a daily step at 01:05 that converts the files to Parquet and writes them to the curated prefix, leaving the cluster running for the next day's job.

  • Configure an AWS Glue Spark job that is triggered when new files arrive. The job converts the CSV input to Parquet, partitions by date, writes to the curated S3 prefix, and uses Glue job auto-scaling so no compute is billed when idle.

  • Load the CSV data into an Amazon Redshift table each night, then run an UNLOAD command to write Parquet files partitioned by date back to S3 for Athena queries.

  • Invoke an AWS Lambda function from each S3 PUT event. The function uses pandas to read the CSV objects, convert them to Parquet, and store the results in the curated prefix.

AWS Certified Data Engineer Associate DEA-C01
Data Ingestion and Transformation
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot