AWS Certified Data Engineer Associate DEA-C01 Practice Question

A data engineering team ingests 10 TB of semi-structured clickstream events through Amazon Kinesis Data Firehose into an Amazon S3 data lake. Analysts will run frequent Amazon Athena queries that usually reference only a few columns. The team wants to minimize both query latency and the amount of data scanned to control cost. Which file format should the team configure Kinesis Data Firehose to deliver to S3 to best meet these requirements?

  • Apache Parquet files with Snappy compression

  • Gzip-compressed JSON files

  • Line-delimited JSON text (.txt)

  • Comma-separated values (CSV)
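
When weighing the options, it can help to estimate how much data a query would scan under each storage layout. The sketch below is a rough back-of-the-envelope model, not an Athena API. The function names, the 50-column schema, the equal-column-size simplification, and the price per TB scanned are all illustrative assumptions; check current Athena pricing for your Region before relying on any figure.

```python
# Rough model of bytes scanned per query for row-oriented vs. columnar storage.
# Every number here is a hypothetical assumption, not a measured value.

TB = 1024 ** 4  # bytes in one tebibyte


def scanned_bytes(total_bytes, columns_total, columns_read,
                  columnar, compression_ratio=1.0):
    """Estimate the bytes a query engine must read.

    Row-oriented text formats (CSV, JSON) are read in full no matter how
    few columns the query references. Columnar formats let the engine read
    only the referenced columns. For simplicity this assumes all columns
    are the same size.
    """
    stored = total_bytes / compression_ratio
    if columnar:
        return stored * columns_read / columns_total
    return stored


def query_cost_usd(bytes_scanned, price_per_tb=5.0):
    """Convert bytes scanned to cost; price_per_tb is an assumed rate."""
    return bytes_scanned / TB * price_per_tb


if __name__ == "__main__":
    dataset = 10 * TB  # dataset size from the question
    row = scanned_bytes(dataset, 50, 5, columnar=False)
    col = scanned_bytes(dataset, 50, 5, columnar=True)
    print(f"row-oriented: {row / TB:.1f} TB scanned, ${query_cost_usd(row):.2f}")
    print(f"columnar:     {col / TB:.1f} TB scanned, ${query_cost_usd(col):.2f}")
```

The model ignores partitioning, predicate pushdown, and per-format compression differences, which all shift the real numbers; pass a `compression_ratio` above 1.0 to approximate a compressed layout.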

Data Store Management