AWS Certified Data Engineer Associate DEA-C01 Practice Question
Domain: Data Ingestion and Transformation

An e-commerce company ingests 20 GB of clickstream logs into AWS every 15 minutes. An AWS Glue Spark job will convert the logs from JSON to partitioned Parquet before loading them into Amazon Redshift. The solution must minimize cost, decouple ingestion from the transformation jobs, and scale automatically with data volume without manual capacity management. Where should the logs be staged between ingestion and transformation?

  • Persist the logs on an Amazon EFS file system mounted on an ingestion EC2 instance.

  • Store the raw logs in an Amazon S3 bucket that serves as the landing zone.

  • Load the logs directly into Amazon Redshift staging tables in a separate schema.

  • Write the logs into an Amazon DynamoDB table as items with binary (B) attributes.
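For context on the pattern the stem describes, below is a minimal sketch of the Glue Spark (PySpark) job: it reads raw JSON from an S3 landing zone and writes partitioned Parquet back to S3 for a later Redshift COPY. The bucket names, prefixes, and the event_date partition column are placeholder assumptions, and the script runs only inside the Glue job runtime, where the awsglue library is available.

```python
import sys

from awsglue.context import GlueContext
from awsglue.job import Job
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext

# Standard Glue Spark job bootstrap.
args = getResolvedOptions(sys.argv, ["JOB_NAME"])
glue_context = GlueContext(SparkContext())
job = Job(glue_context)
job.init(args["JOB_NAME"], args)

# Read the raw JSON logs from the S3 landing zone.
# "example-landing-zone" is a placeholder bucket name.
raw = glue_context.create_dynamic_frame.from_options(
    connection_type="s3",
    connection_options={"paths": ["s3://example-landing-zone/clickstream/"]},
    format="json",
)

# Write partitioned Parquet to a processed prefix; Redshift can then
# ingest it with COPY. "event_date" is assumed to be a column already
# present in the logs.
glue_context.write_dynamic_frame.from_options(
    frame=raw,
    connection_type="s3",
    connection_options={
        "path": "s3://example-processed/clickstream/",
        "partitionKeys": ["event_date"],
    },
    format="parquet",
)

job.commit()
```

Staging in S3 keeps ingestion and transformation loosely coupled: producers write objects at whatever rate they arrive, and the Glue job scales its workers against whatever has landed, with no file system or table capacity to manage.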
