AWS Certified Data Engineer Associate DEA-C01 Practice Question

A data engineering team must design an AWS data lake that stores three datasets: relational product catalog tables, clickstream events arriving as semi-structured JSON, and high-resolution product images. The solution must support ad hoc ANSI SQL analytics across the catalog and clickstream data, maintain a catalog of object metadata for the images, require minimal ongoing administration, and keep storage costs as low as possible. Which approach best meets these requirements?

Ingest every dataset into a single Amazon Redshift cluster, storing the product images in a BYTEA column and using standard Redshift tables for the catalog and clickstream data.
Stream all data through Amazon MSK and index it in Amazon OpenSearch Service, including the images through an attachments plug-in, then run reports with OpenSearch SQL queries.
Load the catalog tables into an Amazon RDS PostgreSQL instance, write the JSON events to Amazon DynamoDB, keep images in Amazon S3, and use Amazon Redshift federated queries for analytics.
Store Parquet files for the catalog, raw JSON files for clickstream events, and the image objects in Amazon S3; register all locations and image metadata in the AWS Glue Data Catalog and query them with Amazon Athena or Amazon Redshift Spectrum.
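
To make the last option more concrete, here is a minimal sketch of how raw JSON clickstream files in Amazon S3 might be exposed as a table that Amazon Athena can query. The helper name `build_clickstream_ddl`, the database, table, and bucket names, and the column list are all hypothetical and not part of the question. The sketch only builds the DDL text and assumes the OpenX JSON SerDe that Athena supports; submitting the statement to Athena is left out.

```python
from typing import Mapping


def build_clickstream_ddl(
    database: str,
    table: str,
    s3_location: str,
    columns: Mapping[str, str],
) -> str:
    """Build an Athena CREATE EXTERNAL TABLE statement for raw JSON files.

    All names passed in are illustrative assumptions; the function only
    formats text and does not call any AWS API.
    """
    # One "`name` type" entry per column, in the order given.
    column_lines = ",\n".join(
        f"  `{name}` {sql_type}" for name, sql_type in columns.items()
    )
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS `{database}`.`{table}` (\n"
        f"{column_lines}\n"
        ")\n"
        # JSON SerDe class supported by Athena for one-JSON-object-per-line files.
        "ROW FORMAT SERDE 'org.openx.data.jsonserde.JsonSerDe'\n"
        f"LOCATION '{s3_location}'"
    )


if __name__ == "__main__":
    # Hypothetical database, table, bucket, and columns.
    print(
        build_clickstream_ddl(
            "lake",
            "clickstream",
            "s3://example-bucket/clickstream/",
            {"user_id": "string", "event_type": "string"},
        )
    )
```

Because the table is external, the JSON files stay in S3 and the statement only registers the schema and location in the catalog, which is what keeps administration and storage cost low in this design.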

Exam: AWS Certified Data Engineer Associate DEA-C01
Domain: Data Store Management
