AWS Certified Data Engineer Associate DEA-C01 Practice Question

An analytics team must build an AWS Glue Spark job that enriches 500 GB of Parquet click-stream data stored in Amazon S3 with a 5 GB customer dimension table that resides in an Amazon RDS for PostgreSQL instance. The solution must minimize infrastructure management, let multiple future jobs reuse the same metadata, and ensure that all traffic stays within the VPC. Which approach meets these requirements?

  • Use AWS DMS to replicate the RDS table into Amazon DynamoDB and query DynamoDB from the Glue Spark job for the customer dimension data.

  • Set up AWS Database Migration Service to export the RDS table to Amazon S3 each night, crawl the exported files, and join them with the click-stream data in the Glue job.

  • Configure Amazon Athena with the PostgreSQL federated query connector and have the Glue job retrieve the customer table by querying Athena during each run.

  • Create an AWS Glue JDBC connection to the RDS endpoint in the VPC, run a crawler with that connection to catalog the customer table, and have the Glue Spark job read the cataloged JDBC table alongside the Parquet files.
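The JDBC-connection-plus-crawler approach described in the final option can be sketched as a Glue Spark (PySpark) job script. This is a minimal illustrative sketch only: the catalog database name `clickstream_db`, table name `customer_dim`, join key `customer_id`, and the S3 paths are assumed placeholders, not values given in the question, and the script runs only inside the AWS Glue service.

```python
# Hedged sketch of a Glue Spark job that joins cataloged data sources.
# Assumptions (not from the question): database "clickstream_db",
# table "customer_dim", join key "customer_id", example S3 paths.
import sys
from awsglue.context import GlueContext
from awsglue.job import Job
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext

args = getResolvedOptions(sys.argv, ["JOB_NAME"])
glue_context = GlueContext(SparkContext.getOrCreate())
job = Job(glue_context)
job.init(args["JOB_NAME"], args)

# Read the Parquet click-stream data directly from S3.
clicks = glue_context.create_dynamic_frame.from_options(
    connection_type="s3",
    connection_options={"paths": ["s3://example-bucket/clickstream/"]},
    format="parquet",
).toDF()

# Read the customer dimension table that the crawler cataloged through the
# JDBC connection; Glue pulls it from RDS over the VPC at run time, and any
# future job can reuse the same Data Catalog entry.
customers = glue_context.create_dynamic_frame.from_catalog(
    database="clickstream_db",
    table_name="customer_dim",
).toDF()

# Enrich each click-stream event with customer attributes and write back to S3.
enriched = clicks.join(customers, on="customer_id", how="left")
enriched.write.mode("overwrite").parquet("s3://example-bucket/enriched/")

job.commit()
```

Because both sources are registered in the Data Catalog, other jobs can call `create_dynamic_frame.from_catalog` with the same database and table names without redefining the connection.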

Domain: Data Ingestion and Transformation