AWS Certified Data Engineer Associate DEA-C01 Practice Question
An analytics team must build an AWS Glue Spark job that enriches 500 GB of Parquet click-stream data stored in Amazon S3 with a 5 GB customer dimension table that resides in an Amazon RDS for PostgreSQL instance. The solution must minimize infrastructure management, let multiple future jobs reuse the same metadata, and ensure that all traffic stays within the VPC. Which approach meets these requirements?
Configure Amazon Athena with the PostgreSQL federated query connector and have the Glue job retrieve the customer table by querying Athena during each run.
Set up AWS Database Migration Service to export the RDS table to Amazon S3 each night, crawl the exported files, and join them with the click-stream data in the Glue job.
Create an AWS Glue JDBC connection to the RDS endpoint in the VPC, run a crawler with that connection to catalog the customer table, and have the Glue Spark job read the cataloged JDBC table alongside the Parquet files.
Use AWS DMS to replicate the RDS table into Amazon DynamoDB and query DynamoDB from the Glue Spark job for the customer dimension data.
Creating an AWS Glue JDBC connection to the RDS instance keeps network traffic inside the VPC and removes the need to manage custom drivers or endpoints. A crawler that uses this connection can catalog the PostgreSQL table in the AWS Glue Data Catalog. The Spark job can then read both the Parquet dataset and the cataloged JDBC table through the same catalog, allowing other Glue or EMR jobs to reuse the metadata. Exporting to S3, using Athena federation, or replicating into DynamoDB adds extra components, increases management overhead, or changes the data store, so they do not best satisfy the stated constraints.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is an AWS Glue JDBC connection?
Open an interactive chat with Bash
What role does the AWS Glue Data Catalog play in this solution?
Open an interactive chat with Bash
How does AWS Glue ensure traffic stays within the VPC?
Open an interactive chat with Bash
AWS Certified Data Engineer Associate DEA-C01
Data Ingestion and Transformation
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .