AWS Certified Data Engineer Associate DEA-C01 Practice Question
A data engineer needs to let business analysts visually build and test data transformations on a 5 GB sample of CSV files stored in Amazon S3, then run those same transformations every night on 2 TB of new data and write the output to Parquet for Amazon Athena. The company does not want to manage clusters or write code. Which approach meets these requirements with the least operational effort and cost?
Schedule an AWS Lambda function with Amazon EventBridge that runs an Amazon Athena CTAS query to convert the CSV files to Parquet each night.
Author a visual ETL job in AWS Glue Studio that uses Apache Spark to convert the data and trigger it nightly with a Glue workflow.
Create an AWS Glue DataBrew project to build a transformation recipe on the sample data and schedule a DataBrew job to run nightly on the full S3 dataset, outputting Parquet.
Launch an Amazon EMR cluster running Apache Spark, store the transformation script in Amazon S3, and invoke it nightly with Amazon EventBridge.
AWS Glue DataBrew provides a browser-based interface where analysts can visually build a recipe of transformations on a sampled subset of data without writing code. The same recipe can be executed as a DataBrew job that scales out serverlessly to process large S3 datasets and write the results in Parquet. No clusters or code management is required.
A Glue Studio Spark job or an EMR cluster would satisfy the technical needs but still require Spark script maintenance and cluster capacity management. An Athena CTAS statement automated with Lambda removes cluster administration but offers only SQL-based transforms and no visual recipe creation for analysts.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is AWS Glue DataBrew?
Open an interactive chat with Bash
How does AWS Glue DataBrew differ from AWS Glue Studio?
Open an interactive chat with Bash
Why is Parquet used as the output format for Athena?
Open an interactive chat with Bash
AWS Certified Data Engineer Associate DEA-C01
Data Operations and Support
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .