AWS Certified Data Engineer Associate DEA-C01 Practice Question
A data engineering team stores AWS Glue PySpark scripts in an AWS CodeCommit repository. An AWS CodePipeline currently builds the artifacts and deploys them through AWS CloudFormation. The team must add an automated test stage that installs Python dependencies, executes pytest-based unit tests that import AWS Glue 4.0 libraries, and fails the pipeline if any test fails, while minimizing cost and infrastructure management. Which solution meets these requirements?
Insert an AWS Batch job action that spins up an EMR on EKS cluster during the pipeline run to execute the PySpark unit tests and then terminates the cluster.
Introduce a manual approval action that requires a data engineer to execute the tests on a dedicated EC2 instance before continuing the pipeline.
Add an AWS CodeBuild stage to the pipeline that uses the public AWS Glue 4.0 Docker image as its build environment, runs pip install -r requirements.txt followed by pytest, and lets the build's exit code determine pipeline success.
Add an AWS Glue job configured with job bookmarks in "Test run" mode; configure CodePipeline to monitor a CloudWatch metric from the job and fail if the metric reports errors.
Using an AWS CodeBuild action inside CodePipeline delivers an on-demand, fully managed build environment that can run any Docker image. By specifying the publicly available amazon/aws-glue-libs:glue_libs_4.0 image (or a custom image that extends it), the build container already contains the same AWS Glue libraries used in production jobs. The buildspec can pip-install additional dependencies and run pytest; CodeBuild returns a non-zero exit code when tests fail, which automatically causes the CodePipeline stage to fail. No persistent infrastructure is required and you pay only for the build minutes used. The other options either introduce unnecessary infrastructure (EMR on EKS, EC2 host) or do not provide an automated, unit-test capable stage (Glue job bookmarks, manual approval).
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is AWS CodeBuild and how does it integrate with CodePipeline?
Open an interactive chat with Bash
What is the purpose of the amazon/aws-glue-libs:glue_libs_4.0 Docker image?
Open an interactive chat with Bash
Why is CodeBuild a better choice than EMR on EKS or Glue jobs for unit testing?
Open an interactive chat with Bash
AWS Certified Data Engineer Associate DEA-C01
Data Ingestion and Transformation
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .