AWS Certified Data Engineer Associate DEA-C01 Practice Question

A data engineer is writing a Python script that Jenkins will call after each hourly ingest. The script must start the AWS Glue job named csv_to_parquet, pass the S3 object path as a parameter, then block until the job finishes or fails so that Jenkins can act on the result. The engineer wants to minimise the number of AWS API calls and avoid writing custom polling logic. Which approach meets these requirements when using the AWS SDK for Python (boto3)?

  • Call glue.start_job_run with the path argument, then repeatedly call glue.get_job_run every 5 seconds until the JobRunState is SUCCEEDED or FAILED.

  • Call boto3.resource("glue").Job("csv_to_parquet").run(path=object_path); the run method blocks until the job finishes.

  • Call glue.start_job_run, capture the JobRunId, then invoke glue.get_waiter("job_run_succeeded").wait(JobName="csv_to_parquet", RunId=job_id).

  • Call glue.start_job_run, capture the JobRunId, and poll the corresponding CloudWatch Logs stream until log events stop.
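For reference, the calls named in the options can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not a definitive implementation: start_job_run and get_job_run are Glue client operations, but the waiter name job_run_succeeded is taken from the question text and should be checked against client.waiter_names in the installed boto3 version before relying on it. The helper name run_glue_job and the argument key --s3_path are hypothetical.

```python
import time

# Job run states after which a Glue run will not change again.
TERMINAL_STATES = {"SUCCEEDED", "FAILED", "STOPPED", "TIMEOUT", "ERROR", "EXPIRED"}


def run_glue_job(glue, job_name, object_path, poll_seconds=5, sleep=time.sleep):
    """Start a Glue job run and block until it reaches a terminal state.

    `glue` is expected to be a boto3 Glue client, e.g. boto3.client("glue").
    Returns the final JobRunState string.
    """
    response = glue.start_job_run(
        JobName=job_name,
        # Hypothetical argument key; Glue job arguments are prefixed with "--".
        Arguments={"--s3_path": object_path},
    )
    run_id = response["JobRunId"]

    # Waiter name as given in the question; assumed, not verified.
    # Use it only if the installed SDK actually exposes it.
    waiter_name = "job_run_succeeded"
    if waiter_name in getattr(glue, "waiter_names", []):
        # A botocore waiter raises WaiterError if the run fails or times out.
        glue.get_waiter(waiter_name).wait(JobName=job_name, RunId=run_id)
        return "SUCCEEDED"

    # Fallback: the manual polling loop described in the first option.
    while True:
        run = glue.get_job_run(JobName=job_name, RunId=run_id)
        state = run["JobRun"]["JobRunState"]
        if state in TERMINAL_STATES:
            return state
        sleep(poll_seconds)
```

A Jenkins step could call run_glue_job(boto3.client("glue"), "csv_to_parquet", object_path) and exit with a non-zero status unless the returned state is SUCCEEDED.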

Domain: Data Operations and Support