AWS Certified Data Engineer Associate DEA-C01 Practice Question
A data engineer is developing a production ML workflow that uses Amazon SageMaker Pipelines to read raw files from Amazon S3, perform data preprocessing, train a model, and deploy the model to a SageMaker endpoint. The company must keep an auditable, end-to-end record of every dataset, processing job, model version, and endpoint created by the pipeline while writing as little custom tracking code as possible. Which solution meets these requirements?
Enable SageMaker ML Lineage Tracking in the SageMaker Pipeline so that each step automatically registers its artifacts and relationships, then query the lineage graph through the SageMaker Lineage API.
Turn on AWS CloudTrail for all SageMaker API calls and analyze the resulting logs with Amazon Athena to reconstruct the lineage of artifacts.
Refactor the workflow into AWS Step Functions and enable AWS X-Ray tracing so that each state transition captures lineage information for audit queries.
Run an AWS Glue crawler after every pipeline step and store the results in the AWS Glue Data Catalog to represent lineage between datasets, jobs, and models.
Amazon SageMaker ML Lineage Tracking is natively integrated with SageMaker Pipelines. When lineage tracking is enabled, each pipeline execution automatically records artifacts such as datasets, processing jobs, training jobs, models, and endpoints, and registers the relationships among them. These artifacts and their dependencies can be queried through the SageMaker Lineage API, used in automated compliance reports, or visualized in SageMaker Studio. AWS Glue crawlers catalog only data locations and cannot track transformations or model artifacts. CloudTrail logs must be parsed and correlated manually and do not provide semantic lineage. Step Functions with X-Ray trace requests but do not capture domain-specific ML artifacts without extensive custom instrumentation.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is SageMaker ML Lineage Tracking?
Open an interactive chat with Bash
What is the role of the SageMaker Lineage API?
Open an interactive chat with Bash
How does SageMaker Pipelines integrate with ML Lineage Tracking?
Open an interactive chat with Bash
AWS Certified Data Engineer Associate DEA-C01
Data Store Management
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .