CompTIA DataX DY0-001 (V1) Practice Question

Your organization's GitHub repository contains code for an ML pipeline while the training data (≈200 GB) lives in an Amazon S3 bucket that is overwritten every week. Compliance rules require that anyone who checks out any past Git commit can automatically restore exactly the dataset that was used for that commit, without bloating the repository or exceeding GitHub file-size limits. Which approach best satisfies these requirements?

Track the dataset with DVC: commit the lightweight .dvc pointer files to Git and configure an S3 DVC remote so that "git checkout" followed by "dvc pull" retrieves the exact snapshot.
Store the full dataset in Git Large File Storage so each commit contains a pointer to the data blobs managed by Git LFS.
Package every weekly snapshot as a compressed archive and upload it as a GitHub release asset referenced by a repository tag.
Enable S3 object versioning and save the object version IDs in a YAML configuration file that the pipeline reads at runtime.

CompTIA DataX DY0-001 (V1)

Operations and Processes


Answer Description
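
The DVC option depends on small pointer files that record a content hash, while the data itself sits in remote storage keyed by that hash. The Python sketch below illustrates the idea only and is not DVC's actual implementation: `snapshot`, `restore`, and `demo` are hypothetical helpers, a local directory stands in for the S3 remote, and SHA-256 is an assumed choice of hash.

```python
import hashlib
import tempfile
from pathlib import Path


def snapshot(data_file: Path, remote_dir: Path) -> dict:
    """Store data_file under its content hash and return a small pointer."""
    payload = data_file.read_bytes()
    digest = hashlib.sha256(payload).hexdigest()  # assumed hash, for illustration
    remote_dir.mkdir(parents=True, exist_ok=True)
    # Content addressing: a new version gets a new key, so old versions survive.
    (remote_dir / digest).write_bytes(payload)
    # Only this small dict would be committed to Git, never the data itself.
    return {"path": data_file.name, "sha256": digest, "size": len(payload)}


def restore(pointer: dict, remote_dir: Path, workdir: Path) -> Path:
    """Fetch the exact bytes a pointer refers to and verify them."""
    payload = (remote_dir / pointer["sha256"]).read_bytes()
    if hashlib.sha256(payload).hexdigest() != pointer["sha256"]:
        raise ValueError("stored object does not match the pointer hash")
    target = workdir / pointer["path"]
    target.write_bytes(payload)
    return target


def demo() -> tuple:
    """Overwrite a dataset weekly, then restore each week from its pointer."""
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        work, remote = root / "work", root / "remote"
        work.mkdir()
        data = work / "train.csv"

        data.write_bytes(b"week 1 rows\n")
        pointer_week1 = snapshot(data, remote)  # pointer saved with commit 1

        data.write_bytes(b"week 2 rows\n")      # the weekly overwrite
        pointer_week2 = snapshot(data, remote)  # pointer saved with commit 2

        # Checking out an old commit brings back its pointer; restoring
        # from that pointer yields the dataset used for that commit.
        week1 = restore(pointer_week1, remote, work).read_bytes()
        week2 = restore(pointer_week2, remote, work).read_bytes()
        return week1, week2
```

In the workflow the option describes, `dvc add` would play the role of writing the pointer, `dvc push` would upload the data to the S3 remote, and `git checkout` followed by `dvc pull` would correspond to `restore` here.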
