CompTIA DataX DY0-001 (V1) Practice Question

A machine learning engineering team is migrating its model training environment from a single, high-memory on-premises server. Their primary challenge is training a complex deep learning model on a 10TB dataset, which requires both data and model parallelism. The team requires a solution that provides high availability, dynamic scaling of compute resources, and efficient distributed processing. However, they must also minimize the manual effort required for resource management and fault recovery. Which cluster deployment strategy best meets all of these criteria?

  • Deploy the training workload using a distributed computing framework like Apache Spark or a TensorFlow distribution strategy, running on a containerized cluster managed by Kubernetes.

  • Provision a cluster of virtual machines and use a shared network file system (NFS) to allow each node to access the data, coordinating the process with custom shell scripts.

  • Deploy the entire 10TB dataset and training script into a single, massive container image and rely on a container orchestrator to restart the container on another node upon failure.

  • Implement a high-availability database cluster and perform model training inside the database using distributed stored procedures.

CompTIA DataX DY0-001 (V1)
Operations and Processes
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot