CompTIA DataX DY0-001 (V1) Practice Question

A data scientist is tasked with optimizing the execution of a complex data processing pipeline, which consists of multiple interdependent jobs structured as a Directed Acyclic Graph (DAG). Each job has a specific execution duration and requires a set of resources (e.g., CPU cores, GB of RAM). The entire pipeline must be run on a computing cluster with a fixed, limited amount of total resources. The primary objective is to schedule the jobs to minimize the total time to completion (makespan) for the entire pipeline, while ensuring that job dependencies are respected and the cluster's resource capacity is never exceeded. Which of the following optimization approaches is the MOST appropriate for finding an optimal schedule for this problem?

  • Formulating the problem as an Integer Programming (IP) model to solve for the optimal start times of each job.

  • Using a Multi-armed Bandit algorithm to dynamically allocate resources to jobs based on their expected performance.

  • Modeling the problem as a Traveling Salesman Problem (TSP) to find the most efficient sequence of jobs.

  • Applying a greedy algorithm that prioritizes scheduling the shortest jobs first.

CompTIA DataX DY0-001 (V1)
Specialized Applications of Data Science
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot