CompTIA DataX DY0-001 (V1) Practice Question

Your e-commerce company has collected only implicit-feedback signals such as clicks and purchase counts and must train a large-scale collaborative-filtering model for more than 10 million users and 2 million products on a distributed Spark cluster. The team is deciding between stochastic gradient descent (SGD) and Alternating Least Squares (ALS) for matrix factorization. Which property of ALS provides the strongest reason to choose it for this distributed workload?

  • ALS guarantees convergence to the global minimum of the non-convex matrix-factorization objective, ensuring higher accuracy than SGD.

  • ALS automatically enforces non-negative latent factors, a requirement for models trained on implicit-feedback data.

  • During each alternating step, the least-squares solves for different users (or items) are independent, so their factor vectors can be computed in parallel across cluster nodes.

  • ALS needs only a single pass over the interaction data and therefore avoids costly data shuffling in distributed streaming environments.

CompTIA DataX DY0-001 (V1)
Machine Learning
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot