CompTIA DataX DY0-001 (V1) Practice Question

A movie-streaming provider keeps a 1-5 star rating matrix and wants to build a user-based, similarity-based recommender. Some customers are "tough graders" who rarely rate above three stars, while others routinely give four or five stars even to average titles. To make sure that neighbor selection reflects relative preferences rather than each customer's personal rating scale, which similarity measure should the data scientist choose when constructing the user-user similarity matrix?

  • Pearson correlation coefficient computed on co-rated items

  • Euclidean distance between raw rating vectors

  • Cosine similarity applied to the raw rating vectors

  • Jaccard similarity on the sets of movies each user has rated

CompTIA DataX DY0-001 (V1)
Machine Learning
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot