CompTIA DataX DY0-001 (V1) Practice Question

Your team must create a ground-truth sentiment dataset for 10 000 social-media posts. Because of budget limits, you can hire no more than three crowd workers per post, but the chief data scientist insists on at least 95 % label accuracy before the data are used for model training. Which strategy should you implement first to guarantee label quality without exceeding the budget?

  • Pre-train a weak language model on synthetic data and automatically overwrite any crowd label whose predicted probability is below 0.95.

  • Have every post labeled twice and keep only labels from pairs whose Cohen's kappa exceeds 0.8.

  • Increase the pay rate to attract experienced annotators but assign each post to a single worker to stay within budget.

  • Mix a hidden set of expert-labeled "gold" posts into each task and block annotators whose accuracy on these posts falls below a defined threshold.

CompTIA DataX DY0-001 (V1)
Operations and Processes
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot