Your team must create a ground-truth sentiment dataset for 10,000 social-media posts. Because of budget limits, you can hire no more than three crowd workers per post, but the chief data scientist insists on at least 95% label accuracy before the data are used for model training. Which strategy should you implement first to ensure label quality without exceeding the budget?
Pre-train a weak language model on synthetic data and automatically overwrite any crowd label whose predicted probability is below 0.95.
Have every post labeled twice and keep only labels from pairs whose Cohen's kappa exceeds 0.8.
Increase the pay rate to attract experienced annotators but assign each post to a single worker to stay within budget.
Mix a hidden set of expert-labeled "gold" posts into each task and block annotators whose accuracy on these posts falls below a defined threshold.
Seeding each annotation batch with expert-labeled gold (honeypot) items lets you measure every worker's accuracy in real time against an objective standard and quickly disqualify low-quality annotators. This prevents large volumes of noisy labels from entering the dataset while keeping the per-post worker count within budget. The other options fall short: computing inter-annotator agreement only detects problems after the labels have been collected, auto-correcting labels with a weak model can amplify its own errors, and paying more for a single annotator removes the redundancy needed to verify quality.
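As a rough illustration of how gold seeding works in practice, the Python sketch below mixes hidden expert-labeled posts into each batch and blocks any worker whose running accuracy on those posts drops below a threshold. The field names (`is_gold`, `gold_label`), the gold ratio, and the minimum-judgment guard are illustrative assumptions, not part of any specific platform's API.

```python
import random
from collections import defaultdict

# Hypothetical quality-control parameters (illustrative values).
GOLD_RATIO = 0.1           # fraction of each batch drawn from expert-labeled gold posts
ACCURACY_THRESHOLD = 0.95  # workers below this accuracy on gold items are blocked


def build_batch(unlabeled_posts, gold_posts, batch_size=20):
    """Mix hidden gold items into a batch of regular posts and shuffle them."""
    n_gold = max(1, int(batch_size * GOLD_RATIO))
    batch = random.sample(unlabeled_posts, batch_size - n_gold)
    batch += random.sample(gold_posts, n_gold)
    random.shuffle(batch)  # workers cannot tell gold items from regular ones
    return batch


class WorkerMonitor:
    """Track each worker's accuracy on gold items and block low performers."""

    def __init__(self):
        self.correct = defaultdict(int)
        self.seen = defaultdict(int)
        self.blocked = set()

    def record(self, worker_id, post, submitted_label):
        if not post.get("is_gold"):
            return  # only gold items contribute to the accuracy estimate
        self.seen[worker_id] += 1
        if submitted_label == post["gold_label"]:
            self.correct[worker_id] += 1
        accuracy = self.correct[worker_id] / self.seen[worker_id]
        # Require a few gold judgments before acting, to avoid blocking on noise.
        if self.seen[worker_id] >= 5 and accuracy < ACCURACY_THRESHOLD:
            self.blocked.add(worker_id)

    def is_blocked(self, worker_id):
        return worker_id in self.blocked
```

The minimum-judgment guard (here, five gold items) is a common design choice: it keeps one unlucky answer from disqualifying an otherwise reliable annotator while still catching consistently poor work early.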