CompTIA DataX DY0-001 (V1) Practice Question

A data science team is tasked with extracting information from thousands of biomedical research papers. They are using a powerful, pre-trained transformer-based Named Entity Recognition (NER) model that was trained on a general news and web text corpus. The model performs poorly, frequently failing to identify or misclassifying domain-specific entities such as protein names, gene sequences, and complex chemical compounds. Which of the following represents the most effective and direct strategy to significantly improve the model's performance on this specialized corpus?

Develop an extensive set of regular expressions and dictionary-based rules to specifically target and extract the biomedical entities.
Fine-tune the pre-trained transformer model using a manually annotated dataset of biomedical research papers.
Replace the transformer-based architecture with a Conditional Random Field (CRF) model trained from scratch on the specialized biomedical corpus.
Apply aggressive text normalization techniques, such as stemming and stop word removal, to the biomedical text before processing it with the existing model.
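
For context on the "manually annotated dataset" mentioned in the fine-tuning option: NER annotations are commonly stored as character spans and converted to per-token BIO tags before training. The sketch below illustrates that conversion under simplifying assumptions. The function name `spans_to_bio`, the example sentence, and the `GENE`/`PROTEIN` labels are hypothetical, and whitespace tokenization stands in for the subword tokenizer a real transformer would use.

```python
import re


def spans_to_bio(text, spans):
    """Convert character-level entity spans into token-level BIO tags.

    spans: list of (start, end, label) tuples, with end exclusive.
    Tokenization is plain whitespace splitting (an illustrative assumption).
    """
    tokens, tags = [], []
    for match in re.finditer(r"\S+", text):
        tok_start, tok_end = match.span()
        tag = "O"  # default: token is outside any entity
        for start, end, label in spans:
            # A token belongs to an entity if it overlaps the annotated span.
            if tok_start < end and tok_end > start:
                # The first token of an entity gets B-, later tokens get I-.
                prefix = "B-" if tok_start <= start else "I-"
                tag = prefix + label
                break
        tokens.append(match.group())
        tags.append(tag)
    return tokens, tags


# Hypothetical annotated sentence (made up for illustration).
text = "BRCA1 interacts with tumor protein p53 in vitro"
spans = [(0, 5, "GENE"), (21, 38, "PROTEIN")]

tokens, tags = spans_to_bio(text, spans)
for token, tag in zip(tokens, tags):
    print(f"{token}\t{tag}")
```

Token and tag pairs of this kind are the usual supervision signal when a token-classification head is trained on top of a pre-trained encoder. With a subword tokenizer, the labels would additionally need to be aligned to word pieces.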

Domain: Specialized Applications of Data Science
