CompTIA DataX DY0-001 (V1) Practice Question

A data science team is building a convolutional neural network (CNN) to detect a rare type of retinal anomaly from a small and imbalanced dataset of fundus images. The initial model shows high accuracy on the training set but performs poorly on the validation set, indicating significant overfitting. The team has a limited budget and cannot collect more real-world data.

To improve the model's generalization and mitigate overfitting, which of the following data augmentation strategies would be the most effective and appropriate first step for this specific computer vision task?

  • Enriching the dataset by adding features extracted via geocoding the hospital location where each image was taken.

  • Implementing one-hot encoding on the image labels and then generating synthetic data using a Generative Adversarial Network (GAN).

  • Applying geometric transformations such as random rotations, flips, and zooms, combined with photometric distortions like adjustments to brightness and contrast.

  • Using Synthetic Minority Over-sampling Technique (SMOTE) directly on the flattened pixel values of the images.

CompTIA DataX DY0-001 (V1)
Modeling, Analysis, and Outcomes
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot