CompTIA DataX DY0-001 (V1) Practice Question

A data science team is designing a data lake architecture on a distributed file system to store terabytes of structured event data for analytical querying. The primary use case involves running complex, read-heavy queries for feature engineering, which frequently select a small subset of columns from a wide table containing over 200 columns. The system must also support schema evolution as new event properties are added over time. Given these requirements, which data format is the most appropriate for storing the processed data in the data lake to optimize query performance and storage efficiency?

  • Parquet

  • JSON

  • Avro

  • CSV
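One way to reason about the access pattern in this question is to compare how many bytes a query must scan when it needs only a few of the 200 columns. The following is a minimal sketch using only the Python standard library. It does not use any real file format: the row-oriented layout stands in for text formats such as CSV, and the column-oriented layout stands in for columnar formats such as Parquet, without the compression and encoding those formats add. The column names, row count, and `WANTED` list are hypothetical values chosen for illustration.

```python
import csv
import io

NUM_ROWS = 1_000
NUM_COLS = 200
# Hypothetical subset of columns a feature-engineering query selects.
WANTED = ["col_3", "col_57", "col_142"]

columns = [f"col_{i}" for i in range(NUM_COLS)]
rows = [[f"{r}-{c}" for c in range(NUM_COLS)] for r in range(NUM_ROWS)]


def row_oriented_bytes_scanned(rows, columns):
    """Row layout: each record is stored contiguously, so picking out
    a few fields still means reading every row in full."""
    buf = io.StringIO()
    writer = csv.writer(buf)
    writer.writerow(columns)
    writer.writerows(rows)
    return len(buf.getvalue().encode("utf-8"))


def column_oriented_bytes_scanned(rows, columns, wanted):
    """Column layout: each column is stored as its own chunk, so a
    reader can skip every chunk it does not need."""
    chunks = {
        name: "\n".join(row[i] for row in rows).encode("utf-8")
        for i, name in enumerate(columns)
    }
    return sum(len(chunks[name]) for name in wanted)


row_bytes = row_oriented_bytes_scanned(rows, columns)
col_bytes = column_oriented_bytes_scanned(rows, columns, WANTED)
print(f"row-oriented scan:    {row_bytes} bytes")
print(f"column-oriented scan: {col_bytes} bytes")
```

In this toy model the column-oriented scan reads only the three requested chunks, while the row-oriented scan reads the whole table. Schema evolution is not modeled here; it is a separate requirement to weigh when comparing the options.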

Operations and Processes