CompTIA DataX DY0-001 (V1) Practice Question

During a performance review of a cloud-based data lake, engineers notice that most analytical queries read only a handful of numeric columns out of hundreds stored in high-volume IoT event logs that arrive as nested JSON objects. They want to cut scan time and storage costs by converting the raw ingestion files to a different format. The ideal replacement format must preserve the events' nested schema, enable column pruning and predicate push-down for efficient querying, and provide high compression without hurting read performance. Which file format best satisfies all of these requirements?

  • Keep the events as RFC 4180-compliant CSV text to maximize compatibility.

  • Compress the existing JSON files using GZIP without changing the file format.

  • Convert the events to Apache Parquet files (for example, with Snappy compression).

  • Serialize the events as Apache Avro binary files.

CompTIA DataX DY0-001 (V1)
Operations and Processes
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot