CompTIA DataX DY0-001 (V1) Practice Question

A data architect at a major e-commerce company is designing an ingestion and storage solution for a new analytics platform. The platform will process high-velocity user clickstream data, which arrives as semi-structured JSON objects. The primary requirements are to support fast, complex analytical queries on specific columns while minimizing storage costs and providing data that is refreshed every few minutes. Which of the following approaches best meets all of these requirements?

  • Set up a daily batch process to collect all clickstream events, flatten them, and store them as compressed CSV files.

  • Stream the incoming JSON data directly into a structured, relational database, normalizing the data into multiple tables.

  • Ingest the data in micro-batches, converting the nested JSON into a flattened, columnar Parquet format for storage.

  • Implement a real-time streaming pipeline that writes the raw, nested JSON data directly to object storage as individual files.

CompTIA DataX DY0-001 (V1)
Operations and Processes
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

SAVE $64
$529.00 $465.00
Bash, the Crucial Exams Chat Bot
AI Bot