AWS Certified Solutions Architect Associate SAA-C03 Practice Question

A data engineering team stores hundreds of gigabytes of raw CSV files in an Amazon S3 data lake. They need to convert this data to Apache Parquet on a daily schedule as part of an ETL pipeline. The team wants a fully managed, serverless solution that lets them define the pipeline visually and perform the conversion without writing any code. Which AWS service or feature best meets these requirements?

  • Launch an Amazon EMR cluster running a custom Spark script that converts the files.

  • Configure Amazon S3 event notifications to trigger an AWS Lambda function that runs a Python conversion script.

  • Create an AWS Glue Studio visual ETL job that reads the CSV files and writes the output in Parquet format.

  • Set up AWS Data Pipeline with a ShellCommandActivity that uses the parquet-mr tool to rewrite the files.

AWS Certified Solutions Architect Associate SAA-C03
Design High-Performing Architectures
Your Score:
Settings & Objectives
Random Mixed
Questions are selected randomly from all chosen topics, with a preference for those you haven’t seen before. You may see several questions from the same objective or domain in a row.
Rotate by Objective
Questions cycle through each objective or domain in turn, helping you avoid long streaks of questions from the same area. You may see some repeat questions, but the distribution will be more balanced across topics.

Check or uncheck an objective to set which questions you will receive.

Bash, the Crucial Exams Chat Bot
AI Bot