AWS Certified Data Engineer Associate DEA-C01 Practice Question
A travel company ingests daily reservation files into an S3 data lake. An AWS Glue ETL job converts the CSV input to Parquet for downstream analytics. The data owner requires that every output record contains non-empty values for the BookingId and TravelerEmail columns. Records that do not meet this rule must be excluded from the Parquet dataset and stored separately for review. Which solution will satisfy the requirement with the least custom code and without adding additional compute services?
Insert a DropFields transformation for BookingId and TravelerEmail; rows with null values will automatically be excluded when the columns are dropped.
Use a ResolveChoice transformation to cast both columns to string; Glue implicitly skips rows where the cast fails because of null or empty values.
Add an ApplyMapping transformation that converts BookingId and TravelerEmail to the non-nullable data type; Glue will remove rows that violate the schema.
Add a filter transformation to the DynamicFrame that returns True only when both columns are not null and not an empty string, then write the rejected records to a different S3 prefix.
AWS Glue provides built-in transformations on DynamicFrames. The filter transformation lets you pass a native Python Boolean function that returns True for rows to keep and False for rows to discard. By checking that both BookingId and TravelerEmail are neither null nor empty strings, the ETL script keeps only valid rows. The same job can write the rejected data that failed the Boolean test to a separate S3 prefix, so no extra services or complex error-handling frameworks are required. ApplyMapping does not evaluate null or empty values; ResolveChoice only fixes ambiguous types, and DropFields removes entire columns rather than invalid rows.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is a DynamicFrame in AWS Glue?
Open an interactive chat with Bash
How does the `filter` transformation work in AWS Glue?
Open an interactive chat with Bash
Why is Parquet a preferred format for downstream analytics?
Open an interactive chat with Bash
AWS Certified Data Engineer Associate DEA-C01
Data Operations and Support
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
Pass with Confidence.
IT & Cybersecurity Package
You have hit the limits of our free tier, become a Premium Member today for unlimited access.
Military, Healthcare worker, Gov. employee or Teacher? See if you qualify for a Community Discount.
Monthly
$19.99
$19.99/mo
Billed monthly, Cancel any time.
3 Month Pass
$44.99
$14.99/mo
One time purchase of $44.99, Does not auto-renew.
MOST POPULAR
Annual Pass
$119.99
$9.99/mo
One time purchase of $119.99, Does not auto-renew.
BEST DEAL
Lifetime Pass
$189.99
One time purchase, Good for life.
What You Get
All IT & Cybersecurity Package plans include the following perks and exams .