Your analytics team is integrating web-server logs coming from data centers in New York, Chicago, and Los Angeles. Each record currently stores its event time as the string "2025-11-02 01:30:15" with no zone designator. During the November daylight-saving switch-back, that local clock time occurs twice, so the parsed datetimes cannot be ordered uniquely and a join with European event data fails. Before any joins or aggregations, which action is the most appropriate first step to eliminate the ambiguity and create a single, comparable timeline for the entire data set?
Remove all records whose local time falls inside the daylight-saving transition window and later re-create them with linear interpolation.
Extract only the date portion ("YYYY-MM-DD") and aggregate logs at a daily granularity, discarding the time-of-day detail.
Parse the string in the server's local zone and convert the resulting datetime to UTC, storing it in ISO 8601 format with a trailing "Z".
Keep the original local timestamp strings and add a Boolean column indicating whether each record was captured during daylight-saving time.
Parsing the local strings in their original time zones and immediately converting every value to Coordinated Universal Time (UTC) removes the duplicate "fall-back" hour. ISO 8601's trailing "Z" (or an explicit offset) records the relationship to UTC, so each moment is represented exactly once and can be compared or joined across regions. Dropping records discards data, adding a daylight-saving flag still leaves two identical local timestamps, and stripping the time of day prevents any sub-daily analysis; none of those approaches solves the fundamental ambiguity.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What does UTC stand for, and why is it important in data analysis?
Open an interactive chat with Bash
What is ISO 8601 format, and why use it for storing timestamps?
Open an interactive chat with Bash
How does the daylight-saving 'fall-back' hour create ambiguity in timestamps?