Skip to content

Conversation

haileyplusplus
Copy link

The code that reads raw GTFS data into dataframes doesn't have an explicit data type for service_id, requiring it to be inferred. In schedules with a mix of numeric and alphanumeric service ids, this can cause some to be read as numbers and some as strings, preventing joins from working properly. This change makes service_id values be from CSV as strings.

The following snippet should illustrate the issue for at least the next few weeks:

feed = Feed("https://www.transitchicago.com/downloads/sch_data/google_transit.zip",
            time_windows=[0, 6, 10, 16, 19, 24],
            start_date='2024-05-26', end_date='2024-05-26')
print(feed.lines_freq)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant