Chicagocrashes
Vijay Agnihotri
Hope I'm not out of time, but I built a project that analyses Chicago crash data: an end-to-end data pipeline that ingests, transforms, and stores Chicago traffic crash data for analysis. It pulls from three public datasets on the City of Chicago Data Portal (Crashes, People, and Vehicles) via the SODA2 REST API, loads the raw records into a local DuckDB database, and then joins them into a single analytics-ready table. The pipeline runs on a daily schedule and fetches only new records on each run by filtering the API requests on the Bruin-provided start and end dates, so it avoids re-downloading the entire dataset (over 1M crash records) every time. Hosted at:
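
For anyone curious how the incremental fetch could work, here is a minimal sketch of building a date-windowed SODA2 request. It is not the project's actual code. The helper name `build_window_url`, the `crash_date` column, the dataset id `85ca-t3if`, the half-open date window, and the `BRUIN_START_DATE` / `BRUIN_END_DATE` environment variable names are all assumptions for illustration; check them against the Data Portal and the Bruin docs before relying on them.

```python
import os
from urllib.parse import urlencode

# SODA2 resource endpoint on the City of Chicago Data Portal.
BASE_URL = "https://data.cityofchicago.org/resource"


def build_window_url(dataset_id, date_column, start_date, end_date,
                     limit=50000, offset=0):
    """Build a SODA2 URL that returns only rows inside [start_date, end_date).

    dataset_id and date_column are supplied by the caller; the values used
    below are assumptions, not confirmed details of the project.
    """
    # SoQL filter: keep rows whose timestamp falls in the half-open window.
    where = (
        f"{date_column} >= '{start_date}T00:00:00' "
        f"AND {date_column} < '{end_date}T00:00:00'"
    )
    params = {
        "$where": where,
        "$order": ":id",    # stable ordering so paging does not skip rows
        "$limit": limit,    # page size
        "$offset": offset,  # rows to skip for the current page
    }
    return f"{BASE_URL}/{dataset_id}.json?{urlencode(params)}"


if __name__ == "__main__":
    # Assumed variable names for the run window; the fallbacks are examples.
    start = os.environ.get("BRUIN_START_DATE", "2024-01-01")
    end = os.environ.get("BRUIN_END_DATE", "2024-01-02")
    # "85ca-t3if" is assumed to be the Crashes dataset id.
    print(build_window_url("85ca-t3if", "crash_date", start, end))
```

Fetching a full window would mean requesting this URL repeatedly with `offset` increased by `limit` until a page comes back with fewer than `limit` rows. The same helper could serve the People and Vehicles datasets by passing their own dataset ids and date columns.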