Reproducible data science for road safety research

Introduction

This workshop will take place at the University of Leeds’ Institute for Transport Studies (ITS) as part of the RS5C conference that runs from 3rd to 5th September 2025. The workshop takes place on the 2nd September 2025, 14:00-16:00, the day before the main conference starts.

The workshop will cover the fundamentals of reproducible data science for road safety research, building on a decade’s worth of experience working with road traffic casualty datasets for policy-relevant road safety research. The UK’s open access STATS19 database will be the basis of the session but the skills learned will be applicable to any road safety datasets. The session will cover:

  • Importing collision, casualty and vehicle tables
  • Temporal visualisation and aggregation
  • Spatial visualisation and aggregation
  • Joining STATS19 tables
  • Spatial joins linking infrastructure to collisions

The course will be taught in R, a free and open-source programming language for data analysis and visualisation that excels at the kind of statistical modelling and visualisation workflows required for high-impact, reproducible and correct road safety research. The course will be taught by Professor Robin Lovelace, who has over a decade of experience teaching R for data science and is author of the popular book Geocomputation with R. You will learn how to add value to road traffic casualty date for more data-driven and effective interventions to save lives in relation to the largest cause of death for young people worldwide, as highlighted in the map below.

Road danger levels worldwide in 2016. Data source: World Bank. Reproducible source code: Reproducible Road Safety Research with R, freely available at itsleeds.github.io/rrsrr/.

Who is this for?

The workshop is aimed at anyone interested in road safety research, especially for students, researchers, and practitioners who are already working with road safety data and who would like to improve their data science skills for more reproducible and impactful research.

You are welcome to sign-up and learn from the session if you are new to R or if you want to work through the practical content in another language such as Python or Julia. However, the session will be most useful if you have some prior experience with R and RStudio (see links below for recommended reading and places to learn R). We highly recommend that attendees already use R or dedicate some time to learning the basics of R before the session.

Prerequisites

Attendees should have the following before signing-up:

  • Basic familiarity with R and RStudio or expertise in another programming language for data science
  • A laptop with R and RStudio installed (or VS Code with the R extension or similar for advanced users)
  • A willingness to learn (see recommended reading) and share knowledge

Where and when

2nd September 2025, 14:00-16:00, Room 1.11, ITS, University of Leeds

Sign-up!

Sign-up (£50) at the University of Leeds conference website at eu.eventscloud.com/ereg/newreg.php?eventid=200280778

References

Lovelace, R., 2020. Reproducible road safety research with R. RAC Foundation.
Lovelace, R., Nowosad, J., Münchow, J., 2025. Geocomputation with R. CRC Press.
Lovelace, R., Roberts, H., Kellar, I., 2016. Who, where, when: The demographic and geographic distribution of bicycle crashes in west yorkshire. Transportation Research Part F: Traffic Psychology and Behaviour, Bicycling and bicycle safety 41, Part B. https://doi.org/10.1016/j.trf.2015.02.010
Tait, C., Beecham, R., Lovelace, R., Barber, S., 2024. Build it but will they come? Exploring the impact of introducing contraflow cycling on cycling volumes with crowd-sourced data. Journal of Transport & Health 35, 101758. https://doi.org/10.1016/j.jth.2024.101758
Tait, C., Beecham, R., Lovelace, R., Barber, S., 2023. Contraflows and cycling safety: Evidence from 22 years of data involving 508 one-way streets. Accident Analysis & Prevention 179, 106895. https://doi.org/10.1016/j.aap.2022.106895
Vidal Tortosa, E., Lovelace, R., Heinen, E., Mann, R.P., 2021. Socioeconomic inequalities in cycling safety: An analysis of cycling injury risk by residential deprivation level in England. Journal of Transport & Health 23, 101291. https://doi.org/10.1016/j.jth.2021.101291